Spaces: InosLihka/rhythm_env

1.88 MB

3 contributors

History: 57 commits

Clarify documentation: anomaly signal explainer, GRPO scope notes

361aed7 14 days ago

Name | Size | Last commit | Updated
docs | - | Clarify documentation: anomaly signal explainer, GRPO scope notes | 14 days ago
plots | - | Add SFT v3 + GRPO refine results to README + results.md | 16 days ago
scripts | - | Add SFT v3 + GRPO refine results to README + results.md | 16 days ago
server | - | Refactor grader to use openenv.core.rubrics.WeightedSum + Rubric subclasses | 18 days ago
tests | - | Refactor grader to use openenv.core.rubrics.WeightedSum + Rubric subclasses | 18 days ago
training | - | Clarify documentation: anomaly signal explainer, GRPO scope notes | 14 days ago
ui | - | refactor: rewrite blog around product vision; fix UI for Gradio 6 | 20 days ago
.dockerignore | 92 Bytes | Initial commit: RhythmEnv daily planning RL environment | about 1 month ago
.env.example | 441 Bytes | Algorithm Distillation: grader v2 with belief_accuracy + SFT pipeline | 19 days ago
.gitattributes | 218 Bytes | Post-deadline: full eval results + bigger plots via Git LFS | 18 days ago
.gitignore | 211 Bytes | Clarify documentation: anomaly signal explainer, GRPO scope notes | 14 days ago
BLOG.md | 9.48 kB | Move blog to root as BLOG.md (per Meta mentor guidance) | 19 days ago
Dockerfile | 1.49 kB | Fix HF Space README rendering + Dockerfile encoding | 14 days ago
README.md | 23.2 kB | Clarify documentation: anomaly signal explainer, GRPO scope notes | 14 days ago
__init__.py | 724 Bytes | env: enrich observation with history, anomalies, and discovery bonus | 20 days ago
client.py | 5.04 kB | client: surface ALL observation fields (was dropping deltas, anomalies, last_action, step_history) | 19 days ago
eval_baselines_v2.json | 284 Bytes | Algorithm Distillation: grader v2 with belief_accuracy + SFT pipeline | 19 days ago
inference.py | 13.4 kB | iter4: fix the 'constant belief = free reward' bug + 6 other deep issues | 19 days ago
models.py | 4.17 kB | Algorithm Distillation: grader v2 with belief_accuracy + SFT pipeline | 19 days ago
openenv.yaml | 93 Bytes | Initial commit: RhythmEnv daily planning RL environment | about 1 month ago
pyproject.toml | 909 Bytes | Rebuild as Life Simulator: 5 meters, 3 hidden profiles, GRPO training pipeline | 21 days ago
uv.lock | 576 kB | Initial commit: RhythmEnv daily planning RL environment | about 1 month ago