Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
Spaces:
InosLihka
/
rhythm_env
like
0
Running
App
Files
Files
Community
Fetching metadata from the HF Docker repository...
main
rhythm_env
1.88 MB
Ctrl+K
Ctrl+K
3 contributors
History:
57 commits
InosLihka
Clarify documentation: anomaly signal explainer, GRPO scope notes
361aed7
14 days ago
docs
Clarify documentation: anomaly signal explainer, GRPO scope notes
14 days ago
plots
Add SFT v3 + GRPO refine results to README + results.md
16 days ago
scripts
Add SFT v3 + GRPO refine results to README + results.md
16 days ago
server
Refactor grader to use openenv.core.rubrics.WeightedSum + Rubric subclasses
18 days ago
tests
Refactor grader to use openenv.core.rubrics.WeightedSum + Rubric subclasses
18 days ago
training
Clarify documentation: anomaly signal explainer, GRPO scope notes
14 days ago
ui
refactor: rewrite blog around product vision; fix UI for Gradio 6
20 days ago
.dockerignore
Safe
92 Bytes
Initial commit: RhythmEnv daily planning RL environment
about 1 month ago
.env.example
Safe
441 Bytes
Algorithm Distillation: grader v2 with belief_accuracy + SFT pipeline
19 days ago
.gitattributes
Safe
218 Bytes
Post-deadline: full eval results + bigger plots via Git LFS
18 days ago
.gitignore
Safe
211 Bytes
Clarify documentation: anomaly signal explainer, GRPO scope notes
14 days ago
BLOG.md
Safe
9.48 kB
Move blog to root as BLOG.md (per Meta mentor guidance)
19 days ago
Dockerfile
Safe
1.49 kB
Fix HF Space README rendering + Dockerfile encoding
14 days ago
README.md
Safe
23.2 kB
Clarify documentation: anomaly signal explainer, GRPO scope notes
14 days ago
__init__.py
Safe
724 Bytes
env: enrich observation with history, anomalies, and discovery bonus
20 days ago
client.py
Safe
5.04 kB
client: surface ALL observation fields (was dropping deltas, anomalies, last_action, step_history)
19 days ago
eval_baselines_v2.json
Safe
284 Bytes
Algorithm Distillation: grader v2 with belief_accuracy + SFT pipeline
19 days ago
inference.py
Safe
13.4 kB
iter4: fix the 'constant belief = free reward' bug + 6 other deep issues
19 days ago
models.py
Safe
4.17 kB
Algorithm Distillation: grader v2 with belief_accuracy + SFT pipeline
19 days ago
openenv.yaml
Safe
93 Bytes
Initial commit: RhythmEnv daily planning RL environment
about 1 month ago
pyproject.toml
Safe
909 Bytes
Rebuild as Life Simulator: 5 meters, 3 hidden profiles, GRPO training pipeline
21 days ago
uv.lock
Safe
576 kB
Initial commit: RhythmEnv daily planning RL environment
about 1 month ago