Spaces: InosLihka/rhythm_env
main: rhythm_env/plots
3 contributors, History: 4 commits
Latest commit: InosLihka, "Add SFT v3 + GRPO refine results to README + results.md" (666b4ce), 18 days ago
File                                Size     Badge  Updated      Last commit
README.md                           1.21 kB  Safe   20 days ago  Add plots/ folder: SFT v3 loss + GRPO iter2 reward curves
grpo_iter2_baseline_vs_trained.png  66.8 kB  Safe   20 days ago  Add plots/ folder: SFT v3 loss + GRPO iter2 reward curves
grpo_iter2_belief_accuracy.png      189 kB   xet    19 days ago  Post-deadline: full eval results + bigger plots via Git LFS
grpo_iter2_reward_components.png    263 kB   xet    19 days ago  Post-deadline: full eval results + bigger plots via Git LFS
grpo_iter2_reward_curve.png         179 kB   xet    19 days ago  Post-deadline: full eval results + bigger plots via Git LFS
grpo_iter2_training_loss.png        92.5 kB  Safe   20 days ago  Add plots/ folder: SFT v3 loss + GRPO iter2 reward curves
sft_grpo_comparison.png             34.9 kB  Safe   18 days ago  Add SFT v3 + GRPO refine results to README + results.md
sft_v3_baseline_vs_trained.png      39.8 kB  Safe   19 days ago  Post-deadline: full eval results + bigger plots via Git LFS
sft_v3_training_loss.png            38 kB    Safe   20 days ago  Add plots/ folder: SFT v3 loss + GRPO iter2 reward curves