Spaces:
Sleeping
Sleeping
Commit History
Fix adapter eval for 3B notebooks c66fcbe verified
Expose GRPO generation batch size 87004db verified
Add 4bit SFT training knobs eee84e5 verified
Add 4bit GRPO training knobs bbb6307 verified
Allow configurable SFT base model 6d2f03a verified
Allow configurable eval base model bfc3f5b verified
Allow configurable GRPO base model 130e9e4 verified
Add conservative warehouse SFT evidence cb5e5bf
Rishav commited on
Bias center SFT toward action states 939eba0
Rishav commited on
Tighten role training scaffold a2144da
Rishav commited on
Add eval action diagnostics ff949e6
Rishav commited on
Split dashboard curves by training phase 7579e54
Rishav commited on
Add SFT warm start training pipeline be8d222
Rishav commited on
Add HF adapter evaluation job 45cc878
Rishav commited on
Disable Trackio checkpoint sync aa7d416
Rishav commited on
Tune GRPO completion length cce2fba
Rishav commited on
Make GRPO config version tolerant 15084c8
Rishav commited on
Fix HF training config resolution d77a60c
Rishav commited on
Harden HF training startup 1cd6456
Rishav commited on
Add role-specific training scores 37049ad
Rishav commited on
Add training progress logs a7160ed
Rishav commited on
Fix training smoke task id 5ca7724
Rishav commited on
Add HF role GRPO training job 3f1eabc
Rishav commited on
Prepare SupplyMind finale submission 9432cbb
Rishav commited on
Add SupplyMind V2 multi-agent environment d5184f8
Rishav commited on
Add central procurement and subagent replay a0d2d1a
Rishav commited on
Initial SupplyMind environment a18f6ce
Rishav commited on