supplymind / scripts

Commit History

Align warehouse GRPO prompt with SFT
0ac1e98
verified

rishavutk commited on

Fix adapter eval for 3B notebooks
c66fcbe
verified

rishavutk commited on

Expose GRPO generation batch size
87004db
verified

rishavutk commited on

Add 4bit SFT training knobs
eee84e5
verified

rishavutk commited on

Add 4bit GRPO training knobs
bbb6307
verified

rishavutk commited on

Allow configurable SFT base model
6d2f03a
verified

rishavutk commited on

Allow configurable eval base model
bfc3f5b
verified

rishavutk commited on

Allow configurable GRPO base model
130e9e4
verified

rishavutk commited on

Add conservative warehouse SFT evidence
cb5e5bf

Rishav commited on

Bias center SFT toward action states
939eba0

Rishav commited on

Tighten role training scaffold
a2144da

Rishav commited on

Add eval action diagnostics
ff949e6

Rishav commited on

Split dashboard curves by training phase
7579e54

Rishav commited on

Add SFT warm start training pipeline
be8d222

Rishav commited on

Add HF adapter evaluation job
45cc878

Rishav commited on

Disable Trackio checkpoint sync
aa7d416

Rishav commited on

Tune GRPO completion length
cce2fba

Rishav commited on

Make GRPO config version tolerant
15084c8

Rishav commited on

Fix HF training config resolution
d77a60c

Rishav commited on

Harden HF training startup
1cd6456

Rishav commited on

Add role-specific training scores
37049ad

Rishav commited on

Add training progress logs
a7160ed

Rishav commited on

Fix training smoke task id
5ca7724

Rishav commited on

Add HF role GRPO training job
3f1eabc

Rishav commited on

Prepare SupplyMind finale submission
9432cbb

Rishav commited on

Add SupplyMind V2 multi-agent environment
d5184f8

Rishav commited on

Add central procurement and subagent replay
a0d2d1a

Rishav commited on

Initial SupplyMind environment
a18f6ce

Rishav commited on