Commit History

sync training files
d5c7b07
verified

Pathikreet commited on

sync training files
ea9c69b
verified

Pathikreet commited on

fix: OOM — NUM_GENERATIONS 32→16, max_completion_length 300→200, expandable_segments
c0d3d54
verified

Pathikreet commited on

fix: add 3 missing hard tasks to _TASK_DIFFICULTY (322 prompts)
a6c22c8
verified

Pathikreet commited on

Fix: kl_coeff -> beta in colab
edeb6d6
verified

Pathikreet commited on

Fix: kl_coeff -> beta (correct TRL GRPOConfig param name)
a47b370
verified

Pathikreet commited on

Auto-detect username from token for adapter + run folder upload
0f17c96
verified

Pathikreet commited on

Fix colab: format reward +-0.15, temp=0.7, kl_coeff=0.1
dc41c9d
verified

Pathikreet commited on

Fix: start_training yield count (9), plt.close memory leak
83e0e06
verified

Pathikreet commited on

Fix restart loop: variant=secondary, theme back in Blocks()
49d99dc
verified

Pathikreet commited on

Stop button: write flag file, wait 120s for clean save, fallback terminate
3eb85d0
verified

Pathikreet commited on

Graceful stop: save weights on /app/stop_requested flag
65ac9f8
verified

Pathikreet commited on

Add Stop button + save plots on stop; fix _refresh unpack bug; fix theme deprecation
b36df55
verified

Pathikreet commited on

Fix hard_currency_conversion task ID in TRAIN_TASKS and EVAL_TASKS
e27253e
verified

Pathikreet commited on

Baseline: Qwen2.5-7B-Instruct untrained 2026-04-26_0443
c10ea23
verified

Pathikreet commited on

Add oversight eval + bump seeds medium×8 hard/long×20
4c61d1e
verified

Pathikreet commited on

Bump seeds: medium×8, hard/long×20 (322 prompts total)
abf8676
verified

Pathikreet commited on

Full metrics in live JSON: format/diff/ep_len history
3869f27
verified

Pathikreet commited on

Add format/difficulty/ep-length live panels + full metrics in JSON
1744e09
verified

Pathikreet commited on

Add loss tracking callback + reward/loss PNG savers
8877328
verified

Pathikreet commited on

Add live loss curve panel + wire into dashboard
0e7659b
verified

Pathikreet commited on

UI: generations default 16→32, max 16→64
5ba0976
verified

Pathikreet commited on

Run 3: temp=0.7, kl=0.1, format±0.15, no curriculum, 20 tasks, G=32
0bfa536
verified

Pathikreet commited on

Fix bf16 crash + 17 tasks / 160 prompts dataset
6912151
verified

Pathikreet commited on

Update UI defaults: epochs 3→6 (max 10), generations 8→16 (max 32)
d6d586a
verified

Pathikreet commited on

Baseline: Qwen2.5-7B-Instruct untrained 2026-04-25_1626
e7727fc
verified

Pathikreet commited on

Update root eval_baseline.py to 17 tasks + long-horizon + health retry
951f2d1
verified

Pathikreet commited on

Restore original requirements: gradio + torch ML stack
0cf30cc
verified

Pathikreet commited on

Fix requirements: restore gradio + UI deps for training space
ea3f11d
verified

Pathikreet commited on

Upload requirements.txt with huggingface_hub
0d9ca72
verified

Pathikreet commited on

Upload requirements.txt with huggingface_hub
145ee9f
verified

Pathikreet commited on

Upload requirements.txt with huggingface_hub
5aad667
verified

Pathikreet commited on

Upload training/train.py with huggingface_hub
8bc5cc4
verified

Pathikreet commited on

Upload training/eval_baseline.py with huggingface_hub
61ca2ab
verified

Pathikreet commited on

Upload train.py with huggingface_hub
a9752a6
verified

Pathikreet commited on

Upload train.py with huggingface_hub
d826205
verified

Pathikreet commited on

Upload app.py with huggingface_hub
b05aab5
verified

Pathikreet commited on

Upload eval_baseline.py with huggingface_hub
9b6e30c
verified

Pathikreet commited on

Upload train.py with huggingface_hub
2865f24
verified

Pathikreet commited on

Upload app.py with huggingface_hub
f809ebf
verified

Pathikreet commited on

Upload train.py with huggingface_hub
16366ba
verified

Pathikreet commited on

Upload requirements.txt with huggingface_hub
5b67fd3
verified

Pathikreet commited on

Upload app.py with huggingface_hub
65e338a
verified

Pathikreet commited on

Upload train.py with huggingface_hub
da5a42f
verified

Pathikreet commited on

Upload requirements.txt with huggingface_hub
d8c51de
verified

Pathikreet commited on

Upload Dockerfile with huggingface_hub
ed07a32
verified

Pathikreet commited on

Upload train.py with huggingface_hub
6913549
verified

Pathikreet commited on

Upload requirements.txt with huggingface_hub
79d2a0c
verified

Pathikreet commited on

Upload requirements.txt with huggingface_hub
f677dba
verified

Pathikreet commited on

Upload app.py with huggingface_hub
3f72106
verified

Pathikreet commited on