Commit History

Upload ml_debug_env_grpo_fixed.ipynb
846b198
verified

rak2315 committed on

Upload BLOG.md
b7357e2
verified

rak2315 committed on

Update README.md
95e314f
verified

rak2315 committed on

Update README.md
3773ce3
verified

rak2315 committed on

Upload BLOG.md
17d1efe
verified

rak2315 committed on

exclude images from hf space
1648f17

rak2315 committed on

track pngs with lfs
c436119

rak2315 committed on

add demo and reward curve images
b125241

rak2315 committed on

fix: one-liner curl commands for Windows CMD
fdad37f

rak2315 committed on

fix: module-level session store for cross-instance state
1099086

rak2315 committed on

fix: stateless demo using /grader endpoint
f350210

rak2315 committed on

fix: single worker to preserve session state
034f196

rak2315 committed on

add images, update readme, fix gitignore
7952623

rak2315 committed on

interactive demo landing page with curl commands
ffca774

rak2315 committed on

permanently remove images from HF Space tracking
a276060

rak2315 committed on

fix grader: other type penalty, gradient_not_zeroed loss check
c40e050

rak2315 committed on

fix image url
e630776

rak2315 committed on

remove images from HF Space, add to gitignore
0c87549

rak2315 committed on

update README with story arc, partial obs, 8 tasks, training results
e333b46

rak2315 committed on

add GRPO training notebook
78e60dc

rak2315 committed on

fix landing page route to /ui
2c2d4dc

rak2315 committed on

add landing page, blog link, HF Space UI
20d67a0

rak2315 committed on

Block A B C: partial observability, LLM judge, adversarial scheduler
49aa3ca

rak2315 committed on

v3: compound tasks, hardened graders, other type, 8 tasks total
6d9a8b2

rak2315 committed on

expand README with layman explanations and full file docs
5ce646c

rak2315 committed on

update README for 6 tasks
4e95b25

rak2315 committed on

remove hardcoded api key
f2b139b

rak2315 committed on

add 6 tasks, fix log format, multi-turn retry, grader improvements
4108ae8

rak2315 committed on

fix: scores strictly between 0 and 1 exclusive
ffa0040

rak2315 committed on

fix: inference.py calls LLM proxy directly
e749fdf

rak2315 committed on

fix: use injected API_KEY and API_BASE_URL
6d2b53f

rak2315 committed on

add Dockerfile to root for HF Space
08b8053

rak2315 committed on

fix: 20/20 all tasks 1.0
63eddc8

rak2315 committed on

fix: use API_BASE_URL and API_KEY env vars for LLM proxy
645efc4

rak2315 committed on

fix: self-contained inference.py, no network dependency
2a87ebe

rak2315 committed on

fix: emit [START]/[STEP]/[END] structured output for Phase 2 validator
d92195b

rak2315 committed on

Fix inference.py to hit deployed HF Space baseline endpoint
ff42cc0

rak2315 committed on

Add inference.py for hackathon checker
8abdf62

rak2315 committed on

Add gitignore
c502790

rak2315 committed on

ML Debug Environment - OpenEnv Hackathon submission
70a9d5e

rak2315 committed on