docs: update README task list with detailed objective descriptions and expand 72b_eval.txt coverage fe3a0f1 Navigam commited on Apr 8
feat: expand task suite to 22 challenges and update reward signal mechanics 6392732 Navigam commited on Apr 8
refactor: enforce strictly positive reward range [0.01, 0.99] and update documentation accordingly 02fd062 Navigam commited on Apr 8
docs: update task list in README and add evaluation file to gitignore 1b76d65 Navigam commited on Apr 7
docs: add design and rubric documentation and update gitignore to exclude temporary task files cfb1b83 Navigam commited on Apr 7
feat: implement ReAct agent architecture with robust JSON parsing and expanded task suite cd7967c Navigam commited on Apr 7