ToolRL: Reward is All Tool Learning Needs
emre can PRO
emrecanacikgoz
AI & ML interests
None yet
Organizations
models
18
emrecanacikgoz/Qwen2.5-7B-Instruct-ToolRL-grpo-cold
Updated
•
218
•
3
emrecanacikgoz/lorem-sft400-only
Updated
•
4
emrecanacikgoz/lorem-base
Updated
•
4
emrecanacikgoz/loremppo-sft-400
Updated
•
5
emrecanacikgoz/lorem-sft-400
Updated
•
5
emrecanacikgoz/SMARTAgent-Mistral-Small-24B-Instruct-2501
Updated
•
11
emrecanacikgoz/SMARTAgent-Mistral-Nemo-Instruct-2407
Updated
•
11
•
1
emrecanacikgoz/SMARTAgent-Mistral-7B-Instruct-v0.3
Updated
•
10
•
1
emrecanacikgoz/SMARTAgent-Llama-3.1-70B
Updated
•
6
emrecanacikgoz/SMARTAgent-Llama-3.1-8B
Updated
•
9
•
1