jinrui123/llmrnn-grpo-phase1-v2judge-run01-qwen3-8b-global-step-17-merged • Text Generation • 8B • Updated about 1 month ago • 6
jinrui123/llmrnn-grpo-phase1-v2judge-run02-global-step-340-merged • Text Generation • 3B • Updated May 13 • 1