Submitted by akhaliq 49 Teaching Large Language Models to Reason with Reinforcement Learning · 9 authors 2
Submitted by akhaliq 40 Chatbot Arena: An Open Platform for Evaluating LLMs by Human Preference · 11 authors 2
Submitted by akhaliq 24 LLMs in the Imaginarium: Tool Learning through Simulated Trial and Error · 5 authors 123 1
Submitted by akhaliq 18 Common 7B Language Models Already Possess Strong Math Capabilities · 8 authors 1
Submitted by akhaliq 6 Radiative Gaussian Splatting for Efficient X-ray Novel View Synthesis · 8 authors 330 1