Training Bayesian Neural Networks with Sparse Subspace Variational Inference Paper • 2402.11025 • Published Feb 16, 2024
Web2Code: A Large-scale Webpage-to-Code Dataset and Evaluation Framework for Multimodal LLMs Paper • 2406.20098 • Published Jun 28, 2024
On The Planning Abilities of OpenAI's o1 Models: Feasibility, Optimality, and Generalizability Paper • 2409.19924 • Published Sep 30, 2024 • 1
Turn-PPO: Turn-Level Advantage Estimation with PPO for Improved Multi-Turn RL in Agentic LLMs Paper • 2512.17008 • Published 8 days ago • 10
Turn-PPO: Turn-Level Advantage Estimation with PPO for Improved Multi-Turn RL in Agentic LLMs Paper • 2512.17008 • Published 8 days ago • 10