Reaching Beyond the Mode: RL for Distributional Reasoning in Language Models Paper • 2603.24844 • Published 5 days ago • 7
A Probabilistic Inference Approach to Inference-Time Scaling of LLMs using Particle-Based Monte Carlo Methods Paper • 2502.01618 • Published Feb 3, 2025 • 10