MoZoo: Unleashing Video Diffusion power in animal fur and muscle simulation Paper • 2605.13857 • Published Apr 8 • 2
PRISM: A Multi-Dimensional Benchmark for Evaluating LLM Peer Reviewers Paper • 2605.26730 • Published 11 days ago • 15
RUBRIC-ARROW: Alternating Pointwise Rubric Reward Modeling for LLM Post-training in Non-verifiable Domains Paper • 2605.29156 • Published 11 days ago • 13
Reflective Prompt Tuning through Language Model Function-Calling Paper • 2605.21781 • Published 18 days ago • 9
Harness-1: Reinforcement Learning for Search Agents with State-Externalizing Harnesses Paper • 2606.02373 • Published 6 days ago • 43