Auditing Multimodal LLM Raters: Central Tendency Bias in Clinical Ordinal Scoring Paper • 2605.16386 • Published 12 days ago • 3
Leveraging Verifier-Based Reinforcement Learning in Image Editing Paper • 2604.27505 • Published 23 days ago • 57
8Planetterraforming/parameter_golf_v13_fineweb_systemaware_core_control Updated 30 days ago • 118 • 1
Rethinking Generalization in Reasoning SFT: A Conditional Analysis on Optimization, Data, and Model Capability Paper • 2604.06628 • Published Apr 8 • 325
Adam's Law: Textual Frequency Law on Large Language Models Paper • 2604.02176 • Published Apr 2 • 503
GrandCode: Achieving Grandmaster Level in Competitive Programming via Agentic Reinforcement Learning Paper • 2604.02721 • Published Apr 3 • 629
MultiGen: Level-Design for Editable Multiplayer Worlds in Diffusion Game Engines Paper • 2603.06679 • Published Mar 30 • 6
SQuTR: A Robustness Benchmark for Spoken Query to Text Retrieval under Acoustic Noise Paper • 2602.12783 • Published Feb 13 • 246