SQuTR: A Robustness Benchmark for Spoken Query to Text Retrieval under Acoustic Noise Paper • 2602.12783 • Published Feb 13 • 246
Anti-Self-Distillation for Reasoning RL via Pointwise Mutual Information Paper • 2605.11609 • Published May 12 • 195
SkillsVote: Lifecycle Governance of Agent Skills from Collection, Recommendation to Evolution Paper • 2605.18401 • Published May 18 • 127
RealICU: Do LLM Agents Understand Long-Context ICU Data? A Benchmark Beyond Behavior Imitation Paper • 2605.13542 • Published May 13 • 8
UniPath: Adaptive Coordination of Understanding and Generation for Unified Multimodal Reasoning Paper • 2605.11400 • Published May 12 • 5
From Context to Skills: Can Language Models Learn from Context Skillfully? Paper • 2604.27660 • Published May 3 • 169
Repurposing 3D Generative Model for Autoregressive Layout Generation Paper • 2604.16299 • Published Apr 17 • 12
Adam's Law: Textual Frequency Law on Large Language Models Paper • 2604.02176 • Published Apr 2 • 507
SkillX: Automatically Constructing Skill Knowledge Bases for Agents Paper • 2604.04804 • Published Apr 6 • 35
An Efficient Heterogeneous Co-Design for Fine-Tuning on a Single GPU Paper • 2603.16428 • Published Mar 17 • 51
When Numbers Speak: Aligning Textual Numerals and Visual Instances in Text-to-Video Diffusion Models Paper • 2604.08546 • Published Apr 9 • 115
Tex3D: Objects as Attack Surfaces via Adversarial 3D Textures for Vision-Language-Action Models Paper • 2604.01618 • Published Apr 2 • 15
LongTail Driving Scenarios with Reasoning Traces: The KITScenes LongTail Dataset Paper • 2603.23607 • Published Mar 24 • 20