DelTA: Discriminative Token Credit Assignment for Reinforcement Learning from Verifiable Rewards Paper • 2605.21467 • Published 5 days ago • 192
Where Does Authorship Signal Emerge in Encoder-Based Language Models? Paper • 2605.19908 • Published 6 days ago • 5
iTryOn: Mastering Interactive Video Virtual Try-On with Spatial-Semantic Guidance Paper • 2605.21431 • Published 5 days ago • 2
Video2GUI: Synthesizing Large-Scale Interaction Trajectories for Generalized GUI Agent Pretraining Paper • 2605.14747 • Published 11 days ago • 143
Multi-Objective and Mixed-Reward Reinforcement Learning via Reward-Decorrelated Policy Optimization Paper • 2605.13641 • Published 12 days ago • 48
Mean Mode Screaming: Mean--Variance Split Residuals for 1000-Layer Diffusion Transformers Paper • 2605.06169 • Published 18 days ago • 215
OpenSearch-VL: An Open Recipe for Frontier Multimodal Search Agents Paper • 2605.05185 • Published 19 days ago • 100
Heterogeneous Scientific Foundation Model Collaboration Paper • 2604.27351 • Published 25 days ago • 218
TexOCR: Advancing Document OCR Models for Compilable Page-to-LaTeX Reconstruction Paper • 2604.22880 • Published Apr 24 • 9
ClawsBench: Evaluating Capability and Safety of LLM Productivity Agents in Simulated Workspaces Paper • 2604.05172 • Published Apr 6 • 24
Adam's Law: Textual Frequency Law on Large Language Models Paper • 2604.02176 • Published Apr 2 • 503