Narrative-Driven Paper-to-Slide Generation via ArcDeck Paper • 2604.11969 • Published 10 days ago • 7
Lightning OPD: Efficient Post-Training for Large Reasoning Models with Offline On-Policy Distillation Paper • 2604.13010 • Published 9 days ago • 12
How to Fine-Tune a Reasoning Model? A Teacher-Student Cooperation Framework to Synthesize Student-Consistent SFT Data Paper • 2604.14164 • Published about 1 month ago • 34
AccelOpt: A Self-Improving LLM Agentic System for AI Accelerator Kernel Optimization Paper • 2511.15915 • Published 8 days ago • 3
InCoder-32B-Thinking: Industrial Code World Model for Thinking Paper • 2604.03144 • Published 20 days ago • 231
GrandCode: Achieving Grandmaster Level in Competitive Programming via Agentic Reinforcement Learning Paper • 2604.02721 • Published 20 days ago • 364
Adam's Law: Textual Frequency Law on Large Language Models Paper • 2604.02176 • Published 21 days ago • 485
Kronos: A Foundation Model for the Language of Financial Markets Paper • 2508.02739 • Published Aug 2, 2025 • 24
Nemotron-Cascade 2: Post-Training LLMs with Cascade RL and Multi-Domain On-Policy Distillation Paper • 2603.19220 • Published Mar 19 • 66
InCoder-32B: Code Foundation Model for Industrial Scenarios Paper • 2603.16790 • Published Mar 17 • 308
Thinking to Recall: How Reasoning Unlocks Parametric Knowledge in LLMs Paper • 2603.09906 • Published Mar 10 • 75
Supervised Fine-Tuning versus Reinforcement Learning: A Study of Post-Training Methods for Large Language Models Paper • 2603.13985 • Published Mar 14 • 10
FinToolBench: Evaluating LLM Agents for Real-World Financial Tool Use Paper • 2603.08262 • Published Mar 9 • 42