MMFormalizer: Multimodal Autoformalization in the Wild Paper • 2601.03017 • Published 12 days ago • 102
GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization Paper • 2601.05242 • Published 10 days ago • 193
Running on Zero Featured 265 granite-docling-258M demo 📝 265 Convert images to structured text and answer questions
Nested Browser-Use Learning for Agentic Information Seeking Paper • 2512.23647 • Published 20 days ago • 17
DiRL: An Efficient Post-Training Framework for Diffusion Language Models Paper • 2512.22234 • Published 26 days ago • 19
TurboDiffusion: Accelerating Video Diffusion Models by 100-200 Times Paper • 2512.16093 • Published Dec 18, 2025 • 93
SaitBurak/Qwen3-4B-Thinking-2507-DeepSeek-v3.2-Speciale-Code-Distill-Q4_K_S-GGUF 4B • Updated Dec 14, 2025 • 116
SaitBurak/Qwen3-4B-Thinking-2507-DeepSeek-v3.2-Speciale-Code-Distill-Q4_K_S-GGUF 4B • Updated Dec 14, 2025 • 116
InfiniteVL: Synergizing Linear and Sparse Attention for Highly-Efficient, Unlimited-Input Vision-Language Models Paper • 2512.08829 • Published Dec 9, 2025 • 19
StereoWorld: Geometry-Aware Monocular-to-Stereo Video Generation Paper • 2512.09363 • Published Dec 10, 2025 • 71
ToolOrchestra: Elevating Intelligence via Efficient Model and Tool Orchestration Paper • 2511.21689 • Published Nov 26, 2025 • 118
DeepSeek-V3.2: Pushing the Frontier of Open Large Language Models Paper • 2512.02556 • Published Dec 2, 2025 • 251
DeepSeekMath-V2: Towards Self-Verifiable Mathematical Reasoning Paper • 2511.22570 • Published Nov 27, 2025 • 88
TurkColBERT: A Benchmark of Dense and Late-Interaction Models for Turkish Information Retrieval Paper • 2511.16528 • Published Nov 20, 2025 • 22
Nemotron Elastic: Towards Efficient Many-in-One Reasoning LLMs Paper • 2511.16664 • Published Nov 20, 2025 • 26