SpecBundle Collection A collection of production-grade draft models for speculative decoding • 18 items • Updated Apr 15 • 18
Domino: Decoupling Causal Modeling from Autoregressive Drafting in Speculative Decoding Paper • 2605.29707 • Published 11 days ago • 140
Domino Collection Domino: Decoupling Causal Modeling from Autoregressive Drafting in Speculative Decoding • 3 items • Updated 5 days ago • 2
DFlash Collection Block Diffusion for Flash Speculative Decoding • 21 items • Updated 28 days ago • 127
view article Article Continuous batching from first principles +1 ror, ArthurZ, mcpotato • Nov 25, 2025 • 402
Innovator-VL: A Multimodal Large Language Model for Scientific Discovery Paper • 2601.19325 • Published Jan 27 • 82
Are We on the Right Way to Assessing LLM-as-a-Judge? Paper • 2512.16041 • Published Dec 17, 2025 • 35
MMaDA-Parallel: Multimodal Large Diffusion Language Models for Thinking-Aware Editing and Generation Paper • 2511.09611 • Published Nov 12, 2025 • 72
AI for Service: Proactive Assistance with AI Glasses Paper • 2510.14359 • Published Oct 16, 2025 • 77