🚀 Qwen-MTP Collection ⚡ MTP (Multi Token Prediction) speculative decoding enables models like Qwen3.6 to have ~1.4-2.2x faster generation with no change in accuracy. • 6 items • Updated 4 days ago • 15
💻 Qwopus-Coder Collection Reasoning-distilled coding models optimized for specialized domains like agentic workflows. • 4 items • Updated 4 days ago • 5
SpreadsheetLLM: Encoding Spreadsheets for Large Language Models Paper • 2407.09025 • Published Jul 12, 2024 • 140
HINT: Hypernetwork Instruction Tuning for Efficient Zero-Shot Generalisation Paper • 2212.10315 • Published Dec 20, 2022 • 1