A Survey on Hypothesis Generation for Scientific Discovery in the Era of Large Language Models • Paper • 2504.05496 • Published Apr 7, 2025
KaVa: Latent Reasoning via Compressed KV-Cache Distillation • Paper • 2510.02312 • Published Oct 2, 2025 • 1
Mixture of Tokens: Efficient LLMs through Cross-Example Aggregation • Paper • 2310.15961 • Published Oct 24, 2023 • 1
MoE-Mamba: Efficient Selective State Space Models with Mixture of Experts • Paper • 2401.04081 • Published Jan 8, 2024 • 73