Nanbeige4-3B Technical Report: Exploring the Frontier of Small Language Models Paper • 2512.06266 • Published Dec 6, 2025 • 7
WorldVQA: Measuring Atomic World Knowledge in Multimodal Large Language Models Paper • 2602.02537 • Published 22 days ago • 6
PersonaMem-v2: Towards Personalized Intelligence via Learning Implicit User Personas and Agentic Memory Paper • 2512.06688 • Published Dec 7, 2025 • 1
Know Me, Respond to Me: Benchmarking LLMs for Dynamic User Profiling and Personalized Responses at Scale Paper • 2504.14225 • Published Apr 19, 2025 • 1
No Global Plan in Chain-of-Thought: Uncover the Latent Planning Horizon of LLMs Paper • 2602.02103 • Published 17 days ago • 70
Ming-V2 Collection Ming is the multi-modal series of any-to-any models developed by Ant Ling team. • 14 items • Updated 5 days ago • 34
MOSS-Audio-Tokenizer: Scaling Audio Tokenizers for Future Audio Foundation Models Paper • 2602.10934 • Published 8 days ago • 49
Zooming without Zooming: Region-to-Image Distillation for Fine-Grained Multimodal Perception Paper • 2602.11858 • Published 7 days ago • 56
Distribution Matching Distillation Meets Reinforcement Learning Paper • 2511.13649 • Published Nov 17, 2025 • 4
Decoupled DMD: CFG Augmentation as the Spear, Distribution Matching as the Shield Paper • 2511.22677 • Published Nov 27, 2025 • 33
Z-Image: An Efficient Image Generation Foundation Model with Single-Stream Diffusion Transformer Paper • 2511.22699 • Published Nov 27, 2025 • 238
LLaDA2.1: Speeding Up Text Diffusion via Token Editing Paper • 2602.08676 • Published 10 days ago • 66