Penguin-VL: Exploring the Efficiency Limits of VLM with LLM-based Vision Encoders Paper • 2603.06569 • Published Mar 6 • 120
view article Article GGML and llama.cpp join HF to ensure the long-term progress of Local AI +4 ggerganov, ngxson, allozaur, lysandre, victor, julien-c • Feb 20 • 506
view article Article The Future of the Global Open-Source AI Ecosystem: From DeepSeek to AI+ huggingface • Feb 3 • 53
FlashLabs Chroma 1.0: A Real-Time End-to-End Spoken Dialogue Model with Personalized Voice Cloning Paper • 2601.11141 • Published Jan 16 • 23
Live Avatar: Streaming Real-time Audio-Driven Avatar Generation with Infinite Length Paper • 2512.04677 • Published Dec 4, 2025 • 178
Transition Matching Distillation for Fast Video Generation Paper • 2601.09881 • Published Jan 14 • 34
view article Article Scaling Real-Time Voice Agents with Cache-Aware Streaming ASR nvidia • Jan 5 • 87
V-RGBX: Video Editing with Accurate Controls over Intrinsic Properties Paper • 2512.11799 • Published Dec 12, 2025 • 30
EgoEdit: Dataset, Real-Time Streaming Model, and Benchmark for Egocentric Video Editing Paper • 2512.06065 • Published Dec 5, 2025 • 29
view article Article Building for an Open Future - our new partnership with Google Cloud jeffboudier, pagezyhf • Nov 13, 2025 • 48
view article Article 20x Faster TRL Fine-tuning with RapidFire AI +1 kbigdelysh, arunkk09, qgallouedec • Nov 21, 2025 • 27
SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model Paper • 2502.02737 • Published Feb 4, 2025 • 260