GenRecal: Generation after Recalibration from Large to Small Vision-Language Models Paper • 2506.15681 • Published Jun 18, 2025 • 42
patrickjohncyh/fashion-clip Zero-Shot Image Classification • 0.2B • Updated Sep 17, 2024 • 2.44M • 270
view article Article Introducing Training Cluster as a Service - a new collaboration with NVIDIA +1 Jun 11, 2025 • 27
Vision Language Models Papers 🖼️💬📝 Collection Papers about vision-language models, most important ones are on top of the list. • 27 items • Updated Apr 30, 2024 • 40
SmolVLA: A Vision-Language-Action Model for Affordable and Efficient Robotics Paper • 2506.01844 • Published Jun 2, 2025 • 157
SmolVLA Collection Small, efficient and light-weight VLAs pretrained on community datasets • 1 item • Updated Sep 5, 2025 • 32
Mobile ALOHA: Learning Bimanual Mobile Manipulation with Low-Cost Whole-Body Teleoperation Paper • 2401.02117 • Published Jan 4, 2024 • 33
view article Article SmolVLA: Efficient Vision-Language-Action Model trained on Lerobot Community Data +7 Jun 3, 2025 • 342