view post Post 11428 deepseek-ai/DeepSeek-OCR is out! 🔥 my take ⤵️ > pretty insane it can parse and re-render charts in HTML> it uses CLIP and SAM features concatenated, so better grounding> very efficient per vision tokens/performance ratio> covers 100 languages See translation
Apr 7 Releases FINAL-Bench/Darwin-4B-Opus Text Generation • Updated 6 days ago • 694 • 16 nvidia/nemocurator-speech-bandwidth-filter Updated 15 days ago • 17 ACE-Step/acestep-v15-xl-turbo Text-to-Audio • 5B • Updated 10 days ago • 4.66k • 117 EasonXiao-888/SpatialEdit-16B Image-Text-to-Image • Updated 9 days ago • 92 • 15
Apr 3 Releases netflix/void-model Video-to-Video • Updated 11 days ago • 863 arcee-ai/Trinity-Large-Thinking Text Generation • 399B • Updated 8 days ago • 18.6k • • 156 KRAFTON/Raon-VisionEncoder Feature Extraction • Updated 16 days ago • 523 • 18 KRAFTON/Raon-SpeechChat-9B Audio-to-Audio • 10B • Updated 4 days ago • 954 • 28
Apr 7 Releases FINAL-Bench/Darwin-4B-Opus Text Generation • Updated 6 days ago • 694 • 16 nvidia/nemocurator-speech-bandwidth-filter Updated 15 days ago • 17 ACE-Step/acestep-v15-xl-turbo Text-to-Audio • 5B • Updated 10 days ago • 4.66k • 117 EasonXiao-888/SpatialEdit-16B Image-Text-to-Image • Updated 9 days ago • 92 • 15
Apr 3 Releases netflix/void-model Video-to-Video • Updated 11 days ago • 863 arcee-ai/Trinity-Large-Thinking Text Generation • 399B • Updated 8 days ago • 18.6k • • 156 KRAFTON/Raon-VisionEncoder Feature Extraction • Updated 16 days ago • 523 • 18 KRAFTON/Raon-SpeechChat-9B Audio-to-Audio • 10B • Updated 4 days ago • 954 • 28
Running on CPU Upgrade Agents 18 Daggr Image To 3d 👀 Convert images into 3D assets with background removal and enhancement