VideoSeeker: Incentivizing Instance-level Video Understanding via Native Agentic Tool Invocation Paper • 2605.16079 • Published 10 days ago • 27
Running on Zero MCP Featured 1.49k Qwen-Image-Edit-2511-LoRAs-Fast 🎃 1.49k Demo of the Collection of Qwen Image Edit LoRAs
Running on Zero MCP 78 Qwen Image Edit 2509 LoRAs Fast ⚡ 78 Demo of the Collection of Qwen Image Editing LoRAs
MMSkills: Towards Multimodal Skills for General Visual Agents Paper • 2605.13527 • Published 11 days ago • 117
Running on Zero MCP 2.64k Wan2.2 14B Preview 🐌 2.64k generate a video from an image with a text prompt
The Invisible Leash: Why RLVR May Not Escape Its Origin Paper • 2507.14843 • Published Jul 20, 2025 • 85
MiroMind-M1: An Open-Source Advancement in Mathematical Reasoning via Context-Aware Multi-Stage Policy Optimization Paper • 2507.14683 • Published Jul 19, 2025 • 136
GUI-G^2: Gaussian Reward Modeling for GUI Grounding Paper • 2507.15846 • Published Jul 21, 2025 • 135