Xin Li

lixin67

WilliamLeeBravo

AI & ML interests

None yet

Recent Activity

upvoted a paper 10 days ago

Aguvis: Unified Pure Vision Agents for Autonomous GUI Interaction

upvoted a paper 14 days ago

UltraFlux: Data-Model Co-Design for High-quality Native 4K Text-to-Image Generation across Diverse Aspect Ratios

upvoted a paper 23 days ago

DeepEyesV2: Toward Agentic Multimodal Model

View all activity

Organizations

None yet

upvoted a paper 10 days ago

Aguvis: Unified Pure Vision Agents for Autonomous GUI Interaction

Paper • 2412.04454 • Published Dec 5, 2024 • 72

upvoted a paper 14 days ago

UltraFlux: Data-Model Co-Design for High-quality Native 4K Text-to-Image Generation across Diverse Aspect Ratios

Paper • 2511.18050 • Published 16 days ago • 37

upvoted 2 papers 23 days ago

DeepEyesV2: Toward Agentic Multimodal Model

Paper • 2511.05271 • Published Nov 7 • 42

Scaling Agent Learning via Experience Synthesis

Paper • 2511.03773 • Published Nov 5 • 80

upvoted a collection 26 days ago

Bee

Collection

7 items • Updated Nov 5 • 10

upvoted a paper 28 days ago

Bee: A High-Quality Corpus and Full-Stack Suite to Unlock Advanced Fully Open MLLMs

Paper • 2510.13795 • Published Oct 15 • 56

liked a dataset 28 days ago

Open-Bee/Honey-Data-15M

Viewer • Updated Nov 5 • 14.8M • 95.4k • 101

liked a model about 1 month ago

moonshotai/Kimi-K2-Thinking

Text Generation • Updated about 1 month ago • 391k • • 1.51k

upvoted 3 articles about 1 month ago

Article

What makes good reasoning data

Oct 30

•

Article

Why Did MiniMax M2 End Up as a Full Attention Model?

Oct 30

•

Article

MiniMax-01 is Now Open-Source: Scaling Lightning Attention for the AI Agent Era

Jan 15

•

upvoted 2 papers about 1 month ago

LongCat-Flash-Omni Technical Report

Paper • 2511.00279 • Published Oct 31 • 22

Revisiting Multimodal Positional Encoding in Vision-Language Models

Paper • 2510.23095 • Published Oct 27 • 20

upvoted 2 articles about 1 month ago

Article

Unlock the power of images with AI Sheets

Oct 21

•

Article

Streaming datasets: 100x More Efficient

Oct 27

•

upvoted a paper about 1 month ago

Kimi Linear: An Expressive, Efficient Attention Architecture

Paper • 2510.26692 • Published Oct 30 • 116

upvoted 2 papers about 2 months ago

From Pixels to Words -- Towards Native Vision-Language Primitives at Scale

Paper • 2510.14979 • Published Oct 16 • 65

SRUM: Fine-Grained Self-Rewarding for Unified Multimodal Models

Paper • 2510.12784 • Published Oct 14 • 19

liked a model about 2 months ago

Qwen/Qwen3-VL-4B-Instruct

Image-Text-to-Text • 4B • Updated Oct 15 • 802k • 255

upvoted a collection 2 months ago

Qwen3-VL

Collection

37 items • Updated Nov 1 • 492

Xin Li

AI & ML interests

Recent Activity

Organizations

lixin67's activity

What makes good reasoning data

Why Did MiniMax M2 End Up as a Full Attention Model?

MiniMax-01 is Now Open-Source: Scaling Lightning Attention for the AI Agent Era

Unlock the power of images with AI Sheets

Streaming datasets: 100x More Efficient