arxiv:2505.11493
Yusu Qian
YusuQian
AI & ML interests
multimodal llm research
Recent Activity
upvoted a paper 2 days ago
MosaicMem: Hybrid Spatial Memory for Controllable Video World Models upvoted a paper 5 months ago
PRISM-Bench: A Benchmark of Puzzle-Based Visual Tasks with CoT Error
Detection upvoted a paper 5 months ago
Pico-Banana-400K: A Large-Scale Dataset for Text-Guided Image Editing