赵沐宇's picture

赵沐宇

lukeleevi

AI & ML interests

Open-source multimodal experiment workflows. Interested in robust deployment.

Recent Activity

upvoted a paper 27 minutes ago

Imaginative Perception Tokens Enhance Spatial Reasoning in Multimodal Language Models

upvoted a paper 5 days ago

Comprehensive Benchmarking of Long-Form Speech Generation in Diverse Scenarios

upvoted a paper 6 days ago

3DCodeBench: Benchmarking Agentic Procedural 3D Modeling Via Code

View all activity

Organizations

None yet

upvoted a paper 27 minutes ago

Imaginative Perception Tokens Enhance Spatial Reasoning in Multimodal Language Models

Paper • 2606.03988 • Published 6 days ago • 61

upvoted a paper 5 days ago

Comprehensive Benchmarking of Long-Form Speech Generation in Diverse Scenarios

Paper • 2605.28618 • Published 13 days ago • 31

upvoted a paper 6 days ago

3DCodeBench: Benchmarking Agentic Procedural 3D Modeling Via Code

Paper • 2606.01057 • Published 9 days ago • 7

upvoted a paper 7 days ago

Beyond Final Answers: Auditing Trajectory-Level Hallucinations in Multi-Agent Industrial Workflows

Paper • 2605.24219 • Published 14 days ago • 9

upvoted 2 papers 18 days ago

Perception or Prejudice: Can MLLMs Go Beyond First Impressions of Personality?

Paper • 2605.22109 • Published 19 days ago • 169

SQuTR: A Robustness Benchmark for Spoken Query to Text Retrieval under Acoustic Noise

Paper • 2602.12783 • Published Feb 13 • 246

upvoted a paper about 1 month ago

Recursive Multi-Agent Systems

Paper • 2604.25917 • Published Apr 28 • 276

upvoted a paper about 2 months ago

GameWorld: Towards Standardized and Verifiable Evaluation of Multimodal Game Agents

Paper • 2604.07429 • Published Apr 8 • 121

upvoted a paper 2 months ago

ClawKeeper: Comprehensive Safety Protection for OpenClaw Agents Through Skills, Plugins, and Watchers

Paper • 2603.24414 • Published Mar 25 • 183

upvoted a paper 3 months ago

SocialOmni: Benchmarking Audio-Visual Social Interactivity in Omni Models

Paper • 2603.16859 • Published Mar 17 • 249