5 31 1

alexiosss

Alexislhb

AI & ML interests

None yet

Recent Activity

upvoted a paper about 18 hours ago

Video-MME-v2: Towards the Next Stage in Benchmarks for Comprehensive Video Understanding

upvoted a paper 3 days ago

A Simple Baseline for Streaming Video Understanding

upvoted a paper 12 days ago

HopChain: Multi-Hop Data Synthesis for Generalizable Vision-Language Reasoning

View all activity

Organizations

upvoted a paper about 18 hours ago

Video-MME-v2: Towards the Next Stage in Benchmarks for Comprehensive Video Understanding

Paper • 2604.05015 • Published 3 days ago • 194

upvoted a paper 3 days ago

A Simple Baseline for Streaming Video Understanding

Paper • 2604.02317 • Published 7 days ago • 66

upvoted a paper 12 days ago

HopChain: Multi-Hop Data Synthesis for Generalizable Vision-Language Reasoning

Paper • 2603.17024 • Published 22 days ago • 107

upvoted a paper 14 days ago

CUA-Suite: Massive Human-annotated Video Demonstrations for Computer-Use Agents

Paper • 2603.24440 • Published 14 days ago • 95

upvoted a paper 16 days ago

Insight-V++: Towards Advanced Long-Chain Visual Reasoning with Multimodal Large Language Models

Paper • 2603.18118 • Published 21 days ago • 12

upvoted 2 papers 26 days ago

Trust Your Critic: Robust Reward Modeling and Reinforcement Learning for Faithful Image Editing and Generation

Paper • 2603.12247 • Published 27 days ago • 23

Video-Based Reward Modeling for Computer-Use Agents

Paper • 2603.10178 • Published 29 days ago • 43

upvoted a paper 30 days ago

HiAR: Efficient Autoregressive Long Video Generation via Hierarchical Denoising

Paper • 2603.08703 • Published about 1 month ago • 32

upvoted a paper 2 months ago

RISE-Video: Can Video Generators Decode Implicit World Rules?

Paper • 2602.05986 • Published Feb 5 • 26

upvoted a paper 3 months ago

Latent Implicit Visual Reasoning

Paper • 2512.21218 • Published Dec 24, 2025 • 69

upvoted a paper 4 months ago

Architecture Decoupling Is Not All You Need For Unified Multimodal Model

Paper • 2511.22663 • Published Nov 27, 2025 • 29

upvoted a paper 5 months ago

UFO^3: Weaving the Digital Agent Galaxy

Paper • 2511.11332 • Published Nov 14, 2025 • 19

upvoted a paper 6 months ago

Uni-MMMU: A Massive Multi-discipline Multimodal Unified Benchmark

Paper • 2510.13759 • Published Oct 15, 2025 • 11

upvoted 3 papers 7 months ago

Hyper-Bagel: A Unified Acceleration Framework for Multimodal Understanding and Generation

Paper • 2509.18824 • Published Sep 23, 2025 • 23

Why Language Models Hallucinate

Paper • 2509.04664 • Published Sep 4, 2025 • 199

Self-Rewarding Vision-Language Model via Reasoning Decomposition

Paper • 2508.19652 • Published Aug 27, 2025 • 84

upvoted 4 papers 8 months ago

DINOv3

Paper • 2508.10104 • Published Aug 13, 2025 • 303

Cut2Next: Generating Next Shot via In-Context Tuning

Paper • 2508.08244 • Published Aug 11, 2025 • 13

Skywork UniPic: Unified Autoregressive Modeling for Visual Understanding and Generation

Paper • 2508.03320 • Published Aug 5, 2025 • 64

LongVie: Multimodal-Guided Controllable Ultra-Long Video Generation

Paper • 2508.03694 • Published Aug 5, 2025 • 52

alexiosss

AI & ML interests

Recent Activity

Organizations

Alexislhb's activity