Ruiyi Zhang's picture

4 6 7

Ruiyi Zhang

zhangry868

·

AI & ML interests

None yet

Recent Activity

published a dataset 15 days ago

LLaVAR-2/ccPDF_Extraction

upvoted a paper about 1 month ago

GUI-AIMA: Aligning Intrinsic Multimodal Attention with a Context Anchor for GUI Grounding

commented on a paper about 1 month ago

GUI-AIMA: Aligning Intrinsic Multimodal Attention with a Context Anchor for GUI Grounding

View all activity

Organizations

upvoted a paper about 1 month ago

GUI-AIMA: Aligning Intrinsic Multimodal Attention with a Context Anchor for GUI Grounding

Paper • 2511.00810 • Published Nov 2 • 3

upvoted 2 papers 4 months ago

VisR-Bench: An Empirical Study on Visual Retrieval-Augmented Generation for Multilingual Long Document Understanding

Paper • 2508.07493 • Published Aug 10 • 8

HunyuanWorld 1.0: Generating Immersive, Explorable, and Interactive 3D Worlds from Words or Pixels

Paper • 2507.21809 • Published Jul 29 • 135

upvoted a paper 5 months ago

A Survey on Long-Video Storytelling Generation: Architectures, Consistency, and Cinematic Quality

Paper • 2507.07202 • Published Jul 9 • 24

upvoted a paper about 1 year ago

DynaSaur: Large Language Agents Beyond Predefined Actions

Paper • 2411.01747 • Published Nov 4, 2024 • 37

upvoted a paper over 2 years ago

LLaVAR: Enhanced Visual Instruction Tuning for Text-Rich Image Understanding

Paper • 2306.17107 • Published Jun 29, 2023 • 11