Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Zijie Xin's picture
2 9 12

Zijie Xin

xxayt
·
https://xxayt.github.io/
  • xxayt

AI & ML interests

multi-modal learning, AIGC

Recent Activity

upvoted a paper 1 day ago
MMOU: A Massive Multi-Task Omni Understanding and Reasoning Benchmark for Long and Complex Real-World Videos
authored a paper 3 days ago
SAVE: Speech-Aware Video Representation Learning for Video-Text Retrieval
upvoted a paper 3 days ago
SAVE: Speech-Aware Video Representation Learning for Video-Text Retrieval
View all activity

Organizations

SeekWorld's profile picture

Collections 1

MGSV
[ICCV 2025] Music Grounding by Short Video
  • xxayt/MGSV-EC

    Viewer • Updated Aug 5, 2025 • 53.2k • 43 • 2
  • Music Grounding by Short Video

    Paper • 2408.16990 • Published Aug 30, 2024 • 2
MGSV
[ICCV 2025] Music Grounding by Short Video
  • xxayt/MGSV-EC

    Viewer • Updated Aug 5, 2025 • 53.2k • 43 • 2
  • Music Grounding by Short Video

    Paper • 2408.16990 • Published Aug 30, 2024 • 2

Papers 4

arxiv:2603.08224
arxiv:2508.02340
arxiv:2503.19351
arxiv:2408.16990

models 0

None public yet

datasets 1

xxayt/MGSV-EC

Viewer • Updated Aug 5, 2025 • 53.2k • 43 • 2
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs