Hugging Face

Yu Zhang

AaronZ345
https://aaronz345.github.io
  • AaronZ345
  • yuzhang34

AI & ML interests

Multi-Modal Generative AI (Spatial Audio/Music/Singing/Speech).

Recent Activity

authored a paper 2 days ago
ALIVE: Animate Your World with Lifelike Audio-Video Generation
authored a paper 3 days ago
SwanVoice: Expressive Long-Form Zero-Shot Speech Synthesis for Both Monologue and Dialogue
authored a paper 3 days ago
Towards Streaming Synchronized Spatial Audio Generation via Autoregressive Diffusion Transformer

Organizations

Zhejiang University
AaronZ345's papers (12)
arxiv:2605.30993
arxiv:2605.30940
arxiv:2510.10396
arxiv:2508.10924
arxiv:2507.14534
arxiv:2507.06670
arxiv:2505.14910
arxiv:2504.20630
arxiv:2504.19062
arxiv:2409.15977
arxiv:2409.13832
arxiv:2312.10741