Yu Zhang

AaronZ345

https://aaronz345.github.io

AI & ML interests

Multi-Modal Generative AI (Spatial Audio/Music/Singing/Speech).

Recent Activity

authored a paper 2 days ago

ALIVE: Animate Your World with Lifelike Audio-Video Generation

authored a paper 3 days ago

SwanVoice: Expressive Long-Form Zero-Shot Speech Synthesis for Both Monologue and Dialogue

authored a paper 3 days ago

Towards Streaming Synchronized Spatial Audio Generation via Autoregressive Diffusion Transformer

AaronZ345's papers (12)

arxiv:2605.30993

arxiv:2605.30940

arxiv:2510.10396

arxiv:2508.10924

arxiv:2507.14534

arxiv:2507.06670

arxiv:2505.14910

arxiv:2504.20630

arxiv:2504.19062

arxiv:2409.15977

arxiv:2409.13832

arxiv:2312.10741