Li Dong
unilm
AI & ML interests
Language Model Pre-Training
Recent Activity
liked
a model
about 5 hours ago
microsoft/VibeVoice-ASR
authored
a paper
1 day ago
MoE-CAP: Benchmarking Cost, Accuracy and Performance of Sparse Mixture-of-Experts Systems
authored
a paper
1 day ago
Towards Stable and Effective Reinforcement Learning for Mixture-of-Experts