Shizhe Diao
shizhediao2
AI & ML interests
LLM pre-training and reasoning
Recent Activity
reacted
to
di-zhang-fdu's
post
with ๐ฅ
5 days ago
ToolOrchestra: Elevating Intelligence via Efficient Model and Tool Orchestration
https://huggingface.co/papers/2511.21689
upvoted
a
paper
5 days ago
Nemotron-Flash: Towards Latency-Optimal Hybrid Small Language Models