arxiv:2601.07767
Deqing Fu PRO
deqing
AI & ML interests
None yet
Recent Activity
updated a model about 5 hours ago: deqing/llama-600M-v4-original
published a model about 6 hours ago: deqing/llama-600M-v4-original
updated a model about 15 hours ago: deqing/llama-300M-v3-muon-original