arxiv:2606.30634
Egor Petrov
moderntalker
AI & ML interests
None yet
Recent Activity
authored a paper 1 day ago
One-Step Gradient Delay is Not a Barrier for Large-Scale Asynchronous Pipeline Parallel LLM Pretraining
updated a model 3 days ago
moderntalker/efficient_pretrain_checkpoints
Organizations
None yet