Yandex Research

company
https://research.yandex.com/
YandexResearch
https://github.com/yandex-research

AI & ML interests

None defined yet.

Recent Activity

fzmushko  submitted a paper about 22 hours ago
One-Step Gradient Delay is Not a Barrier for Large-Scale Asynchronous Pipeline Parallel LLM Pretraining
free001style  updated a model 4 months ago
yresearch/swd-medium-6-steps
free001style  updated a model 4 months ago
yresearch/swd-medium-4-steps

Papers

One-Step Gradient Delay is Not a Barrier for Large-Scale Asynchronous Pipeline Parallel LLM Pretraining


Members: Dmitry Baranchuk, nikita, Denis Kuznedelev, Ivan Rubachev, Gleb Bazhenov, Valerii, Sergey Kastryulin, Ilya Drobyshevskiy
yresearch's papers: 1
Submitted by Zmushko Philip
20

One-Step Gradient Delay is Not a Barrier for Large-Scale Asynchronous Pipeline Parallel LLM Pretraining
