Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
32
3
1
Stas Bekman
stas
Follow
argmin's profile picture
deenadayalandms's profile picture
DervinA1's profile picture
128 followers
·
4 following
https://stasosphere.com/machine-learning/
StasBekman
stas00
stasbekman
AI & ML interests
Toolmaker. Software creator, optimizer and harmonizer. Makes things work and fly at Snowflake AI Research Training LLM/RAG/Generative AI/Machine Learning/Scalability
Recent Activity
updated
a model
2 days ago
stas/ml-engineering-book
posted
an
update
3 days ago
Good news! Ulysses Sequence Parallelism from the Snowflake AI Research and the Deepspeed teams has been integrated into HuggingFace Trainer, Accelerate and TRL For extensive details please see this writeup: https://huggingface.co/blog/ulysses-sp Thanks a lot to Kashif Rasul for helping make it happen. Also the others in the HF team who helped with integration.
published
an
article
4 days ago
Ulysses Sequence Parallelism: Training with Million-Token Contexts
View all activity
Organizations
stas
's models
9
Sort:Â Recently updated
stas/ml-engineering-book
Updated
1 day ago
•
26
stas/tiny-random-llama-2
Text Generation
•
104k
•
Updated
Nov 14, 2023
•
28.1k
•
41
stas/tiny-m2m_100
Updated
Apr 29, 2022
•
3.57k
stas/tr8b-104B-debug3
Updated
Nov 29, 2021
stas/pegasus-cnn_dailymail-tiny-random
Updated
Jul 1, 2021
•
646
stas/mt5-tiny-random
Updated
Jun 23, 2021
•
260
•
2
stas/tiny-wmt19-en-de
Updated
May 3, 2021
•
111k
•
1
stas/tiny-wmt19-en-ru
Updated
May 3, 2021
•
316
stas/t5-very-small-random
Updated
Apr 21, 2021
•
14
•
1