Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Building on HF
14.2
TFLOPS
45
139
430
Bertrand Chevrier
kramp
Follow
takarajordan's profile picture
Blane187's profile picture
timqian's profile picture
210 followers
·
923 following
kramp
krampstudio
chevrierbertrand
thekramp.bsky.social
AI & ML interests
text 2 speech, ai for music writting
Recent Activity
reacted
to
IlyasMoutawwakil
's
post
with 🔥
about 22 hours ago
Transformers v5 just landed! 🚀 It significantly unifies and reduces modeling code across architectures, while opening the door to a whole new class of performance optimizations. My favorite new feature? 🤔 The new dynamic weight loader + converter. Here’s why 👇 Over the last few months, the core Transformers maintainers built an incredibly fast weight loader, capable of converting tensors on the fly while loading them in parallel threads. This means we’re no longer constrained by how parameters are laid out inside the safetensors weight files. In practice, this unlocks two big things: - Much more modular modeling code. You can now clearly see how architectures build on top of each other (DeepSeek v2 → v3, Qwen v2 → v3 → MoE, etc.). This makes shared bottlenecks obvious and lets us optimize the right building blocks once, for all model families. - Performance optimizations beyond what torch.compile can do alone. torch.compile operates on the computation graph, but it can’t change parameter layouts. With the new loader, we can restructure weights at load time: fusing MoE expert projections, merging attention QKV projections, and enabling more compute-dense kernels that simply weren’t possible before. Personally, I'm honored to have contributed in this direction, including the work on optimizing MoE implementations and making modeling code more torch-exportable, so these optimizations can be ported cleanly across runtimes. Overall, Transformers v5 is a strong signal of where the community and industry are converging: Modularity and Performance, without sacrificing Flexibility. Transformers v5 makes its signature from_pretrained an entrypoint where you can mix and match: - Parallelism - Quantization - Custom kernels - Flash/Paged attention - Continuous batching - ... Kudos to everyone involved! I highly recommend the: Release notes: https://github.com/huggingface/transformers/releases/tag/v5.0.0 Blog post: https://huggingface.co/blog/transformers-v5
liked
a model
2 days ago
moonshotai/Kimi-K2.5
liked
a Space
2 days ago
yonigozlan/Transformers-Timeline
View all activity
Organizations
kramp
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
New activity in
huggingface/HuggingDiscussions
17 days ago
[FEEDBACK] Local apps
👀
❤️
4
63
#31 opened over 1 year ago by
kramp
New activity in
reach-vb/TinyLlama-1.1B-Chat-v1.0-q4_k_m-GGUF
4 months ago
Update metadata (general.name)
#47 opened 4 months ago by
kramp
New activity in
huggingface/HuggingDiscussions
6 months ago
"Mark all as Read" broken for notifications
2
#71 opened 6 months ago by
Aurelien-Morgan
New activity in
prithivMLmods/Openpdf-Analysis-Recognition
7 months ago
Dataset-based model filter
2
#4 opened 7 months ago by
prithivMLmods
New activity in
huggingface/HuggingDiscussions
7 months ago
[FEEDBACK] Daily Papers
🔥
❤️
21
169
#32 opened over 1 year ago by
kramp
New activity in
huggingface/HuggingDiscussions
10 months ago
Async active filters on HomePage laggy or broken
4
#67 opened 10 months ago by
Aurelien-Morgan
New activity in
social-post-explorers/README
11 months ago
Post button doesn't appear.
7
#46 opened 11 months ago by
ccocks-deca
New activity in
huggingface/HuggingDiscussions
12 months ago
Upload 2 files
5
#50 opened 12 months ago by
Ajllo
New activity in
huggingface/HuggingDiscussions
about 1 year ago
Questions about Ollama: Running Hugging Face GGUF models
7
#46 opened about 1 year ago by
NCGWRjason
New activity in
huggingface/HuggingDiscussions
over 1 year ago
[FEEDBACK] Notifications
❤️
🤗
16
171
#6 opened over 3 years ago by
victor
Spaces - general feedback
8
#21 opened almost 2 years ago by
victor
New activity in
social-post-explorers/README
over 1 year ago
"Back To Feed" brings you back to feed not posts
1
#42 opened over 1 year ago by
Tonic
New activity in
blog-explorers/README
over 1 year ago
[Support] Community Articles
🚀
🤝
1
100
#5 opened almost 2 years ago by
victor
[Support] Community Articles
🤝
🚀
1
100
#5 opened almost 2 years ago by
victor
New activity in
huggingface/HuggingDiscussions
over 1 year ago
[FEEDBACK] Daily Papers
🔥
❤️
21
169
#32 opened over 1 year ago by
kramp
Bugs? Preview Errors today -- Resolved 💞🤗🙏
6
#35 opened over 1 year ago by
digiplay
commented
2 papers
almost 2 years ago
Stealing Part of a Production Language Model
Paper
•
2403.06634
•
Published
Mar 11, 2024
•
91
•
3
Stealing Part of a Production Language Model
Paper
•
2403.06634
•
Published
Mar 11, 2024
•
91
•
3
New activity in
social-post-explorers/README
almost 2 years ago
Removing the head or upper body during the generation
3
#28 opened almost 2 years ago by
drailix
Unable to recover a post after a failed attempt to post due to exceeding the daily posting quota
3
#23 opened almost 2 years ago by
santiviquez
Load more