danielhanchen
posted an update 3 days ago
You don’t need to set LLM parameters anymore! 🚀

llama.cpp uses only the context length and compute that your local setup needs. Unsloth also auto-applies the correct model settings.

Try it in Unsloth Studio, now with precompiled llama.cpp binaries.

GitHub: https://github.com/unslothai/unsloth
danielhanchen
posted an update 11 days ago
Introducing Unsloth Studio ✨
A new open-source web UI to train and run LLMs.

• Run models locally on Mac, Windows, Linux
• Train 500+ models 2x faster with 70% less VRAM
• Supports GGUF, vision, audio, embedding models
• Auto-create datasets from PDF, CSV, DOCX
• Self-healing tool calling and code execution
• Compare models side by side + export to GGUF

GitHub: https://github.com/unslothai/unsloth
Blog and Guide: https://unsloth.ai/docs/new/studio

Available now on Hugging Face, NVIDIA, Docker and Colab.
danielhanchen
posted an update 15 days ago
We collaborated with NVIDIA to teach you about Reinforcement Learning and RL environments. 💚 Learn:

• Why RL environments matter + how to build them
• When RL is better than SFT
• GRPO and RL best practices
• How verifiable rewards and RLVR work

Blog: https://unsloth.ai/blog/rl-environments
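
To make the idea of a verifiable reward concrete, here is a minimal Python sketch. It is an illustration only and is not taken from the blog: the function names, the "last number in the completion" answer-extraction rule, and the use of the population standard deviation are all assumptions. It scores each completion with a programmatic check, then normalizes the rewards within one sampling group, which is the group-relative step GRPO is named after.

```python
import re
from statistics import mean, pstdev


def verifiable_reward(completion: str, expected: str) -> float:
    """Return 1.0 if the last number in the completion equals the expected answer.

    Hypothetical checker: a real RL environment would use a task-specific verifier.
    """
    numbers = re.findall(r"-?\d+(?:\.\d+)?", completion)
    if not numbers:
        return 0.0
    return 1.0 if float(numbers[-1]) == float(expected) else 0.0


def group_relative_advantages(rewards: list[float], eps: float = 1e-6) -> list[float]:
    """Normalize each reward against the mean and spread of its own group.

    Assumption: population standard deviation; implementations differ on this detail.
    """
    mu = mean(rewards)
    sigma = pstdev(rewards)
    return [(r - mu) / (sigma + eps) for r in rewards]


# Three sampled completions for the same prompt ("What is 2 + 2?").
completions = ["2 + 2 = 4", "The answer is 5", "I think it is 4"]
rewards = [verifiable_reward(c, "4") for c in completions]
advantages = group_relative_advantages(rewards)
```

Completions that pass the check end up with positive advantages and the failing one with a negative advantage, so no learned reward model is needed for this kind of task.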
danielhanchen
posted an update about 1 month ago
100,000+ models trained with Unsloth have now been open-sourced on 🤗 Hugging Face! 🦥

Here are the most popular ones you can run locally:
1. TeichAI - GLM-4.7-Flash distilled from Claude 4.5 Opus (high)
2. Zed - Qwen Coder 7B fine-tuned for stronger coding
3. DavidAU - Llama-3.3-8B distilled from Claude 4.5 Opus (high)
4. huihui - gpt-oss made “abliterated”

Links to models:
1. TeichAI: TeichAI/GLM-4.7-Flash-Claude-Opus-4.5-High-Reasoning-Distill-GGUF
2. Zed: zed-industries/zeta
3. DavidAU: DavidAU/Llama3.3-8B-Instruct-Thinking-Claude-4.5-Opus-High-Reasoning
4. huihui: huihui-ai/Huihui-gpt-oss-20b-BF16-abliterated

See all the 100K latest models fine-tuned with Unsloth here: https://huggingface.co/models?other=u
danielhanchen
posted an update about 1 month ago
You can now run Qwen3.5 locally! 💜
Qwen3.5-397B-A17B is an open MoE vision reasoning LLM for agentic coding & chat. It performs on par with Gemini 3 Pro, Claude Opus 4.5 & GPT-5.2.

GGUF: unsloth/Qwen3.5-397B-A17B-GGUF
Run the Dynamic 3-bit quant on a 192GB Mac at 20 tokens/s.
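
As a rough sanity check on why a 3-bit quant of a 397B-parameter model can fit on a 192GB machine, the arithmetic can be sketched in Python. This is a back-of-the-envelope estimate under stated assumptions: it treats every weight as exactly 3 bits, whereas a dynamic quant mixes bit widths, and it ignores the KV cache and runtime overhead, so real file sizes and memory use will differ. The helper name is hypothetical.

```python
def quantized_weight_gb(num_params: float, bits_per_weight: float) -> float:
    """Approximate weight storage in decimal gigabytes for a uniform bit width."""
    return num_params * bits_per_weight / 8 / 1e9


# 397B parameters at a uniform 3 bits per weight (assumption, see above).
size_gb = quantized_weight_gb(397e9, 3)
print(f"{size_gb:.1f} GB")
```

Under these assumptions the weights alone come to roughly 149 GB, which leaves some headroom on a 192GB machine for context and the operating system.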

Guide: https://unsloth.ai/docs/models/qwen3.5
danielhanchen
posted an update about 2 months ago
We collaborated with Hugging Face to enable you to train MoE models 12× faster with 35% less VRAM via our new Triton kernels (no accuracy loss). 🤗

Train gpt-oss locally on 12.8GB VRAM with our free notebooks: https://unsloth.ai/docs/new/faster-moe
danielhanchen
posted an update about 2 months ago
We created a tool-calling guide for local LLMs!

Learn how to use any open model like Qwen3-Coder-Next and GLM-4.7-Flash for function calling.

Guide: https://unsloth.ai/docs/basics/tool-calling-guide-for-local-llms

We provide hands-on examples for: story writing, Python execution, terminal tool calls, maths and more.
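
For readers new to function calling, the core loop can be sketched in a few lines of Python. This is not code from the guide: the `add` tool, the `TOOLS` registry, the `run_tool_call` helper, and the assumption that the model emits a JSON object with `name` and `arguments` keys are all illustrative. Real models and chat templates format tool calls differently.

```python
import json


def add(a: float, b: float) -> float:
    """Example tool: add two numbers."""
    return a + b


# Registry mapping tool names to Python callables (hypothetical).
TOOLS = {"add": add}

# JSON-schema style description that would be shown to the model.
TOOL_SCHEMAS = [{
    "type": "function",
    "function": {
        "name": "add",
        "description": "Add two numbers.",
        "parameters": {
            "type": "object",
            "properties": {"a": {"type": "number"}, "b": {"type": "number"}},
            "required": ["a", "b"],
        },
    },
}]


def run_tool_call(raw_call: str) -> str:
    """Parse a JSON tool call emitted by the model and return the result as JSON.

    Malformed calls are reported back as an error message instead of raising,
    so the model can be given a chance to retry.
    """
    try:
        call = json.loads(raw_call)
        func = TOOLS[call["name"]]
        result = func(**call["arguments"])
        return json.dumps({"name": call["name"], "result": result})
    except (json.JSONDecodeError, KeyError, TypeError) as exc:
        return json.dumps({"error": f"{type(exc).__name__}: {exc}"})


# Assumed model output for the prompt "What is 2 + 3?".
model_output = '{"name": "add", "arguments": {"a": 2, "b": 3}}'
tool_message = run_tool_call(model_output)
```

The returned `tool_message` would be appended to the conversation as the tool's reply before asking the model for its final answer.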
danielhanchen
posted an update 2 months ago
You can now fine-tune embedding models in our free Unsloth notebook! 🤗

Fine-tuning embedding models improves retrieval & RAG by aligning vectors to your domain-specific notion of similarity, improving search, clustering, and recommendations on your data.
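
One common way to express "aligning vectors to your notion of similarity" is a contrastive loss with in-batch negatives. The pure-Python sketch below is an assumption about the general technique, not Unsloth's implementation: the function names, the temperature value, and the toy 2-dimensional vectors are all illustrative. Each query's paired document is the positive and every other document in the batch acts as a negative.

```python
import math


def cosine(u: list[float], v: list[float]) -> float:
    """Cosine similarity between two vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def in_batch_contrastive_loss(queries, documents, temperature: float = 0.05) -> float:
    """Mean cross-entropy where documents[i] is the positive for queries[i]."""
    total = 0.0
    for i, q in enumerate(queries):
        logits = [cosine(q, d) / temperature for d in documents]
        # Numerically stable log-sum-exp over all documents in the batch.
        m = max(logits)
        log_denominator = m + math.log(sum(math.exp(x - m) for x in logits))
        total += log_denominator - logits[i]
    return total / len(queries)


queries = [[1.0, 0.0], [0.0, 1.0]]
aligned = [[0.9, 0.1], [0.1, 0.9]]      # each document close to its own query
misaligned = [[0.1, 0.9], [0.9, 0.1]]   # each document close to the wrong query

loss_aligned = in_batch_contrastive_loss(queries, aligned)
loss_misaligned = in_batch_contrastive_loss(queries, misaligned)
```

The loss is near zero when each query sits closest to its own document and large when the pairs are swapped, which is the signal fine-tuning uses to pull domain-relevant pairs together.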

⭐ Blog + Notebooks: https://unsloth.ai/docs/new/embedding-finetuning

Unsloth trains embedding models 1.8-3.3x faster with 20% less VRAM, 2x longer context & no accuracy loss vs. FA2 setups.

We'd like to thank Hugging Face and Unsloth contributor electroglyph for making this possible!
danielhanchen
posted an update 2 months ago
You can now do reinforcement learning training with 7× longer context and no accuracy loss, via our new batching algorithms.

Long reasoning chains in RL are costly, but now we enable you to train gpt-oss with GRPO & reach 380K context on a 192GB GPU.

Blog: https://unsloth.ai/docs/new/grpo-long-context
danielhanchen
posted an update 3 months ago
You can now run GLM-4.7, the new 355B parameter SOTA model on your local device (128GB RAM).✨

The model achieves SOTA performance on coding, agentic and chat benchmarks.

GGUF: unsloth/GLM-4.7-GGUF
Guide: https://docs.unsloth.ai/models/glm-4.7