AI & ML interests
None defined yet.
Recent Activity
Organization Card
This is the organization grouping all the models and datasets used in the TRL library.
spaces 9
Starting
Agents
6
Dataset Length Profiler
👁
Estimate optimal max_length for SFT training
Running
code diff viewer
⚡
Display side-by-side code differences
Running
TRL PyPI downloads
📈
Show weekly PyPI download trends
Build error
Agents
Featured
213
StackLLaMa
🦙
Running
Agents
Chat Template Inspector
📚
Inspect and test chat templates for Hugging Face models
Running
Agents
4
Trackio
🚀
Track and visualize data streams in real-time
models 84
trl-lib/rloo_tldr
Text Generation • 1B • Updated • 3
trl-lib/ppo_tldr
Text Generation • 1B • Updated • 9
trl-lib/Qwen3-4B-LoRA
Updated • 1
trl-lib/Qwen2-0.5B-Reward-Math-Sheperd
Token Classification • 0.5B • Updated • 14 • 1
trl-lib/Qwen2-0.5B-XPO
Text Generation • 0.5B • Updated • 3 •
trl-lib/Qwen2-0.5B-OnlineDPO
Text Generation • 0.5B • Updated • 7 • • 1
trl-lib/Qwen2-0.5B-KTO
Text Generation • 0.5B • Updated • 4
trl-lib/Qwen2-0.5B-ORPO
Text Generation • 0.5B • Updated • 38 • 2
trl-lib/Qwen2-0.5B-DPO
Text Generation • 0.5B • Updated • 38 • 4
trl-lib/Qwen2-0.5B-Reward
Text Classification • 0.5B • Updated • 84 • 1
datasets 23
trl-lib/trackio-dataset
Viewer • Updated • 3.83k • 22k • 5
trl-lib/documentation-images
Viewer • Updated • 11 • 91k
trl-lib/DeepMath-103K
Viewer • Updated • 103k • 4.59k • 10
trl-lib/llava-instruct-mix
Viewer • Updated • 228k • 2.33k • 3
trl-lib/OpenMathReasoning
Viewer • Updated • 3.2M • 541
trl-lib/chatbot_arena_completions
Viewer • Updated • 33k • 251 • 1
trl-lib/rlaif-v
Viewer • Updated • 83.1k • 480 • 3
trl-lib/ultrafeedback-gpt-3.5-turbo-helpfulness
Viewer • Updated • 16.6k • 42 • 4
trl-lib/ultrafeedback-prompt
Viewer • Updated • 39.8k • 605 • 9
trl-lib/tldr-preference
Viewer • Updated • 179k • 142 • 3