argilla/distilabel-intel-orca-dpo-pairs
Viewer
• Updated • 12.9k • 9.85k
• 183
Viewer
• Updated • 66.4k • 5.14k
• 241
argilla/ultrafeedback-binarized-preferences-cleaned
Viewer
• Updated • 60.9k • 11.7k
• 162
Viewer
• Updated • 15.3k • 50
• 19
theblackcat102/evol-codealpaca-v1
Viewer
• Updated • 111k • 11.5k
• 182
Viewer
• Updated • 395k • 52.5k
• 463
glaiveai/glaive-code-assistant-v2
Viewer
• Updated • 215k • 238
• 49
Viewer
• Updated • 12.9k • 2.49k
• 322
Viewer
• Updated • 183k • 1.67k
• 295
garage-bAInd/Open-Platypus
Viewer
• Updated • 24.9k • 9.27k
• 421
LLM360/CrystalCoderDatasets
Updated • 1.08k
• 22
protectai/deberta-v3-base-prompt-injection
Text Classification
• 0.2B • Updated • 46.9k
• 108
nampdn-ai/tiny-orca-textbooks
Viewer
• Updated • 147k • 29
• 43
code-search-net/code_search_net
Viewer
• Updated • 4.14M • 14.5k
• 331
WhiteRabbitNeo/WRN-Chapter-1
Viewer
• Updated • 7.75k • 57
• 53
WhiteRabbitNeo/WRN-Chapter-2
Viewer
• Updated • 11.1k • 52
• 21
Text Generation
• 0.4B • Updated • 332
• 209
Viewer
• Updated • 31.1M • 18.7k
• 721
Viewer
• Updated • 3.54k • 39
• 55
NousResearch/json-mode-eval
Viewer
• Updated • 100 • 1.47k
• 44
Viewer
• Updated • 2.75M • 10.1k
• 395
Viewer
• Updated • 518k • 5
• 1
laurentiubp/openhermes-scored
Viewer
• Updated • 185k • 5
• 1
Towards Best Practices for Open Datasets for LLM Training
Paper
• 2501.08365
• Published • 62