Updated
•
3.8k
•
185
Viewer
•
Updated
•
170M
•
52.5k
•
88
Viewer
•
Updated
•
621M
•
35.1k
•
84
Locutusque/UltraTextbooks
Viewer
•
Updated
•
5.52M
•
1.71k
•
196
PrimeIntellect/StackV1-popular
Viewer
•
Updated
•
93M
•
2.64k
•
2
Viewer
•
Updated
•
11.7M
•
149
•
5
EleutherAI/the_pile_deduplicated
Viewer
•
Updated
•
134M
•
12.1k
•
107
HIT-TMG/KaLM-embedding-pretrain-data
Viewer
•
Updated
•
23.7M
•
1.8k
•
16
suriyagunasekar/stackoverflow-with-meta-data
Viewer
•
Updated
•
19.9M
•
828
•
12
Viewer
•
Updated
•
13.6M
•
871
•
5
Viewer
•
Updated
•
3.71M
•
818k
•
562
Viewer
•
Updated
•
474M
•
2.56k
•
4
EleutherAI/deep-ignorance-annealing-mix
Viewer
•
Updated
•
89M
•
1.46k
•
1
Viewer
•
Updated
•
10.2M
•
128
•
5
Viewer
•
Updated
•
1.76M
•
19.9k
•
394
Viewer
•
Updated
•
167M
•
2.36k
•
60
Locutusque/deeplm-training-data
Viewer
•
Updated
•
2.17M
•
75
•
3
nvidia/Llama-Nemotron-Post-Training-Dataset
Viewer
•
Updated
•
3.91M
•
5.46k
•
638
Updated
•
51.2k
•
247
EssentialAI/essential-web-v1.0
Preview
•
Updated
•
6.25k
•
213