Random samples from large datasets, for convenience.
Datasets (24)
bluelightai-dev/clt-mixed-eval-data-tokenized-Qwen3 • Viewer • 115k • 3
bluelightai-dev/clt-mixed-eval-data • Viewer • 60k • 15
bluelightai-dev/clt-mixed-data-tokenized-Qwen3 • Viewer • 2.6M • 4
bluelightai-dev/clt-pretrain-eval-data-tokenized-Qwen3-256 • Viewer • 194k • 67
bluelightai-dev/clt-pretrain-data-dedup-tokenized-Qwen3-1024 • Viewer • 2.52M • 108
bluelightai-dev/clt-pretrain-data-v2-dedup • Preview • 1
bluelightai-dev/clt-pretrain-data-tokenized-Qwen3-1024 • Viewer • 2.44M • 52
bluelightai-dev/clt-pretrain-data-v2 • Preview • 51
bluelightai-dev/MathPile_Commercial-formatted • Viewer • 389k • 12
bluelightai-dev/clt_posttrain_data_tokenized • Viewer • 1.34M • 8