Evaluation datasets
community
Followers: 75
AI & ML interests
None defined yet.
Recent Activity
alozowski authored a paper 8 days ago: YourBench: Easy Custom Evaluation Sets for Everyone
SaylorTwift new activity 14 days ago: OpenEvals/SimpleQA: adds_eval_yaml
SaylorTwift updated a dataset 14 days ago: OpenEvals/SimpleQA
Team members: 8
lighteval's datasets: 192
lighteval/prost • Viewer • Updated Aug 13 • 18.7k • 21
lighteval/mmlu • Viewer • Updated Aug 13 • 5.82M • 17.3k • 43
lighteval/pile_helm • Viewer • Updated Aug 13 • 21.3k • 125
lighteval/summarization • Viewer • Updated Aug 13 • 90.3k • 225 • 3
lighteval/treb_table_retrieval • Viewer • Updated Jul 22 • 500 • 286
lighteval/squad_v2 • Viewer • Updated Jul 21 • 142k • 302
lighteval/wikitablequestions • Viewer • Updated Jul 21 • 18.5k • 352 • 1
lighteval/RULER-262144-SmolLM3-SFT-chatml • Preview • Updated Jul 3 • 42
lighteval/RULER-131072-SmolLM3-SFT-chatml • Preview • Updated Jul 3 • 3
lighteval/RULER-65536-SmolLM3-SFT-chatml • Preview • Updated Jul 3 • 18
lighteval/RULER-32768-SmolLM3-SFT-chatml • Preview • Updated Jul 3 • 9
lighteval/RULER-16384-SmolLM3-SFT-chatml • Preview • Updated Jul 3 • 130
lighteval/RULER-8192-SmolLM3-SFT-chatml • Preview • Updated Jul 3 • 66
lighteval/RULER-4096-SmolLM3-SFT-chatml • Preview • Updated Jul 3 • 61
lighteval/RULER-262144-Qwen2.5-0.5B • Preview • Updated Jul 2 • 80
lighteval/RULER-131072-Qwen2.5-0.5B • Preview • Updated Jul 2 • 24
lighteval/RULER-65536-Qwen2.5-0.5B • Preview • Updated Jul 2 • 10
lighteval/RULER-32768-Qwen2.5-0.5B • Preview • Updated Jul 2 • 118
lighteval/RULER-16384-Qwen2.5-0.5B • Preview • Updated Jul 2 • 8
lighteval/RULER-8192-Qwen2.5-0.5B • Preview • Updated Jul 2 • 144
lighteval/RULER-4096-Qwen2.5-0.5B • Preview • Updated Jul 2 • 111
lighteval/RULER-262144-Qwen3-1.7B • Preview • Updated Jul 2 • 21
lighteval/RULER-131072-Qwen3-1.7B • Preview • Updated Jul 2 • 52
lighteval/RULER-65536-Qwen3-1.7B • Preview • Updated Jul 2 • 119
lighteval/RULER-32768-Qwen3-1.7B • Preview • Updated Jul 2 • 74
lighteval/RULER-16384-Qwen3-1.7B • Preview • Updated Jul 2 • 49
lighteval/RULER-8192-Qwen3-1.7B • Preview • Updated Jul 2 • 13
lighteval/RULER-4096-Qwen3-1.7B • Preview • Updated Jul 2 • 15
lighteval/RULER-8192-llama3.2-1b-chat • Preview • Updated Jul 2 • 17
lighteval/RULER-16384-llama3.2-1b-chat • Preview • Updated Jul 2 • 88