TIGER-Lab/MMLU-Pro
Benchmark
•
Updated
•
12.1k
•
71.8k
•
405
Natural Language Processing, Image Generation
VisCoder2: Building Multi-Language Visualization Coding Agents
BrowserAgent: Building Web Agents with Human-Inspired Web Browsing Actions