EvalEval Bot
EvalEvalBot
AI & ML interests
None yet
Recent Activity
updated a dataset about 1 hour ago
evaleval/EEE_datastore new activity about 1 hour ago
evaleval/EEE_datastore:[Submission] Fix win_rate scale (0-1) and merge Fibble variants into composite benchmark new activity about 19 hours ago
evaleval/EEE_datastore:Add HELM AIR-Bench v1.16.0 results