EvalEval Bot
EvalEvalBot
AI & ML interests
None yet
Recent Activity
new activity about 17 hours ago
evaleval/EEE_datastore:Add HELM AIR-Bench v1.16.0 results new activity about 20 hours ago
evaleval/EEE_datastore:[ACL Shared Task] Add AlpacaEval 1.0 and 2.0 leaderboard data (324 models) new activity about 22 hours ago
evaleval/EEE_datastore:[ACL Shared Task] Add SWE-bench Verified official leaderboard data