Toolkits (DUPLICATE them, never use the public ones) A set of tools to enable finetuning, evaluations, prototyping, agentic workflows etc. ATTENTION: ALWAYS DUPLICATE THESE SPACES ON OUR INFRA!!! Running 118 AutoTrain Advanced 🚀 118 Create powerful AI models without code Runtime error 39 LLM Merge Adapter 🐢 39 Runtime error Featured 290 mergekit-gui 🔀 290 Merge AI models using a YAML configuration file
Benchmarks Most commonly used leaderboards to check model capabilities Running on CPU Upgrade 13.8k Open LLM Leaderboard 🏆 13.8k Track, rank and evaluate open LLMs and chatbots Running Featured 435 LLM Performance Leaderboard 🐨 435 View LLM performance rankings Configuration error 4.71k LMArena Leaderboard 🏆 4.71k Display LMArena Leaderboard Running on CPU Upgrade 6.98k MTEB Leaderboard 🥇 6.98k Embedding Leaderboard
Running on CPU Upgrade 13.8k Open LLM Leaderboard 🏆 13.8k Track, rank and evaluate open LLMs and chatbots
Toolkits (DUPLICATE them, never use the public ones) A set of tools to enable finetuning, evaluations, prototyping, agentic workflows etc. ATTENTION: ALWAYS DUPLICATE THESE SPACES ON OUR INFRA!!! Running 118 AutoTrain Advanced 🚀 118 Create powerful AI models without code Runtime error 39 LLM Merge Adapter 🐢 39 Runtime error Featured 290 mergekit-gui 🔀 290 Merge AI models using a YAML configuration file
Benchmarks Most commonly used leaderboards to check model capabilities Running on CPU Upgrade 13.8k Open LLM Leaderboard 🏆 13.8k Track, rank and evaluate open LLMs and chatbots Running Featured 435 LLM Performance Leaderboard 🐨 435 View LLM performance rankings Configuration error 4.71k LMArena Leaderboard 🏆 4.71k Display LMArena Leaderboard Running on CPU Upgrade 6.98k MTEB Leaderboard 🥇 6.98k Embedding Leaderboard
Running on CPU Upgrade 13.8k Open LLM Leaderboard 🏆 13.8k Track, rank and evaluate open LLMs and chatbots