
mm-evaluation

community
https://github.com/mm-evaluation

AI & ML interests

None defined yet.

Recent Activity

ZTWHHH  updated a dataset 2 days ago
mm-eval/LiveBench
ZTWHHH  published a dataset 2 days ago
mm-eval/LiveBench
ZTWHHH  updated a dataset 2 days ago
mm-eval/SpatialEval

Team members: William Li, Icy Wang, Tianwei Zhao, Snorf Yang

models 18

mm-eval/WeMM-Chat-2k-CN

Updated Oct 14, 2025 • 2

mm-eval/WeMM-Chat-CN

Updated Oct 14, 2025 • 1

mm-eval/WeMM

Updated Oct 14, 2025 • 2

mm-eval/VLMEvalKit

Updated Jul 20, 2025

mm-eval/llava-next-qwen-32b

Updated May 7, 2025 • 2

mm-eval/minigpt4_13b

Updated May 4, 2025

mm-eval/minigpt4_7b

Updated May 4, 2025

mm-eval/minigpt4_v2

Updated May 4, 2025

mm-eval/Llama-3-LongVILA-8B-512Frames

Text Generation • Updated Apr 29, 2025 • 2

mm-eval/Llama-3-LongVILA-8B-1024Frames

Updated Apr 29, 2025 • 2

datasets 121

mm-eval/LiveBench

Viewer • Updated 2 days ago • 1k • 26

mm-eval/SpatialEval

Viewer • Updated 2 days ago • 9.27k • 23 • 1

mm-eval/LEGO-Puzzles

Viewer • Updated 2 days ago • 1.1k • 20

mm-eval/MathVerse

Updated 2 days ago • 168

mm-eval/SeePhys

Viewer • Updated 2 days ago • 2k • 34 • 1

mm-eval/ScienceQA-IMG

Viewer • Updated 2 days ago • 10.3k • 20

mm-eval/LLaVA-Bench-COCO

Viewer • Updated 2 days ago • 90 • 21

mm-eval/TheoremQA

Viewer • Updated 3 days ago • 53 • 22

mm-eval/NaturalBench

Updated 3 days ago • 107

mm-eval/OCRBench-v2

Viewer • Updated 3 days ago • 10k • 91