JMMMU Leaderboard

๐ŸŒ Homepage | ๐Ÿค— Dataset | ๐Ÿ† HF Leaderboard | ๐Ÿ“– arXiv (coming soon) | ๐Ÿ’ป GitHub

"Which LMM is expert in Japanese subjects?" ๐Ÿ† Welcome to the leaderboard of JMMMU

We introduce JMMMU (Japanese MMMU), a multimodal benchmark that can truly evaluate LMM performance in Japanese.
JMMMU consists of 720 translation-based (Culture Agnostic) and 600 brand-new questions (Culture Specific), for a total of 1,320 questions, updating the size of the existing culture-aware Japanese benchmark by >10x.

Evaluation Dimension
Model Size
Model Type
Model
Overall

LMM

65.8