Sarvam MMLU-Indic (Marathi)

Sarvam MMLU-Indic (Marathi)

Translated MMLU (Massive Multitask Language Understanding) benchmark in 10 Indian languages including Marathi. Contains multiple-choice questions spanning science, humanities, social sciences, and more. Standard benchmark for evaluating how well Marathi LLMs compare to English ones.

Benchmark Marathi language models on MMLU-Indic to measure knowledge and reasoning capabilities in Marathi.
HomepageHuggingFace

Quick Start

from datasets import load_dataset
# ds = load_dataset('sarvam/mmlu-indic', 'mr')
print('Sarvam MMLU-Indic Marathi')
print('Multiple-choice knowledge benchmark in Marathi')
Modality
text
Size
~296K questions across 10 languages; Marathi subset included
License
Format
JSON / Parquet
Language
mr
Update Frequency
static
Organization
Sarvam AI

Schema

FieldTypeDescription
questionstringMultiple-choice question in Marathi
choiceslist[string]Answer choices
answerstringCorrect answer
subjectstringSubject/topic area

Build With This

Create a Marathi model knowledge gap analyzer identifying which subjects have weakest performance on MMLU
Develop a curriculum-aligned Marathi QA benchmark extending MMLU with Maharashtra-specific educational content
Build a Marathi model comparison leaderboard tracking MMLU-Indic scores across different model families

AI Use Cases

Marathi LLM evaluation and benchmarkingKnowledge assessment across domainsCross-lingual model comparisonIdentifying LLM knowledge gaps in Marathi
Last verified: 2026-03-09