A translation of the MMLU (Massive Multitask Language Understanding) benchmark into 10 Indian languages, including Marathi. It contains multiple-choice questions spanning science, humanities, social sciences, and more. Because the questions mirror the English MMLU, it is commonly used to compare how well LLMs perform in Marathi against their performance in English.
from datasets import load_dataset

# Uncomment to load the Marathi ('mr') configuration; the data is downloaded on first use.
# ds = load_dataset('sarvam/mmlu-indic', 'mr')
print('Sarvam MMLU-Indic Marathi')
print('Multiple-choice knowledge benchmark in Marathi')

| Field | Type | Description |
|---|---|---|
| question | string | Multiple-choice question in Marathi |
| choices | list[string] | Answer choices |
| answer | string | Correct answer |
| subject | string | Subject/topic area |
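
The fields above are enough to build a prompt and score predictions. The following is a minimal sketch, not the dataset authors' evaluation code. The sample record is made up for illustration, and the helper names (`answer_index`, `format_prompt`, `accuracy`) are hypothetical. The schema does not say how `answer` is encoded, so the sketch assumes it is either the text of the correct choice, a zero-based index such as `"1"`, or a letter such as `"B"`; check a few real rows before relying on it.

```python
from string import ascii_uppercase


def answer_index(record):
    """Resolve the `answer` string to a position in `choices`.

    Assumption: `answer` is the choice text, a zero-based index, or a letter.
    If the choices themselves are digits or single letters, the text match
    below wins, which may not be what the dataset intends.
    """
    answer = record["answer"].strip()
    choices = record["choices"]
    if answer in choices:  # stored as the text of the correct choice
        return choices.index(answer)
    if answer.isdecimal() and int(answer) < len(choices):  # stored as "0", "1", ...
        return int(answer)
    letters = ascii_uppercase[: len(choices)]
    if len(answer) == 1 and answer.upper() in letters:  # stored as "A", "B", ...
        return letters.index(answer.upper())
    raise ValueError(f"Cannot interpret answer {answer!r}")


def format_prompt(record):
    """Render one record as a lettered multiple-choice prompt."""
    lines = [record["question"]]
    for letter, choice in zip(ascii_uppercase, record["choices"]):
        lines.append(f"{letter}. {choice}")
    lines.append("Answer:")
    return "\n".join(lines)


def accuracy(records, predicted_letters):
    """Fraction of records whose predicted letter matches the gold answer."""
    if not records:
        return 0.0
    correct = sum(
        ascii_uppercase[answer_index(record)] == prediction.strip().upper()
        for record, prediction in zip(records, predicted_letters)
    )
    return correct / len(records)


# Made-up record in the documented schema (not taken from the dataset).
sample = {
    "question": "भारताची राजधानी कोणती आहे?",
    "choices": ["मुंबई", "नवी दिल्ली", "कोलकाता", "चेन्नई"],
    "answer": "नवी दिल्ली",
    "subject": "geography",
}

print(format_prompt(sample))
print(accuracy([sample], ["B"]))
```

To report results per topic, the same `accuracy` function can be applied to records grouped by their `subject` value.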