Marathi Subset: IndicGLUE is a natural language understanding benchmark for 11 Indian languages, including Marathi. It covers tasks such as news categorisation, headline prediction, and paraphrase detection.
```python
from datasets import load_dataset

# Load the Marathi (mr) configuration of the WNLI task
ds = load_dataset('ai4bharat/indic_glue', 'wnli.mr')
print('Tasks available for Marathi (mr):')
print(f'WNLI samples: {len(ds["train"])}')
```

| Field | Type | Description |
|---|---|---|
| task | string | NLU task name |
| text | string | Input text for the task |
| label | string | Task-specific label |
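
The loading snippet selects one configuration, `wnli.mr`, whose name appears to follow a `<task>.<language>` pattern. As a minimal sketch under that assumption, the helper below filters a list of configuration names down to those for one language. The function name `configs_for_language` is hypothetical, and apart from `wnli.mr` the sample names are placeholders for illustration, not a confirmed list of what the dataset provides.

```python
def configs_for_language(config_names, lang):
    """Return the config names whose language suffix equals `lang`.

    Assumes names follow a '<task>.<language>' pattern, as in 'wnli.mr'.
    """
    suffix = f".{lang}"
    return sorted(name for name in config_names if name.endswith(suffix))


# Illustrative list only: apart from 'wnli.mr', these names are assumed.
sample_configs = ["wnli.mr", "wnli.hi", "copa.mr", "copa.en"]

marathi_configs = configs_for_language(sample_configs, "mr")
print(marathi_configs)
```

In practice the full list of names could probably be fetched with `datasets.get_dataset_config_names('ai4bharat/indic_glue')` and passed to the helper in place of `sample_configs`, assuming the installed version of the library provides that function.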