Natural Language Inference (NLI) dataset for 11 Indic languages including Marathi, created by high-quality machine translation of the English XNLI dataset. Contains premise-hypothesis pairs with entailment, contradiction, and neutral labels for evaluating Marathi language understanding.
from datasets import load_dataset
ds = load_dataset('Divyanshu/indicxnli', 'mr', split='test')
for ex in ds[:5]:
labels = ['entailment', 'neutral', 'contradiction']
print(f"P: {ex['premise'][:60]}...")
print(f"H: {ex['hypothesis'][:60]}...")
print(f"Label: {labels[ex['label']]}\n")| Field | Type | Description |
|---|---|---|
| premise | string | Premise sentence in Marathi |
| hypothesis | string | Hypothesis sentence in Marathi |
| label | int | Entailment label (0=entailment, 1=neutral, 2=contradiction) |