Marathi (mr, `mar_Deva`) portion of Meta's FLORES-200 dataset, for NLP tasks such as machine translation evaluation.
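```python
from datasets import load_dataset

# Load the Marathi (mar_Deva) devtest split of FLORES-200.
ds = load_dataset('facebook/flores', 'mar_Deva', split='devtest')

# ds[:5] returns a dict of columns, so select the first five rows instead.
for ex in ds.select(range(5)):
    print(f"[{ex['id']}] {ex['sentence'][:80]}...")
```

| Field | Type | Description |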
|---|---|---|
| sentence | string | Text sentence in Marathi (or source language) |
| id | int | Sentence identifier aligned across languages |
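
Because `id` is aligned across languages, rows from two language configurations can be paired on it. The following is a minimal sketch, not part of the dataset's own tooling: `align_by_id` is a hypothetical helper, and the rows are in-memory placeholders rather than real FLORES text, so the example runs without downloading anything.

```python
def align_by_id(src_rows, tgt_rows):
    """Pair sentences from two languages that share the same `id`.

    Hypothetical helper: each row is assumed to be a dict with
    `id` and `sentence` keys, as in the field table above.
    """
    # Index the target side by id for constant-time lookup.
    tgt_by_id = {row["id"]: row["sentence"] for row in tgt_rows}
    # Keep source order; skip ids missing from the target side.
    return [
        (row["id"], row["sentence"], tgt_by_id[row["id"]])
        for row in src_rows
        if row["id"] in tgt_by_id
    ]


# Placeholder rows standing in for dataset rows (not actual FLORES text).
marathi_rows = [
    {"id": 1, "sentence": "<Marathi sentence 1>"},
    {"id": 2, "sentence": "<Marathi sentence 2>"},
]
english_rows = [
    {"id": 2, "sentence": "<English sentence 2>"},
    {"id": 1, "sentence": "<English sentence 1>"},
]

pairs = align_by_id(marathi_rows, english_rows)
for sent_id, mr, en in pairs:
    print(f"[{sent_id}] {mr} <-> {en}")
```

With real data, the two arguments could be datasets returned by `load_dataset('facebook/flores', ...)` for `mar_Deva` and a second configuration (for example `eng_Latn`, assuming that is the English config name), since iterating a dataset yields one dict per row.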