L3Cube-MahaNews is a Marathi news dataset for NLP tasks such as news topic classification.
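
    from datasets import load_dataset

    # Load the dataset from the Hugging Face Hub.
    ds = load_dataset('l3cube-pune/MahaNews')

    # select() yields row dicts; slicing with [:5] would return a dict of columns instead.
    for ex in ds['train'].select(range(5)):
        print(f"[{ex['label']}] {ex['text'][:80]}...")

| Field | Type | Description |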
|---|---|---|
| text | string | Marathi news headline or article text |
| label | string | News topic category (one of 12 categories: politics, sports, entertainment, etc.) |
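
Since each record pairs a `text` string with a `label` string, a quick check of class balance can be useful before training. The following is a minimal sketch, assuming rows are dicts with the two fields listed in the table; `label_distribution` and `sample_rows` are hypothetical names introduced here for illustration, and the sample rows are placeholders, not real dataset entries.

```python
from collections import Counter


def label_distribution(rows):
    """Count how many examples fall under each label.

    `rows` is any iterable of dicts that have a "label" key,
    matching the schema described in the table above.
    """
    return Counter(row["label"] for row in rows)


# Hypothetical placeholder rows that mimic the schema; not real dataset entries.
sample_rows = [
    {"text": "placeholder headline 1", "label": "politics"},
    {"text": "placeholder headline 2", "label": "sports"},
    {"text": "placeholder headline 3", "label": "politics"},
]

counts = label_distribution(sample_rows)

# Print labels from most to least frequent.
for label, n in counts.most_common():
    print(f"{label}: {n}")
```

If the dataset loads with the schema described above, passing a split such as `ds['train']` in place of `sample_rows` should work the same way, because iterating over a split yields one dict per row.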