L3Cube-MahaHate is a hate speech detection dataset of Marathi tweets for NLP.
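
    from datasets import load_dataset

    ds = load_dataset('l3cube-pune/MahaHate')

    # Slicing a split (ds['train'][:5]) returns a dict of columns, not rows,
    # so select the first five rows before iterating.
    for ex in ds['train'].select(range(5)):
        print(f"[{ex['label']}] {ex['text'][:80]}...")

| Field | Type | Description |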
|---|---|---|
| text | string | Marathi tweet text |
| label | string | Hate speech class label (hate, offensive, profane, or not in the 4-class version; hate or not-hate in the binary version) |
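
Because the `label` field takes different values in the 4-class and binary versions, it can help to check the label distribution before training. The following is a minimal sketch, not part of the dataset's own tooling: `label_distribution` is a hypothetical helper, and `toy_rows` are placeholder rows that only mimic the schema in the table above.

```python
from collections import Counter


def label_distribution(examples, label_field="label"):
    """Count how often each label value occurs in an iterable of row dicts."""
    return Counter(ex[label_field] for ex in examples)


# Placeholder rows that mimic the schema above; they are NOT taken from the dataset.
toy_rows = [
    {"text": "placeholder text 1", "label": "hate"},
    {"text": "placeholder text 2", "label": "not"},
    {"text": "placeholder text 3", "label": "hate"},
]

counts = label_distribution(toy_rows)
for label, n in counts.most_common():
    print(f"{label}: {n}")
```

With the Hugging Face `datasets` library, iterating over a split yields one dict per row, so a loaded split such as `ds['train']` could presumably be passed directly as `examples`.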