Full dump of Marathi Wikipedia articles providing encyclopaedic knowledge coverage across diverse topics in Marathi language
from datasets import load_dataset
ds = load_dataset('wikimedia/wikipedia', '20231101.mr', split='train')
print(f"Total articles: {len(ds)}")
for ex in ds[:5]:
print(f"Title: {ex['title']}, Length: {len(ex['text'])} chars")| Field | Type | Description |
|---|---|---|
| text | string | Marathi Wikipedia article content |
| title | string | Article title in Marathi |