L3Cube-MahaNews

L3Cube-MahaNews

MH Specific

L3Cube-MahaNews dataset for language nlp.

Build a Marathi news aggregation and auto-tagging service that categorizes articles by topic for personalized feeds.
HomepageHuggingFace

Quick Start

from datasets import load_dataset
ds = load_dataset('l3cube-pune/MahaNews')
for ex in ds['train'][:5]:
    print(f"[{ex['label']}] {ex['text'][:80]}...")
Modality
text
Size
1.05 lakh records across 12 news categories
License
Format
CSV
Language
mr
Update Frequency
static
Organization
L3Cube, Pune

Schema

FieldTypeDescription
textstringMarathi news headline or article text
labelstringNews topic category (one of 12 categories: politics, sports, entertainment, etc.)

Build With This

Create a breaking-news detection system that classifies and prioritizes incoming Marathi news by topic and urgency
Develop a personalized Marathi news digest app that learns user preferences and recommends stories from their favourite categories
Build a media monitoring service for Maharashtra government officials that tracks coverage by topic across Marathi publications

AI Use Cases

News topic classification
Last verified: 2026-03-07