AI4Bharat IndicHeadlineGeneration (mr)

MH Specific

The Marathi (mr) subset of the AI4Bharat IndicHeadlineGeneration dataset, which pairs news articles with their headlines for NLP headline-generation work.

Build an automatic Marathi headline generator for news aggregation platforms to create concise, accurate headlines from article text.

Quick Start

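from itertools import islice

from datasets import load_dataset

# Stream the Marathi ('mr') training split so examples are fetched lazily
# instead of downloading the full split first.
ds = load_dataset('ai4bharat/IndicHeadlineGeneration', 'mr', split='train', streaming=True)

# Preview the first five examples; field names follow the Schema section.
for ex in islice(ds, 5):
    print(f"Headline: {ex['headline'][:60]}")
    print(f"Article: {ex['text'][:80]}...\n")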
Modality: text
Size: 114,042 train + 14,253 dev + 14,340 test
License: (not specified)
Format: CSV/JSON
Language: mr (Marathi)
Update Frequency: static
Organization: AI4Bharat, IIT Madras
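
The split sizes listed under Size can be turned into proportions with a few lines of arithmetic. The sketch below only uses the three counts given on this page; the names `SPLIT_SIZES` and `split_shares` are illustrative and not part of any library.

```python
# Split sizes exactly as listed on this page.
SPLIT_SIZES = {"train": 114_042, "dev": 14_253, "test": 14_340}


def split_shares(sizes):
    """Return each split's share of the total number of examples."""
    total = sum(sizes.values())
    return {name: count / total for name, count in sizes.items()}


total = sum(SPLIT_SIZES.values())
shares = split_shares(SPLIT_SIZES)

for name, share in shares.items():
    print(f"{name}: {SPLIT_SIZES[name]:,} examples ({share:.1%})")
print(f"total: {total:,} examples")
```

The counts sum to 142,635 examples, which is roughly an 80/10/10 split across train, dev, and test.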

Schema

Field | Type | Description
text | string | Full Marathi news article text
headline | string | Corresponding headline for the article
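
As a minimal sketch of how the two schema fields might be handled downstream, the snippet below wraps a record in a small typed container and rejects rows where either field is missing or blank. The class `HeadlineExample` and the helper `parse_record` are hypothetical names introduced for illustration; they are not part of the dataset or of the `datasets` library. The field names `text` and `headline` are taken from the schema listed here.

```python
from dataclasses import dataclass
from typing import Mapping, Optional


@dataclass(frozen=True)
class HeadlineExample:
    """One article/headline pair (hypothetical container, not a dataset API)."""

    text: str
    headline: str


def parse_record(record: Mapping[str, object]) -> Optional[HeadlineExample]:
    """Return a HeadlineExample, or None if either field is missing or blank."""
    text = record.get("text")
    headline = record.get("headline")
    # Both fields are expected to be strings according to the schema.
    if not isinstance(text, str) or not isinstance(headline, str):
        return None
    text, headline = text.strip(), headline.strip()
    if not text or not headline:
        return None
    return HeadlineExample(text=text, headline=headline)
```

Returning `None` for bad rows, rather than raising, makes it easy to filter a streamed split without stopping at the first malformed example.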

Build With This

Create a Marathi clickbait detector that compares generated headlines against article content to identify misleading titles
Develop an SEO-optimized headline suggester for Marathi news publishers that maximizes search visibility
Build a multilingual headline comparison tool that generates headlines in both Marathi and English for bilingual news platforms
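
For the clickbait-detector idea, one very rough starting point is to measure how much of a headline's wording is actually supported by the article. The sketch below computes the fraction of headline tokens that also appear in the article text. The function names `tokenize` and `headline_support` are hypothetical, and the approach is an assumption for illustration only: exact token matching will miss inflected Marathi word forms and paraphrases, so a real detector would likely need stemming or a learned model, and any decision threshold would have to be tuned on labeled data.

```python
import string
from typing import List

# Punctuation stripped from token edges: ASCII punctuation plus the
# Devanagari danda and double danda, and common curly quotes.
_EDGE_PUNCT = string.punctuation + "।॥“”‘’"


def tokenize(text: str) -> List[str]:
    """Split on whitespace and trim punctuation from each token.

    Whitespace splitting is used instead of the regex \\w class, because
    \\w does not match Devanagari vowel signs and would break words apart.
    """
    tokens = (tok.strip(_EDGE_PUNCT) for tok in text.split())
    return [tok for tok in tokens if tok]


def headline_support(headline: str, article: str) -> float:
    """Fraction of headline tokens that also occur in the article (0.0 to 1.0)."""
    head_tokens = tokenize(headline)
    if not head_tokens:
        return 0.0
    article_vocab = set(tokenize(article))
    matched = sum(1 for tok in head_tokens if tok in article_vocab)
    return matched / len(head_tokens)
```

A low score suggests the headline mentions things the article does not, which may be worth flagging for review; a high score says nothing about tone or exaggeration.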

AI Use Cases

Headline generation
Last verified: 2026-03-07