AI4Bharat IndicSentenceSummarization (mr)

AI4Bharat IndicSentenceSummarization (mr)

MH Specific

AI4Bharat IndicSentenceSummarization (mr) dataset for language nlp.

Build a Marathi news digest app that summarizes long articles into single sentences for quick mobile reading.
HomepageHuggingFace

Quick Start

from datasets import load_dataset
ds = load_dataset('ai4bharat/IndicSentenceSummarization', 'mr', split='train', streaming=True)
for i, ex in enumerate(ds):
    print(f"Original: {ex['text'][:80]}...")
    print(f"Summary: {ex['summary'][:80]}...\n")
    if i >= 4: break
Modality
text
Size
27,066 train + 3,249 dev + 3,309 test
License
Format
CSV/JSON
Language
mr
Update Frequency
static
Organization
AI4Bharat, IIT Madras

Schema

FieldTypeDescription
textstringSource Marathi text to be summarized
summarystringCondensed summary of the text

Build With This

Create a meeting minutes summarizer for Maharashtra government departments that condenses lengthy discussions into key points
Develop a legal document abstracting service that produces one-line summaries of Maharashtra High Court judgments
Build a social media post generator that summarizes Marathi news articles into tweet-length updates

AI Use Cases

Text summarization
Last verified: 2026-03-07