AI4Bharat IndicQA

AI4Bharat IndicQA

MH Specific

Marathi — Expert-generated reading comprehension dataset for 11 Indic languages including Marathi, with context-question-answer triples for extractive QA tasks

Build a Marathi government scheme FAQ bot using RAG
HomepageHuggingFace

Quick Start

from datasets import load_dataset
ds = load_dataset("ai4bharat/IndicQA", "indicqa.mr")
print(ds["test"][0]["question"])
print(ds["test"][0]["answers"])
Modality
Text (Marathi)
Size
Multi-lingual, Marathi subset
License
Format
CSV/JSON
Language
mr
Update Frequency
static
Organization
AI4Bharat, IIT Madras

Schema

FieldTypeDescription
contextstringMarathi passage providing the information source
questionstringQuestion in Marathi about the context
answersobjectObject with 'text' (list of answer strings) and 'answer_start' (character offsets)
idstringUnique question-answer pair identifier

Build With This

RAG-powered Marathi chatbot that answers questions about government schemes
Marathi reading comprehension tutor for school students
Automated Marathi customer support for insurance or banking

AI Use Cases

Question answering fine-tuningreading comprehension evaluationRAG answer extraction
Last verified: 2026-03-07