Comprehensive Indian legal NLP benchmark with 8 tasks in Marathi including Legal NER, Rhetorical Role Prediction, Court Judgment Prediction, Bail Prediction, Legal Statute Identification, Prior Case Retrieval, Summarization, and Legal Machine Translation. Far more structured than raw court document text.
# Access from https://github.com/Legal-NLP-EkStep
import json
with open('legal_marathi_ner.json') as f:
data = json.load(f)
print(f"Total documents: {len(data)}")
for doc in data[:3]:
print(f"Text: {str(doc)[:80]}...")| Field | Type | Description |
|---|---|---|
| text | string | Legal text passage in Marathi or English |
| task_type | string | NLP task type (NER, classification, summarization) |
| label | string | Task-specific annotation or label |