# Bharat Scene Text Dataset

A large-scale scene text dataset covering 11 Indian languages plus English, sourced from Wikimedia images of Indian signboards and street scenes. It includes 5,113 Marathi word annotations with polygon bounding boxes.
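
Load the annotation file and inspect the first few images:

```python
import json

# Load the annotation file (UTF-8, since the labels include Indic scripts)
with open('bstd_annotations.json', encoding='utf-8') as f:
    data = json.load(f)

print(f'Total images: {len(data)}')

# Show the number of annotated text regions for the first three images
for img in list(data.values())[:3]:
    print(f"Text regions: {len(img.get('annotations', []))}")
```

| Field | Type | Description |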
|---|---|---|
| image | image | Street scene image containing text |
| bounding_boxes | list[object] | Text region bounding box coordinates |
| text | string | Recognized text content |
| script | string | Script type (Devanagari, Latin, etc.) |
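
The fields above can be explored with a few lines of Python. The following is a minimal sketch, not the dataset's official tooling. It assumes each record is a dict with the `text`, `script`, and `bounding_boxes` fields from the table, and that each entry in `bounding_boxes` is a polygon given as a list of `[x, y]` points; that layout is an assumption and may differ from the actual files. The helper names `polygon_to_rect` and `count_by_script` and the `sample` records are hypothetical, for illustration only.

```python
from collections import Counter


def polygon_to_rect(polygon):
    """Return (x_min, y_min, x_max, y_max) enclosing a polygon of [x, y] points."""
    xs = [point[0] for point in polygon]
    ys = [point[1] for point in polygon]
    return min(xs), min(ys), max(xs), max(ys)


def count_by_script(records):
    """Count how many records carry each value of the `script` field."""
    return Counter(record["script"] for record in records)


# Hypothetical records following the assumed layout (not real dataset entries)
sample = [
    {
        "text": "मुंबई",
        "script": "Devanagari",
        "bounding_boxes": [[[10, 20], [110, 22], [108, 60], [9, 58]]],
    },
    {
        "text": "Mumbai",
        "script": "Latin",
        "bounding_boxes": [[[10, 70], [120, 70], [120, 100], [10, 100]]],
    },
]

script_counts = count_by_script(sample)

# Axis-aligned rectangles are often easier to crop with than raw polygons
rects = [
    polygon_to_rect(polygon)
    for record in sample
    for polygon in record["bounding_boxes"]
]

print(script_counts)
print(rects)
```

Converting polygons to axis-aligned rectangles loses the orientation of slanted text, so keep the original polygon when training detectors that support it.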