Indic Scene Text Recognition dataset covering 12 major Indian languages including Marathi. Word images collected from natural scenes such as signboards, shop nameplates, railway stations, advertisements, and banners
# IndicSTR12 dataset
import json
print('IndicSTR12: Scene text recognition for 12 Indian scripts')
print('Includes Devanagari for Marathi text recognition')| Field | Type | Description |
|---|---|---|
| image | image | Scene text image in Indian script |
| text | string | Ground truth text transcription |
| script | string | Script type (Devanagari, etc.) |