Collection of ~12K Marathi word images with corresponding UTF-8 text labels, sourced from 12 Marathi books across various genres. Images are binarized, thresholded, and resized to 96 dpi for direct neural network input
# Marathi OCR Dataset
import cv2
print('Marathi OCR Dataset')
print('Printed Devanagari text images with ground truth')| Field | Type | Description |
|---|---|---|
| image | image | Document image containing Marathi text |
| text | string | OCR ground truth text |