Evaluation results and benchmark scores for MahaBERT (L3Cube) and IndicBERT (AI4Bharat) models on Marathi NLU tasks including sentiment, NER, and text classification
from transformers import AutoTokenizer, AutoModel
tokenizer = AutoTokenizer.from_pretrained('l3cube-pune/marathi-bert-v2')
model = AutoModel.from_pretrained('l3cube-pune/marathi-bert-v2')
inputs = tokenizer('मराठी भाषा', return_tensors='pt')
print(f'Output shape: {model(**inputs).last_hidden_state.shape}')| Field | Type | Description |
|---|---|---|
| model | string | Model name being evaluated |
| task | string | Benchmark task name |
| metric | string | Evaluation metric (F1, accuracy, etc.) |
| score | float | Benchmark score |