Comprehensive Marathi NLP library that includes the MahaBERT, MahaAlBERT, and MahaRoBERTa language models, MahaFT word embeddings, and tools for tokenisation, sentiment analysis, NER, and hate speech detection.
from transformers import AutoTokenizer, AutoModel
tokenizer = AutoTokenizer.from_pretrained('l3cube-pune/marathi-roberta')
model = AutoModel.from_pretrained('l3cube-pune/marathi-roberta')
inputs = tokenizer('मराठी भाषा प्रक्रिया', return_tensors='pt')
outputs = model(**inputs)
print(f"Embedding shape: {outputs.last_hidden_state.shape}")

| Field | Type | Description |
|---|---|---|
| model_name | string | Name of the pre-trained Marathi NLP model |
| task | string | NLP task (classification, NER, embedding, etc.) |
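
The two fields in the table can be carried around as a small typed record. The sketch below is only an illustration under stated assumptions: the `ModelRequest` class, the `KNOWN_TASKS` set, and the `is_known_task` helper are hypothetical names that are not part of the library, and the task labels are simply the examples listed in the table (the "etc." there means the real set may be larger).

```python
from dataclasses import dataclass

# Hypothetical task labels, copied from the examples in the table above.
# The table ends with "etc.", so this set is not exhaustive.
KNOWN_TASKS = {"classification", "ner", "embedding"}


@dataclass(frozen=True)
class ModelRequest:
    """Hypothetical container for the `model_name` and `task` fields."""

    model_name: str  # name of the pre-trained Marathi NLP model
    task: str        # NLP task the model is used for

    def __post_init__(self):
        # Both fields are documented as strings; reject empty or non-string values.
        for field_name in ("model_name", "task"):
            value = getattr(self, field_name)
            if not isinstance(value, str) or not value.strip():
                raise ValueError(f"{field_name} must be a non-empty string")

    def is_known_task(self) -> bool:
        # Case-insensitive check against the example tasks from the table.
        return self.task.lower() in KNOWN_TASKS


# Example: describe the model used in the snippet above as an embedding request.
request = ModelRequest(model_name="l3cube-pune/marathi-roberta", task="embedding")
print(request.model_name, request.task, request.is_known_task())
```

Keeping the record frozen means a request cannot be changed after validation, and a task outside the example set is flagged by `is_known_task` rather than rejected, since the table does not give a complete list.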