Instructions to use ninadp/marathi-mitra-phi3-v3 with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- PEFT
How to use ninadp/marathi-mitra-phi3-v3 with PEFT:
from peft import PeftModel from transformers import AutoModelForCausalLM base_model = AutoModelForCausalLM.from_pretrained("microsoft/Phi-3-mini-4k-instruct") model = PeftModel.from_pretrained(base_model, "ninadp/marathi-mitra-phi3-v3") - Notebooks
- Google Colab
- Kaggle
๐ธ Marathi Mitra โ v3 (Optuna HPO)
Fine-tuned Phi-3 Mini using Optuna automated HPO (20 trials, A100).
Model Description
| Property | Value |
|---|---|
| Base Model | microsoft/Phi-3-mini-4k-instruct |
| Fine-tuning | QLoRA + Optuna HPO |
| LoRA Rank | r=64, alpha=128 |
| Training Examples | 250 |
| Optuna Trials | 20 (TPE sampler, A100) |
| Optimized For | Unseen word generalisation |
Performance
| Words | Score |
|---|---|
| Seen words | 76.0% |
| Unseen words | 82.0% |
| Overall | 79.0% |
| Generalisation gap | -6.0% (unseen > seen) |
Key Finding
Optuna was configured to maximize unseen word score. This produced a negative generalisation gap (-6%) where the model performs better on words it never saw during training.
However overall score (79.0%) is lower than v2 (89.4%), demonstrating metric-objective misalignment โ optimizing for a single metric (unseen) hurt the overall performance.
Lesson: HPO objective should be (seen + unseen) / 2
not just unseen score alone.
Best Config Found by Optuna
| Parameter | Value |
|---|---|
| Learning rate | 2.36e-4 |
| Epochs | 32 |
| LoRA rank | 64 |
| LoRA alpha | 128 |
| Quantization | 4-bit |
Recommended Version
For production use, v2 achieves higher overall score (89.4% vs 79.0%). v3 is useful as a research artifact demonstrating generalisation vs accuracy trade-offs in HPO.
Links
- Downloads last month
- 39
Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐ Ask for provider support
Model tree for ninadp/marathi-mitra-phi3-v3
Base model
microsoft/Phi-3-mini-4k-instruct