๐ŸŒธ Marathi Mitra โ€” v3 (Optuna HPO)

Fine-tuned Phi-3 Mini using Optuna automated HPO (20 trials, A100).

Model Description

Property Value
Base Model microsoft/Phi-3-mini-4k-instruct
Fine-tuning QLoRA + Optuna HPO
LoRA Rank r=64, alpha=128
Training Examples 250
Optuna Trials 20 (TPE sampler, A100)
Optimized For Unseen word generalisation

Performance

Words Score
Seen words 76.0%
Unseen words 82.0%
Overall 79.0%
Generalisation gap -6.0% (unseen > seen)

Key Finding

Optuna was configured to maximize unseen word score. This produced a negative generalisation gap (-6%) where the model performs better on words it never saw during training.

However overall score (79.0%) is lower than v2 (89.4%), demonstrating metric-objective misalignment โ€” optimizing for a single metric (unseen) hurt the overall performance.

Lesson: HPO objective should be (seen + unseen) / 2 not just unseen score alone.

Best Config Found by Optuna

Parameter Value
Learning rate 2.36e-4
Epochs 32
LoRA rank 64
LoRA alpha 128
Quantization 4-bit

Recommended Version

For production use, v2 achieves higher overall score (89.4% vs 79.0%). v3 is useful as a research artifact demonstrating generalisation vs accuracy trade-offs in HPO.

Links

Downloads last month
39
Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support

Model tree for ninadp/marathi-mitra-phi3-v3

Adapter
(844)
this model