mozilla-foundation/common_voice_17_0
Updated • 5.61k • 15
How to use somu9/whisper-small-alb with Transformers:
# Use a pipeline as a high-level helper
from transformers import pipeline
pipe = pipeline("automatic-speech-recognition", model="somu9/whisper-small-alb") # Load model directly
from transformers import AutoProcessor, AutoModelForSpeechSeq2Seq
processor = AutoProcessor.from_pretrained("somu9/whisper-small-alb")
model = AutoModelForSpeechSeq2Seq.from_pretrained("somu9/whisper-small-alb")This model is a fine-tuned version of openai/whisper-small on the Common Voice 17.0 dataset. It achieves the following results on the evaluation set:
This is a speech to text model finetuned over Whisper model by OpenAI.
This is free to use for learning or commercial purposes. I don't plan to monetize this ever or make it private. My goal is to make whisper more localized which is why i have this trained this model and made it public for everyone.
This model is trained on common_voice_17 dataset. It is an open source multilingual dataset.
The following hyperparameters were used during training:
| Training Loss | Epoch | Step | Validation Loss | Wer |
|---|---|---|---|---|
| 0.005 | 15.1515 | 1000 | 0.9955 | 53.7437 |
| 0.0003 | 30.3030 | 2000 | 1.1066 | 52.5698 |
| 0.0001 | 45.4545 | 3000 | 1.1585 | 52.8553 |
| 0.0001 | 60.6061 | 4000 | 1.1889 | 52.7284 |
| 0.0001 | 75.7576 | 5000 | 1.2013 | 52.6332 |
Base model
openai/whisper-small