mozilla-foundation/common_voice_17_0
Updated • 5.93k • 18
How to use emason/whisper-base-sw with Transformers:
# Use a pipeline as a high-level helper
from transformers import pipeline
pipe = pipeline("automatic-speech-recognition", model="emason/whisper-base-sw") # Load model directly
from transformers import AutoProcessor, AutoModelForSpeechSeq2Seq
processor = AutoProcessor.from_pretrained("emason/whisper-base-sw")
model = AutoModelForSpeechSeq2Seq.from_pretrained("emason/whisper-base-sw")This model is a fine-tuned version of openai/whisper-base on the Common Voice 17.0 dataset. It achieves the following results on the evaluation set:
More information needed
More information needed
More information needed
The following hyperparameters were used during training:
| Training Loss | Epoch | Step | Validation Loss | Wer |
|---|---|---|---|---|
| 0.5121 | 1.0893 | 1000 | 0.7500 | 50.1768 |
| 0.3802 | 2.1786 | 2000 | 0.6447 | 43.3016 |
| 0.31 | 3.2680 | 3000 | 0.6139 | 39.4864 |
| 0.2768 | 4.3573 | 4000 | 0.6029 | 39.2423 |
| 0.266 | 5.4466 | 5000 | 0.6033 | 39.3269 |
Base model
openai/whisper-base