No punctuation

by Sogl-coder - opened Sep 16, 2023

Sep 16, 2023

Compared to the original large-v2 (or just large), the output has no punctuation, proper names are written in lower case, and some words contain artifacts.

Example:

mitchelldehaven

Owner Sep 16, 2023

Yes, this is expected. This model was trained on a Russian dataset that I had access to, which had been preprocessed with a particular focus in mind. Thus, if I recall correctly, all punctuation is removed and all words are lower-cased. I'm not sure about the artifacts in words, however.

mitchelldehaven changed discussion status to closed Sep 16, 2023

diimdeep

Oct 28, 2023

effort - 🏆
result - 💩

nikich340

Nov 21, 2023

So original whisper is just better lol..

mitchelldehaven

Owner Nov 21, 2023

If you need case and punctuation, then yes, you should use the original v2 model or the new v3 model.

In uncased, unpunctuated contexts, this model will likely have a lower word error rate (WER) than the original v2 model, particularly in noisy environments. I'm unsure about the v3 model, as I haven't tested it for Russian, but I assume v3 would be better, as it improved substantially on non-English languages.
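For anyone who wants to check this on their own audio, here is a minimal sketch of an uncased, unpunctuated WER comparison. It is not the evaluation script used for this model; the `normalize`, `edit_distance`, and `wer` helpers are hypothetical names written for illustration, and the normalization (lower-casing, replacing punctuation with spaces) is only an assumption about what "un-cased and non-punctuation" means here.

```python
import re


def normalize(text: str) -> list[str]:
    """Lower-case the text, replace punctuation with spaces, and split into words."""
    text = text.lower()
    # \w is Unicode-aware in Python 3, so Cyrillic letters are preserved.
    text = re.sub(r"[^\w\s]", " ", text)
    return text.split()


def edit_distance(ref: list[str], hyp: list[str]) -> int:
    """Word-level Levenshtein distance (substitutions, insertions, deletions)."""
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # match or substitution
        prev = curr
    return prev[-1]


def wer(reference: str, hypothesis: str, normalized: bool = True) -> float:
    """Word error rate: edit distance divided by the number of reference words."""
    ref = normalize(reference) if normalized else reference.split()
    hyp = normalize(hypothesis) if normalized else hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")
    return edit_distance(ref, hyp) / len(ref)


# Illustrative strings only, not real model output.
reference = "Привет, Москва! Как дела?"
hypothesis = "привет москва как дела"

print(wer(reference, hypothesis))                    # scored after normalization
print(wer(reference, hypothesis, normalized=False))  # scored on the raw strings
```

Scoring the raw strings penalizes every word that differs only in case or attached punctuation, so a model that outputs lower-cased, unpunctuated text is only compared fairly against cased references once both sides are normalized.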

LSMSK

Dec 27, 2023

edited Dec 27, 2023

Can you fine-tune version 3 for Russian?

mitchelldehaven

Owner Mar 12, 2024

Unfortunately, I cannot; I no longer have access to the compute resources I used for this.
