Instructions to use OpenVoiceOS/parakeet-tdt-ctc-110m-coreml with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- NeMo
How to use OpenVoiceOS/parakeet-tdt-ctc-110m-coreml with NeMo:
import nemo.collections.asr as nemo_asr asr_model = nemo_asr.models.ASRModel.from_pretrained("OpenVoiceOS/parakeet-tdt-ctc-110m-coreml") transcriptions = asr_model.transcribe(["file.wav"]) - Notebooks
- Google Colab
- Kaggle
parakeet-tdt-ctc-110m-coreml
CoreML conversion of nvidia/parakeet-tdt_ctc-110m.
| Architecture | TDT |
| Language | English |
| Sample rate | 16000 Hz |
| Max audio | 15.0s |
| Vocab size | 1024 |
| Framework | NVIDIA NeMo → CoreML (coremltools) |
Components
| File | Component | Best compute |
|---|---|---|
parakeet_mel_encoder.mlpackage |
mel_encoder | ANE / GPU |
parakeet_ctc_decoder.mlpackage |
ctc_decoder | ANE / GPU |
parakeet_decoder.mlpackage |
decoder | CPU only |
parakeet_joint_decision_single_step.mlpackage |
joint_decision_single_step | ANE / GPU |
Usage
pip install ovos-stt-plugin-coreml
from ovos_stt_plugin_coreml import CoremlSTT
stt = CoremlSTT(config={"metadata": "metadata.json"})
Source model
- Downloads last month
- -
Model tree for OpenVoiceOS/parakeet-tdt-ctc-110m-coreml
Base model
nvidia/parakeet-tdt_ctc-110m