Coqui STT (🐸STT) is an open-source deep-learning toolkit for training and deploying speech-to-text models.
🐸STT is battle tested in both production and research 🚀
The fastest way to deploy a pre-trained 🐸STT model is with pip with Python 3.6, 3.7, 3.8 or 3.9:
# Create a virtual environment $ python3 -m venv venv-stt $ source venv-stt/bin/activate # Install 🐸STT $ python -m pip install -U pip $ python -m pip install stt # Download 🐸's pre-trained English models $ curl -LO https://github.com/coqui-ai/STT/releases/download/v0.9.3/coqui-stt-0.9.3-models.tflite $ curl -LO https://github.com/coqui-ai/STT/releases/download/v0.9.3/coqui-stt-0.9.3-models.scorer # Download some example audio files $ curl -LO https://github.com/coqui-ai/STT/releases/download/v0.9.3/audio-0.9.3.tar.gz $ tar -xvf audio-0.9.3.tar.gz # Transcribe an audio file $ stt --model coqui-stt-0.9.3-models.tflite --scorer coqui-stt-0.9.3-models.scorer --audio audio/2830-3980-0043.wav
There are several ways to contact us or to get help:
GitHub Discussions - GitHub Discussions is the first place to look. Search for keywords related to your question or problem to see if someone else has run into it already. If you can’t find anything relevant there, search on our issue tracker to see if there is an existing issue about your problem.
Create a new issue - Finally, if you have a bug report or a feature request that isn’t already covered by an existing issue, please open an issue in our repo and fill the appropriate information on your hardware and software setup.