
Coqui STT (🐸STT) is an open-source deep-learning toolkit for training and deploying speech-to-text models.

🐸STT is battle-tested in both production and research 🚀

Quickstart: Deployment

The fastest way to deploy a pre-trained 🐸STT model is with pip, using Python 3.6, 3.7, 3.8, or 3.9:

# Create a virtual environment
$ python3 -m venv venv-stt
$ source venv-stt/bin/activate

# Install 🐸STT
$ python -m pip install -U pip
$ python -m pip install stt

# Download 🐸's pre-trained English models
$ curl -LO
$ curl -LO

# Download some example audio files
$ curl -LO
$ tar -xvf audio-0.9.3.tar.gz

# Transcribe an audio file
$ stt --model coqui-stt-0.9.3-models.tflite --scorer coqui-stt-0.9.3-models.scorer --audio audio/2830-3980-0043.wav
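
The same transcription can also be done from Python instead of the command line. The following is a minimal sketch, not a definitive implementation: it assumes the `stt` package installed above exposes `Model` with `enableExternalScorer`, `sampleRate`, and `stt` methods, that `numpy` is installed alongside it, and that the audio is mono 16-bit PCM. The helper names `read_wav_pcm16` and `transcribe` are hypothetical and chosen only for illustration.

```python
import wave


def read_wav_pcm16(path):
    """Return (sample_rate, raw_frames) for a mono 16-bit PCM WAV file."""
    with wave.open(path, "rb") as wav:
        if wav.getsampwidth() != 2:
            raise ValueError("expected 16-bit PCM audio")
        if wav.getnchannels() != 1:
            raise ValueError("expected mono audio")
        sample_rate = wav.getframerate()
        frames = wav.readframes(wav.getnframes())
    return sample_rate, frames


def transcribe(model_path, scorer_path, audio_path):
    """Transcribe one WAV file, mirroring the `stt` command above."""
    # Imported lazily so the WAV helper works without these packages.
    # Assumption: the `stt` package provides `Model` with these methods.
    import numpy as np
    from stt import Model

    model = Model(model_path)
    model.enableExternalScorer(scorer_path)

    sample_rate, frames = read_wav_pcm16(audio_path)
    if sample_rate != model.sampleRate():
        raise ValueError(
            f"audio is {sample_rate} Hz, model expects {model.sampleRate()} Hz"
        )

    # The model consumes the samples as an array of 16-bit integers.
    audio = np.frombuffer(frames, dtype=np.int16)
    return model.stt(audio)
```

Under these assumptions, calling `transcribe("coqui-stt-0.9.3-models.tflite", "coqui-stt-0.9.3-models.scorer", "audio/2830-3980-0043.wav")` should return the transcript as a string. The sample-rate check is a design choice of this sketch: it fails early rather than resampling the audio.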

Contact/Getting Help

There are several ways to contact us or to get help:

  1. GitHub Discussions - GitHub Discussions is the first place to look. Search for keywords related to your question or problem to see if someone else has run into it already. If you can’t find anything relevant there, search on our issue tracker to see if there is an existing issue about your problem.

  2. Matrix chat - If your question is not addressed on GitHub Discussions, you can contact us in our channel on Matrix.

  3. Create a new issue - Finally, if you have a bug report or a feature request that isn’t already covered by an existing issue, please open an issue in our repo and fill in the appropriate information about your hardware and software setup.

STT Playbook
