Coqui STT (🐸STT) is an open-source deep-learning toolkit for training and deploying speech-to-text models.

🐸STT is battle tested in both production and research 🚀

Quick Reference

Quickstart¶

The fastest way to use a pre-trained 🐸STT model is with the 🐸STT model manager, a tool that lets you quickly test and demo models locally. You’ll need Python 3.6, 3.7, 3.8 or 3.9:

# Create a virtual environment
$ python3 -m venv venv-stt
$ source venv-stt/bin/activate

# Install 🐸STT model manager
$ python -m pip install -U pip
$ python -m pip install coqui-stt-model-manager

# Run the model manager. A browser tab will open and you can then download and test models from the Model Zoo.
$ stt-model-manager

API Reference

Examples

Language Model

How to Train a Language Model

Contact/Getting Help¶

There are several ways to contact us or to get help:

GitHub Discussions - GitHub Discussions is the first place to look. Search for keywords related to your question or problem to see if someone else has run into it already. If you can’t find anything relevant there, search on our issue tracker to see if there is an existing issue about your problem.
Matrix chat - If your question is not addressed on GitHub Discussions, you can contact us on the #stt:matrix.org channel on Matrix.
Create a new issue - Finally, if you have a bug report or a feature request that isn’t already covered by an existing issue, please open an issue in our repo and fill the appropriate information on your hardware and software setup.

STT Playbook

Coqui STT Playbook

Advanced topics

Quickstart¶

Contact/Getting Help¶

Indices and tables¶