[!NOTE] This project was previously named faster-whisper-server. I've decided to change the name from faster-whisper-server, as the project has evolved to support more than just ASR.
speaches is an OpenAI API-compatible server supporting streaming transcription, translation, and speech generation. Speech-to-Text is powered by faster-whisper, and Text-to-Speech uses piper and Kokoro. This project aims to be Ollama, but for TTS/STT models.
See the documentation for installation instructions and usage: speaches.ai
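Since the server is OpenAI API-compatible, a transcription call should look like a regular OpenAI `POST /v1/audio/transcriptions` multipart request. The sketch below only builds such a request with the Python standard library and does not send it. The base URL `http://localhost:8000`, the model id `Systran/faster-whisper-small`, and the helper names are assumptions for illustration, not values taken from this project; check the documentation for the actual defaults.

```python
import io
import urllib.request
import uuid
import wave


def make_silent_wav(seconds: float = 0.1, rate: int = 16000) -> bytes:
    """Return a short silent mono 16-bit WAV, used only as placeholder audio."""
    buf = io.BytesIO()
    with wave.open(buf, "wb") as w:
        w.setnchannels(1)
        w.setsampwidth(2)
        w.setframerate(rate)
        w.writeframes(b"\x00\x00" * int(seconds * rate))
    return buf.getvalue()


def build_transcription_request(
    base_url: str, audio: bytes, model: str, filename: str = "audio.wav"
) -> urllib.request.Request:
    """Build (but do not send) an OpenAI-style multipart transcription request.

    Hypothetical helper: the `file` and `model` form fields follow the
    OpenAI audio transcription API, which this server is stated to mirror.
    """
    boundary = uuid.uuid4().hex
    model_part = (
        f"--{boundary}\r\n"
        'Content-Disposition: form-data; name="model"\r\n\r\n'
        f"{model}\r\n"
    ).encode()
    file_part = (
        f"--{boundary}\r\n"
        f'Content-Disposition: form-data; name="file"; filename="{filename}"\r\n'
        "Content-Type: audio/wav\r\n\r\n"
    ).encode() + audio + b"\r\n"
    closing = f"--{boundary}--\r\n".encode()
    return urllib.request.Request(
        url=base_url.rstrip("/") + "/v1/audio/transcriptions",
        data=model_part + file_part + closing,
        headers={"Content-Type": f"multipart/form-data; boundary={boundary}"},
        method="POST",
    )


# Assumed local address and model id; replace with your own deployment values.
req = build_transcription_request(
    "http://localhost:8000", make_silent_wav(), "Systran/faster-whisper-small"
)
print(req.get_method(), req.full_url)

# To send it against a running server (not executed here):
# import json
# with urllib.request.urlopen(req) as resp:
#     print(json.load(resp))  # OpenAI-style JSON, typically with a "text" field
```

Building the request separately from sending it keeps the example runnable without a server; any OpenAI-compatible client pointed at the same base URL should produce an equivalent call.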
Text-to-Speech via kokoro (ranked #1 in the TTS Arena) and piper models.

Please create an issue if you find a bug, have a question, or a feature suggestion.
https://github.com/user-attachments/assets/457a736d-4c29-4b43-984b-05cc4d9995bc
(Excuse the breathing lol. Didn't have enough time to record a better demo)
TODO
https://github.com/user-attachments/assets/0021acd9-f480-4bc3-904d-831f54c4d45b
$ claude mcp add speaches \
-- python -m otcore.mcp_server <graph>