![]() |
![]() |
![]() |
|---|---|---|
![]() |
![]() |
![]() |
|---|---|---|
![]() |
![]() |
![]() |
|---|---|---|
| Text-to-speech | Audio/Music Generation | Audio Conversion/Tools |
|---|---|---|
| Bark | MusicGen | RVC |
| Tortoise | MAGNeT | Demucs |
| Maha TTS | Stable Audio | Vocos |
| MMS | Riffusion* | Whisper |
| Vall-E X | AudioCraft Mac* | AP BWE |
| StyleTTS2 | AudioCraft Plus* | Resemble Enhance |
| SeamlessM4T | ACE-Step* | Audio Separator |
| XTTSv2* | Song Bloom* | PyRNNoise* |
| MARS5* | MiMo Audio* | |
| F5-TTS* | ||
| Parler TTS* | ||
| OpenVoice* | ||
| OpenVoice V2* | ||
| Kokoro TTS* | ||
| DIA* | ||
| CosyVoice* | ||
| GPT-SoVITS* | ||
| Piper TTS* | ||
| Kimi Audio 7B Instruct* | ||
| Chatterbox* | ||
| VibeVoice* | ||
| Kitten TTS* | ||
| Index-TTS2* | ||
| VoxCPM* | ||
| FireRedTTS2* | ||
| MegaTTS3* | ||
| MiniMax Cloud TTS (built-in) |
* These models are not installed by default, instead they are available as extensions.
TTS WebUI Ignition is the new installer and launcher for TTS WebUI.
On Windows:
winget install TTS-WebUI.Ignition
For other platforms, download the latest release from the releases page or build it from source using the instructions in the repository.
Current base installation size is around 10.7 GB. Each model will require 2-8 GB of space in addition.
Prerequisites: * git * Python 3.10 or 3.11 (3.12 not supported yet) * PyTorch * ffmpeg (with vorbis support) * (Optional) NodeJS 22.9.0 for React UI * SQLite (bundled with Python) for database support
bash
git clone https://github.com/rsxdalv/tts-webui.git
cd tts-webuiInstall required packages:
bash
pip install -r requirements.txt
Run the server:
bash
python server.py --no-react
For React UI:
bash
cd react-ui
npm install
npm run build
cd ..
python server.py
For detailed manual installation instructions, please refer to the Manual Installation Guide.
tts-webui can also be ran inside of a Docker container. Using CUDA inside of docker requires NVIDIA Container Toolkit. To get started, pull the image from GitHub Container Registry:
docker pull ghcr.io/rsxdalv/tts-webui:main
Once the image has been pulled it can be started with Docker Compose: The ports are 7770 (env:TTS_PORT) for the Gradio backend and 3000 (env:UI_PORT) for the React front end.
docker compose up -d
The container will take some time to generate the first output while models are downloaded in the background. The status of this download can be verified by checking the container logs:
docker logs tts-webui
If you wish to build your own docker container, you can use the included Dockerfile:
docker build -t tts-webui .
Please note that the docker-compose needs to be edited to use the image you just built.
May: * Add virtual environment system (uv), preventing extension conflicts.
April: * Add torchcodec CPU to requirements * Upgrade PyTorch to 2.11.0 * Update pin TorchAudio
$ claude mcp add TTS-WebUI \
-- python -m otcore.mcp_server <graph>