The best AI speech recognition, translation, and multilingual dubbing solution 🚀
<img src="https://github.com/abus-aikorea/voice-pro/raw/v3.2.0/docs/images/main_page_crop.eng.jpg?raw=true" alt="Dubbing Studio"/>
한국어
∙
English
∙
中文简体
∙
中文繁體
∙
日本語
∙
Deutsch
∙
Español
∙
Português
Voice-Pro is a state-of-the-art web app that transforms multimedia content creation. It integrates YouTube video downloading, voice separation, speech recognition, translation, and text-to-speech into a single, powerful tool for creators, researchers, and multilingual professionals. - 🔊 Top-tier speech recognition: Whisper, Faster-Whisper, Whisper-Timestamped, WhisperX - 🎤 Zero-shot voice cloning: F5-TTS, E2-TTS, CosyVoice - 📢 Multilingual text-to-speech: Edge-TTS, kokoro (Paid version includes Azure TTS) - 🎥 YouTube processing & audio extraction: yt-dlp - 🌍 Instant translation for 100+ languages: Deep-Translator (Paid version includes Azure Translator)
A robust alternative to ElevenLabs, Voice-Pro empowers podcasters, developers, and creators with advanced voice solutions.
installer_files folder and then running configure.bat followed by start.bat.version 3.2
Connect with people from all over the world for meaningful cultural exchanges, language learning, and international friendships.

version 3.1
English &
Chinese: SWivid/F5-TTS_v1
Finnish: AsmoKoskinen/F5-TTS_Finnish_Model
French: RASPIAUDIO/F5-French-MixedSpeakers-reduced
Hindi: SPRINGLab/F5-Hindi-24KHz
Italian: alien79/F5-TTS-italian
Japanese: Jmica/F5TTS/JA_21999120
Russian: hotstone228/F5-TTS-Russian
Spanish: jpgallegoar/F5-Spanish version 3.0
version 2.0
Dubbing Studio Tab$ claude mcp add voice-pro \
-- python -m otcore.mcp_server <graph>