Make every frame speak beautifully
SmartSub is a local-first desktop app that takes you all the way from media → subtitles → translation → proofreading → burn-in. All transcription runs locally — your files never leave your machine. It supports batch processing and GPU acceleration on Windows, macOS, and Linux.

3.0 is a near-complete rewrite. The headline changes:
whisper.cpp, faster-whisper, FunASR, Qwen3-ASR, FireRedASR, and your local Whisper CLI. For Chinese, reach straight for FunASR / FireRedASR.| Video Synthesis (Burn-in) | Subtitle Proofreading |
|---|---|
![]() |
![]() |
3.0 turns the transcription engine into a per-task choice. Manage runtimes and models from the "Engines & Models" page:
| Engine | Notes | How it runs |
|---|---|---|
| whisper.cpp (built-in) | Default engine; supports ggml quantized models and GPU acceleration | Bundled, works out of the box |
| faster-whisper | Based on CTranslate2, faster; models downloaded on demand from HuggingFace | Self-contained Python runtime (in-app download) |
| FunASR | SenseVoice (zh/en/ja/ko/yue) and Paraformer-zh; great for Chinese | Bundled sherpa-onnx native lib |
| Qwen3-ASR | Qwen speech recognition (qwen3-asr-0.6b) | Bundled sherpa-onnx native lib |
| FireRedASR | FireRedASR-AED large (zh-en); great for Chinese | Bundled sherpa-onnx native lib |
| Local Whisper CLI | Calls a whisper-compatible command you installed yourself | Uses your system command |
Note: FunASR / Qwen3-ASR / FireRedASR all run on the bundled sherpa-onnx native library with no extra setup; faster-whisper downloads a self-contained runtime inside the app.
whisper.cpp / faster-whisper use the whisper family of models. Bigger models are more accurate but slower and need more VRAM:
tiny / base — fast and lightweightsmall / base to balance accuracy and resourceslarge series for top accuracyen for English-optimized resultsq5 / q8 quantized variants for a smaller footprint at a slight accuracy costSmartSub ships with a built-in acceleration-pack manager — no need to install the CUDA Toolkit manually. After installing, open "Settings → GPU Acceleration"; the app detects your GPU and recommends a suitable option.
| Platform | Backend | Notes |
|---|---|---|
| Windows / Linux + NVIDIA | CUDA | Supports CUDA 11.8.0 / 12.2.0 / 12.4.0 / 13.0.2; download the matching pack in-app |
| Windows / Linux + AMD / Intel | Vulkan | New in 3.0 — built-in Vulkan acceleration pack |
| macOS (Apple Silicon) | Core ML / Metal | Enabled automatically with the mac arm64 build |
| Any platform | CPU | Automatic fallback when no GPU is available |
This project supports 17 translation services, including Baidu, Volcano Engine, Aliyun, Tencent, Xunfei, NiuTrans, Google, DeepLX, plus LLM/aggregation platforms such as Ollama, DeepSeek, Gemini, Qwen, SiliconFlow, Azure OpenAI, and DeerAPI. Using these services requires the appropriate API keys or configuration.
For information on obtaining API keys for services like Baidu Translation and Volcano Engine, please refer to https://bobtranslate.com/service/. We appreciate the information provided by Bob, an excellent software tool.
For AI translation, results are heavily influenced by the model and prompt, so try different combinations to find what works for you. We recommend the AI aggregation platform DeerAPI, which supports nearly 500 models across multiple platforms.
SmartSub lets you configure custom parameters for each AI translation service to precisely control model behavior:
Download the package for your system and chip. GPU acceleration packs are not chosen at download time — get them in-app after installing.
| System | Chip | Download Package | Notes |
|---|---|---|---|
| Windows | x64 | windows-x64 | NVIDIA → CUDA, AMD/Intel → Vulkan, downloaded in-app |
| Mac | Apple | mac-arm64 | Core ML / Metal acceleration enabled automatically |
| Mac | Intel | mac-x64 | CPU only, no GPU acceleration |
| Linux | x64 | linux-x64 | NVIDIA → CUDA, AMD/Intel → Vulkan, downloaded in-app |
$ claude mcp add SmartSub \
-- python -m otcore.mcp_server <graph>