Windows x64
The standard Windows build for Windows laptops and desktops. CUDA acceleration is managed inside the app when supported.
Windows 10 or Windows 11, 64-bit.
Desktop app downloads
Choose the build for your computer, then create subtitles, speech-to-text transcripts and SRT/VTT files from local video or audio. Process multiple files, select up to 99 recognition languages and export SRT, VTT, TXT, LRC or CSV. The latest release includes English subtitle output with English only and Original + English modes.
Which build should I choose?
Available downloads
Download the Windows x64, macOS Universal or Linux x64 build. Linux opens a dedicated install guide for .deb and portable .tar.gz options. CUDA is available on supported NVIDIA GPU systems, and Metal is available on supported Apple Silicon Macs.
The standard Windows build for Windows laptops and desktops. CUDA acceleration is managed inside the app when supported.
Windows 10 or Windows 11, 64-bit.
One universal macOS build for both Apple Silicon and Intel Macs. Use this single DMG instead of choosing between separate CPU architectures.
macOS 11.0+ on Apple Silicon or Intel Mac.
Choose the recommended .deb package for Ubuntu/Debian-based distros, or the portable .tar.gz archive for Fedora, Arch, Manjaro, openSUSE and other Linux distros.
Linux x64. Ubuntu, Debian, Linux Mint, Pop!_OS, Fedora, Arch, Manjaro, openSUSE and other distros.
After installation, the core workflow is simple and repeatable.
Import a video, audio or voice recording file from your computer.
Let AI recognize speech and create subtitles, transcript text or speech-to-text output.
Review the result and export SRT, VTT, TXT, LRC or CSV.
Download workflows
After choosing the right build, start with the local media task you need to finish.
Choose the Windows x64 build for Windows, the macOS Universal build for both Apple Silicon and Intel Macs, or the Linux guide for .deb and portable .tar.gz options.
Yes. Voice2Sub supports optional English subtitle output. Use English only for the English file, or Original + English for separate original and English subtitle files.
Yes. Voice2Sub includes a batch workflow for adding multiple video or audio files and generating subtitle or transcript outputs in one run.
Yes. Voice2Sub can turn local video, audio or voice recordings into transcript text and subtitle outputs for review and export.
Yes. CUDA is optional for supported NVIDIA GPU systems. Voice2Sub can still run through the compatible local workflow when CUDA is not available.
The macOS Universal build is designed to run on both Apple Silicon and Intel Macs. Metal acceleration is available on supported Apple Silicon Macs.
Yes. Voice2Sub is free to download for Windows, macOS and Linux.
Download Voice2Sub
Install the desktop app, import video or audio from your computer, and export subtitles, speech-to-text transcripts or text files for your workflow.