Supported formats

Supported Video, Audio & Subtitle Formats

Check which video, audio and subtitle formats Voice2Sub supports, including local media import, batch workflows, and SRT, VTT, TXT, LRC or CSV export.

Broad media support with a clear container/codec caveat for unusual files.

Supported Formats

Best for format-heavy workflows

  • Files from phones and cameras
  • Screen recordings and course videos
  • Podcast and interview audio
  • Video editor exports
  • Subtitle and transcript handoff

Start with the media file you already have

Voice2Sub is designed to open a broad range of common and many less common video or audio files directly in the desktop app. That includes real-world files from phones, cameras, screen recorders, editing apps, podcasts, lectures and voice recordings.

Download Voice2Sub

Broad format support, explained clearly

  • Video containers such as MP4, MOV, MKV, AVI and WebM are common starting points for subtitle generation.
  • Audio files such as MP3, WAV, M4A, AAC and FLAC can become transcripts or timed subtitle files.
  • Containers can hold different codecs, so very unusual files should be opened in the app first before a long processing job.
  • Export formats are intentionally practical: SRT/VTT for captions, TXT for text, LRC for line timestamps and CSV for structured review.
  • Choose from up to 99 recognition languages before generating subtitle or transcript files.

Review step

Format support plus review

Use format support to open common media and subtitle files, then review generated output before choosing the final export format.

Explore subtitle editor

Workflow

Import media, generate subtitles, export the right format

Check compatibility before format-heavy work such as MP4 to SRT, MP3 to transcript, VTT for web players or CSV review handoffs.

  1. 01

    Open a local media file

    Start from MP4, MOV, MKV, AVI, WebM, MP3, WAV, M4A, FLAC or another supported video/audio file.

  2. 02

    Generate AI text

    Voice2Sub recognizes speech and creates subtitle or transcript text you can review.

  3. 03

    Choose the output format

    Export SRT, VTT, TXT, LRC or CSV based on where the result will be used.

Supported formats

Supported input and export formats

Voice2Sub is designed for broad media support: common containers and many less common real-world video or audio files from creators, cameras, phones, screen recorders, podcasts, lectures and editing apps. If a familiar container uses an unusual codec, open it in Voice2Sub to confirm compatibility before a long workflow.

Video containers

Import videos from phones, cameras, screen recorders, editing apps and archive folders.

  • MP4
  • MOV
  • MKV
  • AVI
  • WebM
  • WMV
  • FLV
  • M4V
  • MPEG/MPG
  • TS
  • 3GP
  • VOB
  • MXF
  • DIVX
  • and more

Audio formats

Import recordings, podcasts, interviews, lectures, meetings and voice notes.

  • MP3
  • WAV
  • M4A
  • AAC
  • FLAC
  • OGG
  • WMA
  • OPUS
  • AIFF
  • AMR
  • AC3
  • APE
  • and more

Subtitle and transcript exports

Export subtitles, WebVTT, transcript text, timestamped lyric-style text or CSV review data.

  • SRT
  • VTT / WebVTT
  • TXT
  • LRC
  • CSV
  • English output can be reviewed and exported separately.

File extensions are examples. Containers such as MP4, MOV or MKV can hold different codecs; unusual or proprietary codecs should be tested in the app before long processing jobs.

Export guide

Which export format should you choose?

Choose the output based on where the subtitles or transcript will be used after review.

YouTube and video platforms

Use SRT or VTT when you need a standard caption file for publishing workflows.

  • SRT
  • VTT

Web players

Use VTT/WebVTT when the destination is a web video player or HTML5 caption workflow.

  • VTT

Editing and review

Use SRT for broad compatibility or CSV when structured timestamp/text review is useful.

  • SRT
  • CSV

Transcript archive

Use TXT when you need readable text for notes, search, documentation or handoff.

  • TXT

Lyric-style timestamps

Use LRC when line-by-line timestamps are helpful for lyric-style or timestamped text workflows.

  • LRC

Container vs codec

Why one MP4 may work differently from another

MP4, MOV and MKV are containers. They can hold different audio/video codecs inside the same extension. Voice2Sub is built for broad real-world compatibility, but very unusual or proprietary codecs should be opened in the app first before a long processing job.

  • Container: MP4, MOV, MKV
  • Codec: the media stream inside
  • Test unusual files first

Input coverage

A practical format page for files users actually have

The page is meant to answer format questions from creators, educators and editors: camera videos, screen recordings, downloaded meeting files, podcast audio, lectures, interviews and exports from editing software.

  • Phone and camera videos
  • Screen recordings
  • Podcast and lecture audio

Output coverage

Subtitle and transcript formats serve different jobs

SRT and VTT are caption files. TXT is readable transcript text. LRC adds line-by-line timestamps, and CSV is useful for structured review or handoff. The right output depends on where the result goes after review.

  • SRT/VTT for captions
  • TXT for transcript archives
  • CSV for structured review

Conversion rule

Do not convert first unless you need to

For common video and audio files, start with the original media. Convert only when a file cannot be opened, uses an unsupported codec, or comes from an unusual tool that created a nonstandard stream.

  • Start with the original file
  • Test unusual codecs
  • Convert only when needed

AI Transcription

AI Transcription

Voice2Sub also covers speech to text, audio to text, video transcription and Whisper AI transcription, so subtitle generation can stay connected to transcript and export needs.

  • Speech to Text
  • AI Transcription
  • Whisper AI Transcription

Format examples

Common source-to-output workflows

These examples match practical long-tail questions such as MP4 to SRT, MOV to captions, MP3 to transcript, WAV to text and VTT for web players.

  • MP4 to SRT or VTT captions
  • MOV or MKV videos to subtitle files
  • MP3 or WAV to transcript text
  • M4A recordings to TXT or SRT
  • SRT/VTT export for editors and platforms

Supported formats FAQ

Does Voice2Sub support MP4?

Yes. MP4 is a common video container supported by Voice2Sub. Codec compatibility can still vary for unusual files, so test unfamiliar media before a long workflow.

What does Original + English create?

It creates the original subtitle output and an additional English subtitle file as a separate file. The English file may use an .en suffix.

Can I create SRT from MP3 or WAV?

Yes. Import an MP3 or WAV audio file, generate text with AI recognition, review the result and export SRT when timed subtitles are useful.

What is the difference between SRT and VTT?

SRT is widely used for subtitle exchange and review. VTT, also called WebVTT, is common for web video captions and some publishing workflows.

What if a video uses an unusual codec?

A familiar extension can contain an uncommon codec. Open the file in Voice2Sub first to confirm compatibility before committing to a long processing job.

Do I need to convert files before importing?

Usually no. Start with the original video or audio file when it is a common format. Convert only if the file cannot be opened or uses an unsupported codec.

Which output formats can Voice2Sub generate?

Voice2Sub can generate subtitle and transcript files such as SRT, VTT, TXT, LRC and CSV. Availability can depend on the app build and workflow.

Open your media file and export the format you need

Download the desktop app, import video/audio on your computer, and export SRT, VTT, TXT, LRC or CSV after review.