yt-whisper

CLI tool to auto-caption any YouTube video with Whisper

Visit yt-whisperOpens in a new tab. Not an affiliate link.

Best for

Generating SRT files from YouTube URLs without uploading to a service

Our take

yt-whisper is a single-purpose CLI: paste a YouTube URL, get an SRT file. It pipes through yt-dlp for the download and Whisper for the transcription. The simplest possible bridge from a public video to a usable caption file. Useful as a building block in larger pipelines.

Pros
  • Single-purpose simplicity
  • Free and locally hosted
  • Pairs naturally with yt-dlp pipelines
Watch-outs
  • CLI only, no GUI
  • No styling or burn-in
  • Depends on YouTube terms for the videos you process
In depth

yt-whisper is a tightly scoped open-source command-line tool that uses yt-dlp to download a YouTube video and Whisper to transcribe it, producing a SubRip file. The pitch is simplicity. You install the package, run one command with a YouTube URL, and an SRT file lands in your working directory. It is the easiest possible bridge from a public video to a usable caption track, and it pairs naturally with larger pipelines that also use yt-dlp for archival or processing. There is no GUI, no styling, no burn-in. Output is plain SubRip suitable for upload back to YouTube, ingestion into a styling tool, or use in any application that consumes standard subtitle files. The dependency on Whisper means local compute determines speed — with a GPU, transcription completes in a fraction of real time, and on CPU only it runs slower. Multi-language support comes from Whisper directly, which means transcription works across most major languages with auto-detection or explicit language flags. For developers building captioning workflows on top of YouTube content, yt-whisper is a useful building block. For end users without command-line comfort, it is the wrong shape — pair them with a wrapping tool or send them to a hosted alternative.


Other tools like this

See all Captioning
Captioning$$

Auto-caption and clip generator built for creators who post to TikTok and Reels daily.

Best for: Short-form social clips
Read more →Visit site
CaptioningFreemium

Free mobile-first editor with the viral caption styles powering TikTok.

Best for: Short-form creators
Read more →Visit site
Captioning$$

AI video editor that leans hard into avatars and automated end-to-end edits.

Best for: AI avatar videos
Read more →Visit site

Compare yt-whisper with


yt-whisper FAQ

What is yt-whisper in one line?

CLI tool to auto-caption any YouTube video with Whisper

Who should pick yt-whisper?

yt-whisper is shaped for generating srt files from youtube urls without uploading to a service. Its biggest strength: single-purpose simplicity. It pipes through yt-dlp for the download and Whisper for the transcription

What should I watch out for with yt-whisper?

cli only, no gui; no styling or burn-in. None of these are deal-breakers on their own, but they're worth knowing before you commit.

Is yt-whisper free?

Yes. yt-whisper is genuinely free — no paywall lurking after a few episodes.

What can I use instead of yt-whisper?

Closest in the same category: Submagic, CapCut, Captions. Each has its own shape — see the alternatives page for a side-by-side.