CLI tool to auto-caption any YouTube video with Whisper
Generating SRT files from YouTube URLs without uploading to a service
yt-whisper is a single-purpose CLI: paste a YouTube URL, get an SRT file. It pipes through yt-dlp for the download and Whisper for the transcription. The simplest possible bridge from a public video to a usable caption file. Useful as a building block in larger pipelines.
yt-whisper is a tightly scoped open-source command-line tool that uses yt-dlp to download a YouTube video and Whisper to transcribe it, producing a SubRip file. The pitch is simplicity. You install the package, run one command with a YouTube URL, and an SRT file lands in your working directory. It is the easiest possible bridge from a public video to a usable caption track, and it pairs naturally with larger pipelines that also use yt-dlp for archival or processing. There is no GUI, no styling, no burn-in. Output is plain SubRip suitable for upload back to YouTube, ingestion into a styling tool, or use in any application that consumes standard subtitle files. The dependency on Whisper means local compute determines speed — with a GPU, transcription completes in a fraction of real time, and on CPU only it runs slower. Multi-language support comes from Whisper directly, which means transcription works across most major languages with auto-detection or explicit language flags. For developers building captioning workflows on top of YouTube content, yt-whisper is a useful building block. For end users without command-line comfort, it is the wrong shape — pair them with a wrapping tool or send them to a hosted alternative.
Auto-caption and clip generator built for creators who post to TikTok and Reels daily.
Free mobile-first editor with the viral caption styles powering TikTok.
AI video editor that leans hard into avatars and automated end-to-end edits.
CLI tool to auto-caption any YouTube video with Whisper
yt-whisper is shaped for generating srt files from youtube urls without uploading to a service. Its biggest strength: single-purpose simplicity. It pipes through yt-dlp for the download and Whisper for the transcription
cli only, no gui; no styling or burn-in. None of these are deal-breakers on their own, but they're worth knowing before you commit.
Yes. yt-whisper is genuinely free — no paywall lurking after a few episodes.
Closest in the same category: Submagic, CapCut, Captions. Each has its own shape — see the alternatives page for a side-by-side.