Tools that turn audio into searchable, editable text for repurposing, show notes, or accessibility.
48 tools in this category
Real-time transcription and meeting notes with sharable highlights.
Voice AI API that developers reach for when accuracy and uptime actually matter.
Pay-per-minute transcription with human-grade accuracy when you actually need 99%.
Enterprise voice AI APIs with a focus on speed, scale, and unified voice agents.
Batch transcription powered by the open-source model that reset the bar.
Enterprise speech-to-text with deep on-prem and global language coverage.
Multilingual Whisper-powered API with sub-300ms streaming.
Unified speech model with mid-sentence translation across 60+ languages.
Affordable human transcription with optional verbatim and subtitling.
Newsroom-friendly transcription with collaborative story editing.
Per-hour automated transcripts with 40+ language support.
Transcripts and subtitles in 120 languages with a clean editor.
Cross-device transcription with a tidy mobile app for field interviews.
Boutique podcast transcription with strict accuracy guarantees.
Scribe model from the voice-AI company
Rev's developer API for async and streaming ASR
Hybrid AI plus human transcription for regulated industries
Four-step human transcription at budget rates
HIPAA-compliant human transcription with vertical specialisation
Accessibility-first captioning and transcription
Boutique human transcription with film and TV pedigree
US-based human transcription with industry expertise
Amazon's managed speech-to-text service
Microsoft's enterprise-grade ASR with custom model training
Google's flagship ASR with the Chirp 2 model
IBM's long-running enterprise ASR service
GPU-accelerated ASR you run on your own hardware
On-device streaming speech-to-text
European ASR API with strong CEE language coverage
Low-cost speech-to-text API for indie developers
Broadcast live captioning and ASR
Live captioning service for events and education
Personal live captioning and lecture transcription
Transcription bundled with Cleanvoice's noise and filler removal
Meeting notes from Zoom, Meet, and Teams
Meeting recorder with AI highlights and clips
Conversation intelligence for revenue teams
Open-source offline speech recognition
Open framework for speech and multimodal AI
Python wrapper around multiple ASR engines
Open Whisper variants and fine-tunes
Google's free Android live captioning app
Free auto-generated captions on every YouTube upload
Browser dictation tool, no signup
100-plus-language transcription with translation
EU-based AI plus human transcription and captioning
Editor-first transcription that doubles as your DAW
Transcripts and captions inside Riverside studio