Free Audio Translator

Free Audio Translator

Translate audio from 99 languages into English text, right in your browser. Drop in a Spanish podcast, a Mandarin interview, a French lecture — get English text with timestamps. Download as .txt, .srt, or .vtt. No upload. No sign-up.

Drop an audio or video file

or click to browse. Any of 99 languages. Best with files under 30 min on most browsers. Cap at 60 min — split longer files first with our Audio Splitter.

MP3WAVOGGFLACAACM4AWEBMMP4MOV

Don't have a file? Record one with our voice recorder to test how translation works.

100% in your browser. Audio stays on your device. The Whisper AI model downloads once (~40 MB) from a public CDN, then runs locally for every translation. We can't access your audio because it never leaves your computer. Privacy policy.

file.mp3

Best with clear speech. Uncheck Translate to English for source-language transcription. Keep this tab open — we'll chime if you switch tabs. Models cache after first download. How models compare →

Loading model… 0%

Translation

Translate audio to English — free, private, browser-based

SnipSound's Audio Translator uses OpenAI's open-source Whisper speech-translation model running entirely in your browser via WebAssembly. Upload a Spanish podcast, a Mandarin interview, a French lecture, an Arabic voice memo — Whisper renders it as English text with accurate timestamps. The first time you click Translate, your browser downloads a ~40 MB AI model from a public CDN; after that, every translation is local.

What it's great for

What it's not so good for

How it compares to Cockatoo, Otter, Rev

Cockatoo, Otter, Rev, Trint, Sonix all run larger Whisper variants on their servers. Quality is meaningfully higher — especially on heavy accents, multi-speaker audio, low-resource languages. They charge $10-30/month or $1/minute because GPU servers cost money. SnipSound's wedge: free, no sign-up, no upload. Use this when privacy / cost matters more than maximum accuracy.

Need transcription instead?

Uncheck Translate to English for source-language transcription, or use the dedicated Audio Transcription tool.

Frequently asked questions

What does the Audio Translator do?
It listens to your audio in any of 99 supported languages and produces English text. Spanish podcast → English transcript. Mandarin interview → English notes. All in your browser.
What languages can it translate FROM?
99 languages including Spanish, Mandarin Chinese, French, Portuguese, German, Italian, Russian, Japanese, Korean, Arabic, Hindi, Dutch, Polish, Turkish, Vietnamese, Indonesian and many more.
Can it translate INTO languages other than English?
Not today. Whisper translates between any source language and English only. To translate INTO another language, transcribe in the source language using our Audio Transcription tool, then paste into Google Translate or DeepL.
Is my audio uploaded?
No. Audio stays in your browser. The AI model downloads once from a public CDN and is cached locally.
How accurate is the translation?
Good for clear speech in well-represented languages. Drops on heavy accents, music, low-resource languages, noisy environments. Paid services with bigger models outperform for high-quality work.
Does it work for translating subtitles?
Yes. Download the .srt or .vtt output and load it into YouTube, Vimeo, or your video editor as English subtitles for a foreign-language video.