Audio Transcription & Live Interpreter
Real-time transcription, live translation, and speech-to-speech playback for browser audio. Includes Subtitle TTS for videos.
As of June 2026, Audio Transcription & Live Interpreter has 78 users and a 3.00/5 rating from 2 reviews in the Communication category.
Usersup 875.0 percent+875.0%
78
78
Ratingno change0%
3.00
2 reviews
Reviewsno change0%
2
Version
3.1.0
Manifest V3
90-day change · In the last 90 days this extension 2 version updates.
History
10 snapshotsTracking since Apr 1, 2026.
View as table
| Date | Users | Rating | Reviews | Version |
|---|---|---|---|---|
| Apr 1, 2026 | 8 | — | — | 2.8.5 |
| Apr 17, 2026 | 8 | — | — | 2.8.5 |
| Apr 24, 2026 | 22 | — | — | 2.8.6 |
| May 1, 2026 | 23 | — | — | 2.8.6 |
| May 8, 2026 | 27 | — | — | 2.8.6 |
| May 13, 2026 | 25 | — | — | 2.8.6 |
| May 18, 2026 | 26 | 5.00 | 1 | 2.8.6 |
| May 25, 2026 | 38 | 5.00 | 1 | 3.1.0 |
| Jun 1, 2026 | 49 | 5.00 | 1 | 3.1.0 |
| Jun 7, 2026 | 64 | 3.00 | 2 | 3.1.0 |
| Now | 78 | 3.00 | 2 | 3.1.0 |
Changelog
- May 18, 2026description
Audio Transcription is a powerful extension that turns your browser into a real-time interpreter. It captures any audio playing in a tab, transcribes it, translates it live, and can even read the results back to you. It works with any audio or video stream, completely independent of whether the source has pre-existing subtitles. ⚠️ IMPORTANT: This extension requires a local server running on your computer (based on WhisperLive) to process the audio, ensuring your data remains private and operates with low latency. ✨ Key Features: • 🗣️ Real-Time Speech-to-Speech: Listen to live translations as they happen. The extension can read the transcribed or translated text aloud, creating a seamless audio interpreting experience. • 📝 Live Transcription: Fast and accurate transcription using your local machine's processing power with OpenAI's Whisper AI. • 🌐 Instant Translation: Translate live transcriptions on the fly using Google Translate (free) or Google Gemini API for advanced, context-aware translations. • 🖼️ Flexible UI Modes: View transcripts in a floating overlay or a dedicated Standalone popup window (perfect for web podcasts, Zoom, Google Meet, or Teams). • 📋 Text History: Saves all processed text in a continuous history for easy clipboard copying. • 🛡️ Total Privacy: All audio is processed on your local server. No data is sent to third-party cloud transcription services. ⚙️ SERVER INSTRUCTIONS & SOURCE CODE: To use this extension, you must run the local server. Get the server script and detailed instructions at: https://github.com/antor44/Audio-Transcription ⚖️ LICENSE: This is a free and open-source project distributed under the GNU General Public License v3.0 (GPL-3.0). For more details, visit the GitHub repository. --- 🆕 WHAT'S NEW IN VERSION 2.8.6: • ⚖️ Licensing Updates: Added comprehensive GPL-3.0 and MIT licensing notices directly into the extension's UI to comply with open-source standards. • 🖼️ Visual Updates: Added new and improved screenshots to the store listing. • ⚙️ Backend Tweaks: Minor optimizations and updates to the external local server scripts.
Audio Transcription is a powerful extension that turns your browser into a real-time interpreter. It captures any audio playing in a tab (transcribing it via Whisper AI) or reads existing video subtitles, translates them live, and reads the results back to you via Text-to-Speech (TTS). 🌟 Designed with privacy and efficiency in mind, it is optimized to run smoothly on low-resource computers and operates as independently from cloud services as possible. Compatible with Linux, Windows, and macOS, this extension acts as a true Live Interpreter for any media stream. ✨ Key Features: • 🎬 Subtitle TTS Mode: Read aloud and translate existing subtitles from YouTube, Twitch, or any HTML5 video without needing a local server. • 🌐 Source Language Control: Rely on smart auto-detection, or manually select the subtitle language for maximum accuracy. • 🗣️ Real-Time Speech-to-Speech: Listen to live translations with a natural, fluid voice that buffers complete sentences for a seamless experience. • 📝 Live Audio Transcription: Fast and accurate transcription from scratch using your local machine's processing power with OpenAI's Whisper AI (WhisperLive server required). • 🤖 Instant Translation: Translate live text using Google Translate (free) or the latest Google Gemini (Flash-Lite) & Gemma 4 AI models. • 🖼️ Flexible UI Modes: View transcripts in a floating overlay or a dedicated Standalone popup window. • 🛡️ Total Privacy: Local audio processing and transparent open-source code. ⚙️ SERVER INSTRUCTIONS & SOURCE CODE: The Subtitle TTS mode works completely out-of-the-box. However, to use the advanced "Live Audio Transcription" feature, you must run the local WhisperLive server on your computer. Get the server scripts and detailed setup instructions at: https://github.com/antor44/Audio-Transcription ⚖️ LICENSE: This is a free and open-source project distributed under the GNU General Public License v3.0 (GPL-3.0). For more details, visit the GitHub repository. --- 🆕 WHAT'S NEW IN VERSION 3.1.0: • 🎤 Smart TTS: The voice engine now intelligently waits for sentence boundaries (periods), creating a much more natural and less choppy listening experience. • ⭐ Language Selector: Added an optional 'Source Language' menu in Subtitle TTS mode to fix auto-detection edge cases. • 🤖 AI Update: Cleaned up deprecated models and added support for the new Gemma 4 generation. • 🐞 Bug Fixes: Fixed initial auto-detect hangs, stopped short phrases from being skipped, and fixed the "Stop" button state when tabs are closed.
- May 18, 2026short_description
Real-time transcription, live translation, and speech-to-speech playback for any audio playing in your browser.
Real-time transcription, live translation, and speech-to-speech playback for browser audio. Includes Subtitle TTS for videos.
- Apr 17, 2026description
Audio Transcription is a powerful extension that turns your browser into a real-time interpreter. It captures any audio playing in a tab, transcribes it, translates it live, and can even read the results back to you. It works with any audio or video stream, completely independent of whether the source has pre-existing subtitles. ⚠️ IMPORTANT: This extension requires a local server running on your computer (based on WhisperLive) to process the audio, ensuring your data remains private and operates with low latency. ✨ Key Features: • 🗣️ Real-Time Speech-to-Speech: Listen to live translations as they happen. The extension can read the transcribed or translated text aloud, creating a seamless audio interpreting experience. • 📝 Live Transcription: Fast and accurate transcription using your local machine's processing power with OpenAI's Whisper AI. • 🌐 Instant Translation: Translate live transcriptions on the fly using Google Translate (free) or Google Gemini API for advanced, context-aware translations. • 🖼️ Flexible UI Modes: View transcripts in a floating overlay or a dedicated Standalone popup window (perfect for web podcasts, Zoom, Google Meet, or Teams). • 📋 Text History: Saves all processed text in a continuous history for easy clipboard copying. • 🛡️ Total Privacy: All audio is processed on your local server. No data is sent to third-party cloud transcription services. ⚙️ SERVER INSTRUCTIONS & SOURCE CODE: To use this extension, you must run the local server. Get the server script and detailed instructions at: https://github.com/antor44/Audio-Transcription ⚖️ LICENSE: This is a free and open-source project distributed under the GNU General Public License v3.0 (GPL-3.0). For more details, visit the GitHub repository. 🆕 WHAT'S NEW IN VERSION 2.8.5: • Improved transcription/translation for complex languages (Chinese, Japanese, Korean, Arabic). • Upgraded core audio engine to the modern AudioWorklet API for better performance. • Translation is more reliable: fixed truncation bugs and improved Gemini API error handling. • Fixed a bug where a "ghost" overlay could remain visible on new tabs. • UI updates: Translation errors are now clearly visible in the status bar.
Audio Transcription is a powerful extension that turns your browser into a real-time interpreter. It captures any audio playing in a tab, transcribes it, translates it live, and can even read the results back to you. It works with any audio or video stream, completely independent of whether the source has pre-existing subtitles. ⚠️ IMPORTANT: This extension requires a local server running on your computer (based on WhisperLive) to process the audio, ensuring your data remains private and operates with low latency. ✨ Key Features: • 🗣️ Real-Time Speech-to-Speech: Listen to live translations as they happen. The extension can read the transcribed or translated text aloud, creating a seamless audio interpreting experience. • 📝 Live Transcription: Fast and accurate transcription using your local machine's processing power with OpenAI's Whisper AI. • 🌐 Instant Translation: Translate live transcriptions on the fly using Google Translate (free) or Google Gemini API for advanced, context-aware translations. • 🖼️ Flexible UI Modes: View transcripts in a floating overlay or a dedicated Standalone popup window (perfect for web podcasts, Zoom, Google Meet, or Teams). • 📋 Text History: Saves all processed text in a continuous history for easy clipboard copying. • 🛡️ Total Privacy: All audio is processed on your local server. No data is sent to third-party cloud transcription services. ⚙️ SERVER INSTRUCTIONS & SOURCE CODE: To use this extension, you must run the local server. Get the server script and detailed instructions at: https://github.com/antor44/Audio-Transcription ⚖️ LICENSE: This is a free and open-source project distributed under the GNU General Public License v3.0 (GPL-3.0). For more details, visit the GitHub repository. --- 🆕 WHAT'S NEW IN VERSION 2.8.6: • ⚖️ Licensing Updates: Added comprehensive GPL-3.0 and MIT licensing notices directly into the extension's UI to comply with open-source standards. • 🖼️ Visual Updates: Added new and improved screenshots to the store listing. • ⚙️ Backend Tweaks: Minor optimizations and updates to the external local server scripts.
Permissions & access
- Permissions
- storageactiveTabtabstabCapturescriptingtts
- Host access
- https://generativelanguage.googleapis.com/*, https://clients5.google.com/*
Screenshots
About
Audio Transcription is a powerful extension that turns your browser into a real-time interpreter. It captures any audio playing in a tab (transcribing it via Whisper AI) or reads existing video subtitles, translates them live, and reads the results back to you via Text-to-Speech (TTS). 🌟 Designed with privacy and efficiency in mind, it is optimized to run smoothly on low-resource computers and operates as independently from cloud services as possible. Compatible with Linux, Windows, and macOS, this extension acts as a true Live Interpreter for any media stream. ✨ Key Features: • 🎬 Subtitle TTS Mode: Read aloud and translate existing subtitles from YouTube, Twitch, or any HTML5 video without needing a local server. • 🌐 Source Language Control: Rely on smart auto-detection, or manually select the subtitle language for maximum accuracy. • 🗣️ Real-Time Speech-to-Speech: Listen to live translations with a natural, fluid voice that buffers complete sentences for a seamless experience. • 📝 Live Audio Transcription: Fast and accurate transcription from scratch using your local machine's processing power with OpenAI's Whisper AI (WhisperLive server required). • 🤖 Instant Translation: Translate live text using Google Translate (free) or the latest Google Gemini (Flash-Lite) & Gemma 4 AI models. • 🖼️ Flexible UI Modes: View transcripts in a floating overlay or a dedicated Standalone popup window. • 🛡️ Total Privacy: Local audio processing and transparent open-source code. ⚙️ SERVER INSTRUCTIONS & SOURCE CODE: The Subtitle TTS mode works completely out-of-the-box. However, to use the advanced "Live Audio Transcription" feature, you must run the local WhisperLive server on your computer. Get the server scripts and detailed setup instructions at: https://github.com/antor44/Audio-Transcription ⚖️ LICENSE: This is a free and open-source project distributed under the GNU General Public License v3.0 (GPL-3.0). For more details, visit the GitHub repository. --- 🆕 WHAT'S NEW IN VERSION 3.1.0: • 🎤 Smart TTS: The voice engine now intelligently waits for sentence boundaries (periods), creating a much more natural and less choppy listening experience. • ⭐ Language Selector: Added an optional 'Source Language' menu in Subtitle TTS mode to fix auto-detection edge cases. • 🤖 AI Update: Cleaned up deprecated models and added support for the new Gemma 4 generation. • 🐞 Bug Fixes: Fixed initial auto-detect hangs, stopped short phrases from being skipped, and fixed the "Stop" button state when tabs are closed.
Technical
- Version
- 3.1.0
- Manifest
- V3
- Size
- 83.2KiB
- Min Chrome
- 88
- Languages
- 1
- Featured
- No
Metadata
- ID
- mgekiekmhamibkobnlfbphhifjkhkohh
- Developer ID
- u2034146789500feb0b5ed6410876ee66
- Developer Email
- [email protected]
- Created
- Mar 23, 2026
- Last Updated (Store)
- May 18, 2026
- Last Scraped
- Jun 7, 2026
- Website
- —
Data sourced from the Chrome Web Store · last verified Jun 7, 2026.