Stream translation works by capturing microphone audio, converting it to text via speech recognition, translating to the target language, and displaying subtitles on stream via OBS browser source — all in under 2 seconds.
Start Translating Free → No credit card · 28+ languages · Works with OBS, Streamlabs, XSplitModern stream translation tools like StreamTranslate use a four-stage pipeline: audio capture → speech recognition → machine translation → subtitle display. Each stage is optimized for minimum latency.
Stage 1: Audio capture. The browser source captures your microphone audio in real time. No software install needed — it runs in the browser.
Stage 2: Speech recognition (STT). Audio is sent to a speech-to-text engine. StreamTranslate uses Deepgram Nova-2, which processes audio in chunks of ~500ms and returns accurate transcriptions even for accents and gaming audio.
Stage 3: Machine translation. The transcribed text is sent to a translation API. StreamTranslate uses a multi-provider translation race — it simultaneously queries multiple translation engines and uses the first result that returns, minimizing latency.
Stage 4: Subtitle display. The translated text is rendered as a styled subtitle overlay displayed via the OBS browser source. The entire pipeline completes in under 2 seconds end-to-end.