Zero CPU · Zero GPU · Cloud-based

Cloud-Based OBS Captions — No CPU, No GPU

OBS plugins like LocalVocal run Whisper on your GPU during streams, using roughly 5-20% of the GPU and 500 MB to 2 GB of VRAM. StreamTranslate runs entirely cloud-side, so your GPU stays free for games.


No credit card. 6-hour free trial. Works on any browser.

TL;DR
Local caption plugins (LocalVocal, Polyglot OBS) run speech recognition on your machine's GPU, using roughly 5-20% of the GPU and 500 MB to 2 GB of VRAM during streams. StreamTranslate runs Deepgram speech recognition in the cloud, so your GPU stays free for the actual game. Add it as an OBS browser source: no plugin install, no compiled binaries.

Why local OBS caption plugins hurt your stream

The most popular free OBS caption plugin (LocalVocal) runs OpenAI Whisper locally on your GPU. That means every word you speak triggers Whisper inference on your machine. Typical impact: about 5-20% GPU usage, 500 MB to 2 GB of VRAM, and 5-10% CPU usage.

For streamers already pushing the GPU with games and OBS encoding, this matters. You either drop encoding bitrate, reduce streaming resolution, or accept worse Whisper transcription because the model has less GPU budget to work with. It is a triangle of game quality, stream quality, and caption quality: pick two.

Why cloud-based is different

StreamTranslate moves the speech recognition off your machine entirely. The flow:

  1. Your browser (running as an OBS browser source) captures mic audio via MediaRecorder
  2. Audio streams over WebSocket to our servers
  3. Deepgram Nova-3 runs the transcription on dedicated GPU infrastructure
  4. Translation runs in parallel (DeepL + Google + Azure race for fastest result)
  5. Result streams back to the browser overlay
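Step 4 above describes a race between translation providers. A minimal sketch of that pattern in TypeScript follows. The `Provider` shape, the `raceProviders` name, and the error messages are assumptions made for illustration; the actual StreamTranslate backend is not public, and real provider SDK calls would sit behind the `translate` function.

```typescript
// Hypothetical provider shape: each entry wraps one translation service.
interface Provider {
  name: string;
  translate: (text: string, targetLang: string) => Promise<string>;
}

interface RaceResult {
  provider: string;
  text: string;
}

// Resolve with the first provider that succeeds; reject only if every
// provider fails. On ES2021+ runtimes, Promise.any offers the same
// first-success semantics as a built-in.
function raceProviders(
  providers: Provider[],
  text: string,
  targetLang: string
): Promise<RaceResult> {
  return new Promise<RaceResult>((resolve, reject) => {
    if (providers.length === 0) {
      reject(new Error("no providers configured"));
      return;
    }
    let failures = 0;
    for (const p of providers) {
      // Promise.resolve().then(...) also captures synchronous throws.
      Promise.resolve()
        .then(() => p.translate(text, targetLang))
        .then(
          (translated) => resolve({ provider: p.name, text: translated }),
          () => {
            failures += 1;
            if (failures === providers.length) {
              reject(new Error("all providers failed"));
            }
          }
        );
    }
  });
}
```

A slow or failing provider does not delay the caption in this design: the overlay gets whichever translation arrives first, and later results are simply ignored.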

Your machine's job: encode audio, render the overlay text. Both take near-zero CPU. The heavy work happens in the cloud where dedicated GPU servers can crunch through hundreds of streams in parallel without affecting any one streamer's machine.
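The client's share of the work (forward audio chunks, render returned text) can be sketched as follows. This is an illustration under stated assumptions, not StreamTranslate's actual overlay code: `AudioSource`, `CaptionSocket`, `CaptionMessage`, and `runOverlay` are hypothetical names, and the JSON message format is invented for the example. In a browser source, the two interfaces would most likely be thin wrappers around `MediaRecorder` and `WebSocket`.

```typescript
// Minimal shapes so the sketch runs outside a browser (assumed, not official).
interface AudioSource {
  onChunk(handler: (chunk: Uint8Array) => void): void;
}

interface CaptionSocket {
  send(data: Uint8Array): void;
  onMessage(handler: (raw: string) => void): void;
}

// Hypothetical wire format; the real message schema is not documented here.
interface CaptionMessage {
  text: string;
}

// Parse one server frame, returning null for anything that is not a caption.
function parseCaption(raw: string): CaptionMessage | null {
  try {
    const value: unknown = JSON.parse(raw);
    if (typeof value === "object" && value !== null) {
      const text = (value as { text?: unknown }).text;
      if (typeof text === "string") return { text };
    }
  } catch (e) {
    // Malformed JSON: fall through and ignore the frame.
  }
  return null;
}

// Wire the pieces together: audio goes up, caption text comes down.
function runOverlay(
  source: AudioSource,
  socket: CaptionSocket,
  render: (text: string) => void
): void {
  source.onChunk((chunk) => socket.send(chunk)); // steps 1-2: capture and upload
  socket.onMessage((raw) => {
    const caption = parseCaption(raw); // step 5: result arrives
    if (caption !== null) render(caption.text);
  });
}
```

Nothing in this loop runs a model or touches the GPU beyond drawing text, which is why the local cost stays low.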

Resource comparison: local vs cloud

Resource             | StreamTranslate (cloud) | LocalVocal (local)
GPU usage            | ~0%                     | 5-20%
VRAM                 | ~0 MB                   | 500 MB - 2 GB
CPU usage            | ~1-2%                   | 5-10%
Setup                | Paste 1 URL             | Install plugin, download model, configure audio
Breaks on OBS update | Never (browser source)  | Often (plugin needs rebuild)
Internet required    | Yes                     | No (runs offline)
Cost                 | From $9.99 once         | Free

When to pick which

Pick LocalVocal if: you have a beefy GPU (RTX 4070 or better), do not need translation quality beyond what Whisper's built-in translation provides, and want totally free, offline operation.

Pick StreamTranslate if: your GPU is already busy with games, you want sub-500ms latency, you need 30+ languages with multi-provider translation, or you cannot afford to have your plugin break on the next OBS update.

Frequently asked questions

How much GPU does StreamTranslate actually use?
Near zero — same as any other OBS browser source (Twitch alerts, chat overlay). The browser source renders text and that is it. Speech recognition happens on our servers.
Does it require a strong internet connection?
It needs a stable connection, not a fast one. Audio streams to our servers at a typical upstream rate of ~50 KB/s (about 400 kbit/s), which any normal home connection handles.
What happens if my internet drops?
Captions pause until the connection is restored. The browser source automatically reconnects to our WebSocket, usually in under 1 second.
Will my game frame rate be affected?
No. The browser source only renders text, so its GPU impact is near zero, the same as any other overlay.
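
To put the ~50 KB/s upstream figure from the bandwidth answer in context, here is a small worked calculation. The function names are illustrative, the 10 Mbit/s uplink in the usage note is an assumed example rather than a measured value, and KB is treated as 1000 bytes because the page does not say otherwise.

```typescript
// Convert a rate in kilobytes per second to kilobits per second.
function toKilobitsPerSecond(kiloBytesPerSecond: number): number {
  return kiloBytesPerSecond * 8;
}

// Total upload over one hour, in megabytes (decimal units, 1 MB = 1000 KB).
function megabytesPerHour(kiloBytesPerSecond: number): number {
  return (kiloBytesPerSecond * 3600) / 1000;
}

// Fraction of an uplink (given in Mbit/s) consumed by the audio stream.
function shareOfUplink(kiloBytesPerSecond: number, uplinkMbps: number): number {
  return toKilobitsPerSecond(kiloBytesPerSecond) / (uplinkMbps * 1000);
}
```

With the quoted 50 KB/s, `toKilobitsPerSecond(50)` gives 400 kbit/s and `megabytesPerHour(50)` gives 180 MB per hour of streaming. On a hypothetical 10 Mbit/s uplink, `shareOfUplink(50, 10)` works out to 0.04, or 4% of the available upload.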

Free 6-hour trial

No credit card. One URL pasted into OBS. Live in 60 seconds.
