Glossary

What is Automatic Speech Recognition (ASR)?

ASR is the AI technology that converts your voice to text in real time — powering live captions, translation, and accessibility for streamers.


Definition

Automatic Speech Recognition (ASR), also called Speech-to-Text (STT), is the technology that processes audio input and converts it to written text automatically using AI and machine learning models.

Modern ASR systems achieve near-human accuracy for clear speech in common languages, operating in real time with sub-second latency.
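
Accuracy for ASR is commonly reported as word error rate (WER): the number of word substitutions, deletions, and insertions needed to turn the system's transcript into a reference transcript, divided by the number of words in the reference. The sketch below is one minimal way to compute it. The function name `word_error_rate` and the sample sentences are illustrative assumptions, not part of StreamTranslate or any provider's API, and production evaluations usually normalize punctuation and numbers more carefully than this does.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Return WER: word-level edit distance divided by reference length.

    Illustrative helper (hypothetical name). Only lowercases and splits
    on whitespace; it does not strip punctuation.
    """
    ref = reference.lower().split()
    hyp = hypothesis.lower().split()
    if not ref:
        raise ValueError("reference transcript must contain at least one word")

    # prev[j] holds the edit distance between the reference words seen
    # so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]  # i deletions turn i reference words into nothing
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr.append(min(
                prev[j] + 1,         # reference word missing (deletion)
                curr[j - 1] + 1,     # extra hypothesis word (insertion)
                prev[j - 1] + cost,  # match or substitution
            ))
        prev = curr
    return prev[-1] / len(ref)


if __name__ == "__main__":
    # One wrong word out of four reference words.
    print(word_error_rate("turn on live captions", "turn on life captions"))  # 0.25
```

A lower WER means a more accurate transcript, and 0.0 means the hypothesis matches the reference word for word. Because insertions count as errors, WER can exceed 1.0 when the system outputs many extra words.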

How ASR Works

ASR Providers Compared

StreamTranslate uses Deepgram for ASR. Here's how leading ASR providers compare for live streaming use cases:
