Glossary

What is
Speech-to-Text (STT)?

Speech-to-Text is the engine behind live stream captions and translation. StreamTranslate uses Deepgram's STT to convert your voice to text in milliseconds.

Get Started Free

Definition

Speech-to-Text (STT), also called Automatic Speech Recognition (ASR), is the technology that converts spoken audio into written text in real time. STT is the first step in any live caption or translation pipeline.

How STT Works

STT in StreamTranslate

StreamTranslate uses Deepgram's Nova-2 model for speech recognition — one of the fastest and most accurate STT engines available. Deepgram achieves industry-leading word error rates for English and many other languages.

Related Resources

Pricing

See full pricing →