AssemblyAI Universal-Streaming is now available as a speech-to-text option through the Telnyx STT API and Mission Control, adding low-latency transcription with built-in turn detection for voice agents.
assemblyai/universal-streaming model is now available for real-time speech-to-text transcription through the STT API, with built-in end-of-turn detection for voice agent workflows.AssemblyAI STT uses the model ID assemblyai/universal-streaming for real-time transcription with low latency and automatic turn detection, so your voice agent knows when the caller has finished speaking.
Supported languages: English, Spanish, German, French, Portuguese, and Italian.
Developers building voice agents now have another low-latency STT option alongside Deepgram Flux, with AssemblyAI's Universal-Streaming model providing turn detection out of the box. This gives more flexibility to choose the STT engine that fits your accuracy, language, and latency requirements.
Via the API:
assemblyai/universal-streaming in your STT API or WebSocket requests. See the transcription settings documentation for model details.In Mission Control:
assemblyai/universal-streaming from the Transcription Model dropdown