We’ve expanded our STT capabilities to make the Telnyx Speech-to-Text API available as a standalone service. So you can transcribe speech in real time, with or without a telephony workflow. With support for over 100 languages, sub-250ms latency, and enterprise-grade infrastructure, Telnyx STT is purpose-built for developers creating high-volume, real-time tools with voice features like AI assistants, real-time dictation workflows and multilingual user experiences.
We also expanded our STT engine options. Alongside existing support for Telnyx Speech-to-Text, Google Speech-to-Text and Deepgram, you can now use Azure Speech-to-Text through the Telnyx Speech-to-Text API, Voice API and Voice AI Agents.
Voice input is becoming a core part of how people interact with apps whether powering real-time AI agents, delivering live captions, enabling hands-free note-taking, or building voice interfaces into smart devices. The Telnyx STT API gives developers a fast, reliable way to convert live speech into text, across 100+ languages, with low latency, 24/7 support, and effortless scalability.
Explore the Telnyx developer docs for detailed instructions and learn the pricing.