Telnyx AI Agents can now receive inbound MMS messages from users during an active voice call. This means callers can send images or rich media while speaking with the agent, and it can respond in real-time, provided it’s powered by a vision-capable LLM. Currently supported vision models include Groq/llama-4-maverick-17b-128e-instruct
and OpenAI/gpt-4o
.
What’s new
- In-call MMS support is now available for Voice AI Agents with messaging enabled and matching voice and SMS numbers.
- This functionality works seamlessly with third-party vision models that can interpret image input along spoken dialogue.
Why it matters
This release introduces a major leap in multimodal AI conversations. With MMS support:
- Callers can send a photo, file, or receipt mid-call, and the AI Agent can immediately interpret and respond.
- Agents can deliver more contextual help, like confirming an ID photo or reviewing an image of a damaged product.
- You can offer a richer, more humanlike customer experience without breaking the flow across channels.
Getting started
It only takes a few minutes to enable MMS support.
- Sign up or log in to your Telnyx Mission Control Portal account.
- Ensure messaging is enabled for your Voice AI Agent.
- Use the same
to
and from
numbers for voice and messaging. - Use a vision-capable model (like GPT-4o or LLaMA-4-Maverick on Groq).
- Send an image from your phone during a call and watch the agent respond in real time.