Inference Pricing

Incorporate AI into your applications with an OpenAI compatible API at a fraction of the price.

Custom pricing

Get set up on a contract and get volume discounts the more you use.

Pay as you go

Pricing is based on which services you use, and how much you use them.

Pay as you go

Chat completions pricing (per 1M tokens)
Model
Price
Kimi K2.6 — Highest intelligence, voice AI
Input: $0.665 / 1M tokens
Cached Input: $0.080 / 1M tokens
Output: $4.000 / 1M tokens
GLM-5.1-FP8 — Most efficient reasoning
Input: $0.980 / 1M tokens
Cached Input: $0.130 / 1M tokens
Output: $4.400 / 1M tokens
MiniMax-M2.7 — Cheapest while maintaining high intelligence
Input: $0.210 / 1M tokens
Cached Input: $0.030 / 1M tokens
Output: $1.200 / 1M tokens
Other services
Service
Price
Embeddings (gte-large)
$0.0001 / 1K tokens
Speech to text
$0.003 / minute
AI-enabled storage and retrieval
$0.02 / GB / day

Sign up

Start building with our intuitive APIs.

Volume-based pricing

Telnyx offers you discounts in exchange for monthly commitments as you scale

  • Checkmark

    Contract

    Get set up on a contract with predictable monthly payments.

  • Checkmark

    Discounted rate

    Receive a discounted rate with the more you spend instead of our pay-as-you-go rates.

  • Checkmark

    24/7 support

    Free 24/7 support as well as a customer success manager dedicated to helping you.