Inference Pricing

Incorporate AI into your applications with an OpenAI-compatible API at a fraction of the price.

Custom pricing

Get set up on a contract and earn larger volume discounts the more you use.

Pay as you go

Pricing is based on which services you use and how much you use them.

Pay as you go

Service pricing
Service	Price	% Cheaper
Chat Completions (per token)	$0.0002 / 1K tokens for 7B parameter models; $0.0003 / 1K tokens for 13B, 34B, 8x7B parameter models; $0.0006 / 1K tokens for 70B+ parameter models	Up to 90% cheaper vs. OpenAI GPT-3.5 Turbo
Embeddings (per token)	Small: $0.00005 / 1K tokens; Large: $0.0001 / 1K tokens	Up to 50% cheaper vs. OpenAI Ada
Speech to text (per minute)	$0.003 / minute	Up to 50% cheaper vs. OpenAI Whisper
AI-enabled storage and retrieval	$0.02 / GB / day	At least 90% cheaper vs. OpenAI Assistant Retrieval
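
To see how the pay-as-you-go rates in the table above add up, here is a minimal Python sketch of a cost estimator. The rates are copied from the table. The function names, the model-size keys, and the sample workload are hypothetical and exist only for illustration. The sketch also assumes that every token is billed at the same rate, with no distinction between input and output tokens, which the table does not specify.

```python
# Pay-as-you-go rates copied from the pricing table (USD).
# The dictionary keys are illustrative labels, not official API identifiers.
CHAT_RATE_PER_1K_TOKENS = {
    "7B": 0.0002,
    "13B": 0.0003,
    "34B": 0.0003,
    "8x7B": 0.0003,
    "70B+": 0.0006,
}
EMBEDDING_RATE_PER_1K_TOKENS = {"small": 0.00005, "large": 0.0001}
SPEECH_TO_TEXT_RATE_PER_MINUTE = 0.003
STORAGE_RATE_PER_GB_PER_DAY = 0.02


def chat_cost(tokens: int, model_size: str) -> float:
    """Cost of chat completions for a model size class.

    Assumption: all tokens are billed at one flat rate.
    """
    return tokens / 1000 * CHAT_RATE_PER_1K_TOKENS[model_size]


def embedding_cost(tokens: int, size: str) -> float:
    """Cost of embedding tokens with the small or large rate."""
    return tokens / 1000 * EMBEDDING_RATE_PER_1K_TOKENS[size]


def speech_to_text_cost(minutes: float) -> float:
    """Cost of transcribing the given number of audio minutes."""
    return minutes * SPEECH_TO_TEXT_RATE_PER_MINUTE


def storage_cost(gigabytes: float, days: float) -> float:
    """Cost of keeping data in AI-enabled storage for a number of days."""
    return gigabytes * days * STORAGE_RATE_PER_GB_PER_DAY


if __name__ == "__main__":
    # Hypothetical monthly workload, chosen only to exercise each rate.
    total = (
        chat_cost(5_000_000, "70B+")        # 5M chat tokens on a 70B+ model
        + embedding_cost(2_000_000, "large")  # 2M tokens embedded
        + speech_to_text_cost(600)          # 10 hours of audio
        + storage_cost(10, 30)              # 10 GB stored for 30 days
    )
    print(f"Estimated monthly cost: ${total:.2f}")  # prints $11.00
```

For this sample workload, storage is the largest line item ($6.00 of the $11.00), so the estimate is as sensitive to how long data is retained as to token volume.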

Sign up

Start building with our intuitive APIs.

Volume-based pricing

Telnyx offers you discounts in exchange for monthly commitments as you scale.

Contract
Get set up on a contract with predictable monthly payments.
Discounted rate
Receive a discounted rate that improves the more you spend, instead of paying our pay-as-you-go rates.
24/7 support
Get free 24/7 support and a customer success manager dedicated to helping you.