llama-4-17b-128e-instruct

Meta's largest openly released Llama 4 model, with 17B active parameters across 128 experts, supporting multimodal input and a 1-million-token context window for complex agentic workflows.

about

With 128 routed experts plus one shared expert per layer and 400B total parameters, Maverick has the highest expert count of any Llama model to date. It uses early-fusion multimodality trained on roughly 22 trillion tokens of text and image data, debuted at an LMSYS Chatbot Arena ELO of 1417 (scored by an experimental chat-tuned variant), above both GPT-4o and Gemini 2.0 Flash at the time, and offers a 1-million-token context window.

License: llama4
Context window: 1,000,000 tokens
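
To make the expert layout concrete, below is a minimal, illustrative sketch of how a mixture-of-experts layer routes a single token: a router scores the 128 routed experts, the top-scoring expert runs, and its output is added to the always-on shared expert. The tiny hidden size, single-matrix "experts," and top-1 routing rule are simplifications for illustration, not Meta's implementation.

```python
import numpy as np

HIDDEN = 64        # tiny illustrative hidden size, not the real model dimension
NUM_ROUTED = 128   # routed experts per MoE layer, as described above

rng = np.random.default_rng(0)

# Toy "experts": a single linear map stands in for each full FFN block.
routed_experts = [rng.standard_normal((HIDDEN, HIDDEN)) * 0.02 for _ in range(NUM_ROUTED)]
shared_expert = rng.standard_normal((HIDDEN, HIDDEN)) * 0.02
router = rng.standard_normal((HIDDEN, NUM_ROUTED)) * 0.02

def moe_layer(token: np.ndarray) -> np.ndarray:
    """Route one token: the shared expert always runs, plus one routed expert."""
    logits = token @ router                       # score all 128 routed experts
    probs = np.exp(logits - logits.max())
    probs /= probs.sum()
    top = int(np.argmax(probs))                   # top-1 routing
    routed_out = probs[top] * (token @ routed_experts[top])
    shared_out = token @ shared_expert            # shared expert sees every token
    return shared_out + routed_out

token = rng.standard_normal(HIDDEN)
print(moe_layer(token).shape)  # (64,) -- only 2 of the 129 expert blocks did any work
```

This is why only about 17B of the 400B total parameters contribute to any given token: most experts sit idle for most tokens.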

Use cases for llama-4-17b-128e-instruct

  1. Multimodal document processing: Early-fusion vision trained on 22 trillion tokens enables native understanding of charts, diagrams, and screenshots alongside text without adapter overhead.
  2. Image-grounded conversation: With native image input, it answers complex visual questions about photographs, UI designs, and technical schematics in multi-turn dialogue.
  3. Efficient large-model inference: 128 experts with only 17B active per token deliver frontier-quality output at a fraction of the compute cost of comparably capable dense models (see the back-of-envelope sketch after this list).
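
As a rough illustration of that efficiency claim, here is a back-of-envelope sketch. It assumes per-token decode cost scales with roughly 2 FLOPs per active parameter and compares against a hypothetical dense 70B model; attention, routing overhead, and memory traffic are ignored.

```python
# Back-of-envelope decode cost: assume ~2 FLOPs per active parameter per token.
ACTIVE_PARAMS = 17e9   # Maverick activates ~17B parameters per token
TOTAL_PARAMS = 400e9   # out of ~400B total parameters
DENSE_70B = 70e9       # reference point: a dense 70B model activates everything

flops_maverick = 2 * ACTIVE_PARAMS
flops_dense_70b = 2 * DENSE_70B

print(f"Active fraction: {ACTIVE_PARAMS / TOTAL_PARAMS:.1%}")                    # roughly 4%
print(f"Per-token FLOPs vs dense 70B: {flops_maverick / flops_dense_70b:.2f}x")  # roughly 0.24x
```

Note that the savings are in per-token compute, not memory: all 400B parameters still need to be resident to serve the model.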

Quality

Arena Elo: 1327
MMLU: 85.5
MT Bench: N/A

Llama 4 Maverick scores 85.5% on MMLU and 80.5% on MMLU-Pro, placing it close to GPT-4o (88.7% MMLU) while using only 17B active parameters. Its LMSYS Arena ELO of 1,327 sits above GPT-4o (1,316), achieved with 128-expert routing that activates only a fraction of its 400B total parameters per token.

Arena Elo comparison:

  • o1-mini: 1337
  • o3-mini: 1337
  • llama-4-17b-128e-instruct: 1327
  • gpt-4-turbo-preview: 1324
  • llama-3.3-70b-versatile: 1318

pricing

Running Llama 4 Maverick through Telnyx Inference follows 70B+ pricing at $0.0006 per 1,000 tokens; although the model carries 400B total parameters, only 17B are active per token. Processing 1,000,000 multimodal queries at 1,500 tokens each would cost approximately $900, delivering GPT-4o-competitive quality with MoE efficiency.
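
The arithmetic behind that estimate, so you can substitute your own volumes (the query count and tokens-per-query below are just the example figures from the paragraph above):

```python
PRICE_PER_1K_TOKENS = 0.0006   # USD, Telnyx 70B+ tier
QUERIES = 1_000_000
TOKENS_PER_QUERY = 1_500       # prompt + completion combined

total_tokens = QUERIES * TOKENS_PER_QUERY
cost = total_tokens / 1_000 * PRICE_PER_1K_TOKENS
print(f"${cost:,.2f}")         # $900.00
```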

What's Twitter saying?

  • Developers note Llama 4 Maverick underperforms in coding tasks, often giving up or producing inferior results compared to DeepSeek v3, making it better for "vibe coding" than serious development.
  • Tech commentators express disappointment with verbosity, as Maverick "faffs around" with long, circular responses and jokes, burying answers and failing to get to the point.
  • Community benchmarks highlight controversy over inflated scores, with LMArena's high ELO from an experimental, non-released chat-tuned version leading to bans on such models.

Explore Our LLM Library

Discover the power and diversity of large language models available with Telnyx. Explore the options below to find the perfect model for your project.

Organization: deepseek-ai
Model Name: DeepSeek-R1-Distill-Qwen-14B
Tasks: text generation
Languages Supported: English
Context Length: 43,000
Parameters: 14.8B
Model Tier: medium
License: deepseek

TRY IT OUT

Chat with an LLM

Powered by our own GPU infrastructure, select a large language model, add a prompt, and chat away. For unlimited chats, sign up for a free account on our Mission Control Portal.

HOW IT WORKS

Selecting LLMs for Voice AI

RESOURCES

Get started

Check out our helpful tools to get you started.

  • Test in the portal

    Easily browse and select your preferred model in the AI Playground.

  • Explore the docs

    Don’t wait to scale, start today with our public API endpoints.

  • Stay up to date

    Keep an eye on our AI changelog so you don't miss a beat.

Sign up and start building

faqs

Is Llama 4 Maverick free to use?

Llama 4 Maverick is released under Meta's community license, making it free for most commercial applications. Weights are available on Hugging Face and through hosted inference providers.

What is Llama 4 Maverick?

Llama 4 Maverick is Meta's mixture-of-experts model with 17 billion active parameters drawn from 128 experts, designed for high-capability reasoning at efficient compute cost. It was released as part of Meta's Llama 4 family alongside Llama 4 Scout.

What provider is Llama 4 Maverick?

Llama 4 Maverick is available through multiple providers including Telnyx, together.ai, Fireworks, and directly from Meta's own infrastructure. It can also be self-hosted using the open weights.
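
For the self-hosted route, here is a minimal sketch using vLLM. The Hugging Face model id and the tensor-parallel degree shown are assumptions for illustration; check the official model card for the exact repository name and the multi-GPU setup needed to hold the 400B total parameters.

```python
# Minimal self-hosting sketch with vLLM; model id and GPU count are assumptions.
from vllm import LLM, SamplingParams

llm = LLM(
    model="meta-llama/Llama-4-Maverick-17B-128E-Instruct",  # assumed HF model id
    tensor_parallel_size=8,                                  # 400B total params need multiple GPUs
)

params = SamplingParams(max_tokens=256, temperature=0.7)
outputs = llm.generate(["Summarize the attached chart in one paragraph."], params)
print(outputs[0].outputs[0].text)
```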

Is Llama 4 Maverick MoE?

Yes, Llama 4 Maverick uses a mixture-of-experts architecture with 128 routed experts plus a shared expert, activating roughly 17B parameters per token. This MoE design delivers strong performance while keeping per-token compute cost manageable.

How does Maverick compare to Scout?

Maverick is the larger, more capable model with 128 experts, while Scout uses 16 experts for faster, lighter inference. Maverick targets complex reasoning tasks while Scout is better suited for high-throughput production workloads.

Is Llama 4 Maverick good at coding?

Maverick performs well on coding benchmarks, benefiting from its large expert pool for specialized code patterns, and is competitive with GPT-4-class models on code generation, particularly on multi-file reasoning tasks. That said, early community feedback on real-world coding has been mixed (see the Twitter section above), so it is worth evaluating against your own codebase.