Claude-Haiku-4-5

Name: Claude Haiku 4: Fast Reasoning Model for Real-Time Applications
Brand: Telnyx
Price: 1 USD
Availability: InStock

Anthropic's fastest model with near-frontier performance, matching Sonnet 4 in coding at one-third the cost and more than double the speed.

Start building GET Available Models

about

Running at 98.9 tokens per second with a 0.68-second time-to-first-token, Haiku 4.5 scores 73.3% on SWE-bench Verified, within 5 points of the mid-tier Sonnet despite costing $1/$5 per million tokens. It was the first Haiku model to ship with extended thinking, computer use, and context awareness, closing the gap between Anthropic's speed tier and its reasoning tier.

Licenseanthropic

Context window(in thousands)200000

Use cases for Claude-Haiku-4-5

Real-time customer interaction triage: At 98.9 tokens per second and 0.68s time-to-first-token, Haiku 4.5 classifies and routes incoming queries faster than users can notice latency.
Automated code review at scale: Scoring 73.3% on SWE-bench Verified, it catches bugs and suggests fixes in pull request pipelines where speed matters more than maximum depth.
Edge-deployed content moderation: Its small footprint and extended thinking capability make it suited for on-device safety filtering where round-trip API calls are too slow.

Quality

Arena EloN/A

MMLUN/A

MT BenchN/A

Claude Haiku 4.5 scores 73.3% on SWE-bench Verified, within 5 points of Claude Sonnet 4 (72.7%) on the same benchmark despite costing one-third as much. On MMLU, the Claude 3 Haiku baseline scored 76.7% (0-shot CoT), and the 4.5 update maintains that range while adding extended thinking and tool use. At 98.9 tokens per second, it delivers near-Sonnet quality at Haiku speed.

Claude-Opus-4-6

1501

GLM-5

1456

gpt-5.1

1455

Kimi-K2.5

1454

gpt-5.2

1440

pricing

Running Claude Haiku 4.5 through Telnyx Inference costs $1.00 per million input tokens and $5.00 per million output tokens. Processing 1,000,000 customer support conversations at 1,000 tokens each would cost approximately $3,000, roughly one-third the cost of the same workload on Claude Sonnet 4 ($9,000).

What's Twitter saying?

Developers praise Claude Haiku 4.5 for its near-Sonnet 4.5 performance on coding benchmarks like 73% on SWE-Bench Verified, at a fraction of the cost and twice the speed, ideal for agentic apps and tool calling.
Users on Hacker News call it brilliant for nuanced coding tasks but note it's slow in some cases and requires strict rules to avoid deviation.
Early testers report shocking speed enabling full-stack apps in under a minute, though one YouTube review finds it disappointing compared to GPT-5 in non-coding areas.

Explore Our LLM Library

Discover the power and diversity of large language models available with Telnyx. Explore the options below to find the perfect model for your project.

No data available at this time, please try again later.

Organization	Model Name	Tasks	Languages Supported	Context Length	Parameters	Model Tier	License
No data available at this time, please try again later.

TRY IT OUT

Chat with an LLM

Powered by our own GPU infrastructure, select a large language model, add a prompt, and chat away. For unlimited chats, sign up for a free account on our Mission Control Portal here.

HOW IT WORKS

Selecting LLMs for Voice AI

GET Available Models

RESOURCES

Get started

Check out our helpful tools to help get you started.

Test in the portal
Easily browse and select your preferred model in the AI Playground.
Test today
Explore the docs
Don’t wait to scale, start today with our public API endpoints.
Get started
Stay up to date
Keep an eye on our AI changelog so you don't miss a beat.
See updates

Sign up and start building

faqs

What is Claude Haiku 4.5 good for?

Claude Haiku 4.5 is optimized for high-speed, cost-efficient tasks where quick responses matter. It excels at classification, summarization, and conversational AI, delivering coding performance similar to Claude Sonnet 4 at one-third the cost and more than double the speed.

Can I use Haiku 4.5 in Claude Code?

Yes, Claude Haiku 4.5 is available as a model option in Claude Code, providing a fast and cost-effective choice for coding assistance and development workflows. Its low latency makes it particularly useful for rapid iteration during development.

Is Claude Haiku 4.5 free?

Claude Haiku 4.5 is available for free with usage limits on claude.ai. Through the API, it is priced at $1 per million input tokens and $5 per million output tokens, making it Anthropic's most affordable model.

What's the difference between Claude 4.5 Sonnet vs Haiku?

Sonnet 4 is Anthropic's more capable model for complex reasoning and multi-step tasks, while Haiku 4.5 prioritizes speed and cost efficiency. Haiku 4.5 approaches Sonnet's coding performance while running significantly faster at a lower price, making it better suited for high-volume or latency-sensitive applications.

Is Claude Haiku cheap?

Haiku 4.5 is Anthropic's lowest-cost model at $1 per million input tokens, roughly one-third the price of Claude Sonnet 4. For voice AI and real-time applications that require sub-second responses, this cost structure makes Haiku a practical choice for production workloads.

Is Claude Haiku 4.5 bad?

Haiku 4.5 is not a weak model. It performs competitively on coding benchmarks and handles most everyday tasks well, according to Anthropic's own benchmarks. Its limitations show on complex reasoning and multi-step analysis, where Sonnet or Opus models are better suited.

How much does Haiku 4.5 cost?

Haiku 4.5 is priced at $1 per million input tokens and $5 per million output tokens through the API. Telnyx offers access to Haiku 4.5 through its inference infrastructure, where co-located processing can reduce overall pipeline latency.

about

Use cases for Claude-Haiku-4-5

Real-time customer interaction triage: At 98.9 tokens per second and 0.68s time-to-first-token, Haiku 4.5 classifies and routes incoming queries faster than users can notice latency.
Automated code review at scale: Scoring 73.3% on SWE-bench Verified, it catches bugs and suggests fixes in pull request pipelines where speed matters more than maximum depth.
Edge-deployed content moderation: Its small footprint and extended thinking capability make it suited for on-device safety filtering where round-trip API calls are too slow.

pricing

What's Twitter saying?

Developers praise Claude Haiku 4.5 for its near-Sonnet 4.5 performance on coding benchmarks like 73% on SWE-Bench Verified, at a fraction of the cost and twice the speed, ideal for agentic apps and tool calling.
Users on Hacker News call it brilliant for nuanced coding tasks but note it's slow in some cases and requires strict rules to avoid deviation.
Early testers report shocking speed enabling full-stack apps in under a minute, though one YouTube review finds it disappointing compared to GPT-5 in non-coding areas.

Organization

Model Name

Tasks

Languages Supported

Context Length

Parameters

Model Tier

License

No data available at this time, please try again later.

faqs

Claude-Haiku-4-5

about

Use cases for Claude-Haiku-4-5

Quality

pricing

What's Twitter saying?

Explore Our LLM Library

Chat with an LLM

Selecting LLMs for Voice AI

Create an account

Choose Claude-Haiku-4-5

Enter your API key

Prompt the LLM

Test in the portal

Explore the docs

Stay up to date

Sign up and start building