Telnyx - Global Communications Platform Provider
© Telnyx LLC 2026
ISO • PCI • HIPAA • GDPR • SOC2 Type II


Claude-Haiku-4-5

Anthropic's fastest model with near-frontier performance, matching Sonnet 4 in coding at one-third the cost and more than double the speed.


about

Running at 98.9 tokens per second with a 0.68-second time-to-first-token, Haiku 4.5 scores 73.3% on SWE-bench Verified, within 5 points of the mid-tier Sonnet despite costing $1/$5 per million tokens. It was the first Haiku model to ship with extended thinking, computer use, and context awareness, closing the gap between Anthropic's speed tier and its reasoning tier.

License: anthropic


faqs

What is Claude Haiku 4.5 good for?

Claude Haiku 4.5 is optimized for high-speed, cost-efficient tasks where quick responses matter. It excels at classification, summarization, and conversational AI, delivering coding performance similar to Claude Sonnet 4 at one-third the cost and more than double the speed.

Can I use Haiku 4.5 in Claude Code?

Yes, Claude Haiku 4.5 is available as a model option in Claude Code, providing a fast and cost-effective choice for coding assistance and development workflows. Its low latency makes it particularly useful for rapid iteration during development.

Is Claude Haiku 4.5 free?

Claude Haiku 4.5 is available for free with usage limits on claude.ai. Through the API, it is priced at $1 per million input tokens and $5 per million output tokens, making it Anthropic's most affordable current-generation model.

Context window: 200,000 tokens

Use cases for Claude-Haiku-4-5

  1. Real-time customer interaction triage: At 98.9 tokens per second and 0.68s time-to-first-token, Haiku 4.5 starts classifying and routing incoming queries in under a second.
  2. Automated code review at scale: Scoring 73.3% on SWE-bench Verified, it catches bugs and suggests fixes in pull request pipelines where speed matters more than maximum depth.
  3. Inline content moderation: Its low latency and extended thinking capability make it suited for safety filtering in request paths where a slower model would add noticeable delay.
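
As a rough illustration of what the quoted speed figures mean for these use cases, the sketch below estimates how long a streamed response takes end to end. It assumes a simplified model in which the response time is the time-to-first-token plus the remaining tokens at a steady throughput; the function name and this model are illustrative assumptions, and real latency varies with load, prompt length, and network conditions.

```python
def estimated_response_seconds(
    output_tokens: int,
    ttft_seconds: float = 0.68,       # time-to-first-token quoted on this page
    tokens_per_second: float = 98.9,  # throughput quoted on this page
) -> float:
    """Estimate wall-clock seconds to stream a response of `output_tokens`.

    Simplified model (an assumption, not a measured guarantee):
    total = time-to-first-token + output_tokens / throughput.
    """
    if output_tokens < 0:
        raise ValueError("output_tokens must be non-negative")
    if tokens_per_second <= 0:
        raise ValueError("tokens_per_second must be positive")
    return ttft_seconds + output_tokens / tokens_per_second


if __name__ == "__main__":
    # A short routing label versus a longer code-review comment.
    for n in (10, 100, 500):
        print(f"{n} tokens: {estimated_response_seconds(n):.2f}s")
```

Under this model, a short classification label is dominated by the time-to-first-token, while a multi-paragraph review comment is dominated by throughput.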

Quality

Arena Elo: N/A
MMLU: N/A
MT Bench: N/A

Claude Haiku 4.5 scores 73.3% on SWE-bench Verified, slightly above Claude Sonnet 4 (72.7%) on the same benchmark, despite costing one-third as much. No MMLU score is listed for Haiku 4.5; for reference, the Claude 3 Haiku baseline scored 76.7% (0-shot CoT), and the 4.5 update adds extended thinking and tool use. At 98.9 tokens per second, it delivers near-Sonnet quality at Haiku speed.

  • Claude-Opus-4-6: 1501
  • GLM-5: 1456
  • gpt-5.1: 1455
  • Kimi-K2.5: 1454
  • gpt-5.2: 1440

pricing

Running Claude Haiku 4.5 through Telnyx Inference costs $1.00 per million input tokens and $5.00 per million output tokens. Processing 1,000,000 customer support conversations at 1,000 tokens each (1 billion tokens in total) would cost approximately $3,000, assuming an even split between input and output tokens. That is roughly one-third the cost of the same workload on Claude Sonnet 4 ($9,000).
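
The workload figures above can be reproduced with a short calculation. This is a minimal sketch, not a billing tool: the function name is hypothetical, the 50/50 split between input and output tokens is an assumption that happens to match the $3,000 figure, and the Claude Sonnet 4 rates of $3 and $15 per million tokens are an assumption consistent with the $9,000 comparison rather than prices stated on this page.

```python
def workload_cost_usd(
    conversations: int,
    tokens_per_conversation: int,
    input_price_per_million: float,
    output_price_per_million: float,
    input_share: float = 0.5,  # assumed fraction of tokens that are input
) -> float:
    """Estimate the cost of a workload from per-million-token prices."""
    if not 0.0 <= input_share <= 1.0:
        raise ValueError("input_share must be between 0 and 1")
    total_tokens = conversations * tokens_per_conversation
    input_tokens = total_tokens * input_share
    output_tokens = total_tokens - input_tokens
    return (
        input_tokens * input_price_per_million
        + output_tokens * output_price_per_million
    ) / 1_000_000


if __name__ == "__main__":
    # Haiku 4.5 rates are the ones quoted on this page.
    haiku = workload_cost_usd(1_000_000, 1_000, 1.00, 5.00)
    # Sonnet 4 rates are an assumption (see the note above).
    sonnet = workload_cost_usd(1_000_000, 1_000, 3.00, 15.00)
    print(f"Haiku 4.5: ${haiku:,.0f}  Sonnet 4: ${sonnet:,.0f}")
```

Because output tokens cost five times as much as input tokens, the estimate is sensitive to `input_share`: an input-heavy workload such as classification costs less than this figure, and a generation-heavy workload costs more.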

What's Twitter saying?

  • Developers praise Claude Haiku 4.5 for near-Sonnet 4.5 performance on coding benchmarks (73% on SWE-Bench Verified) at a fraction of the cost and twice the speed, which makes it well suited to agentic apps and tool calling.
  • Users on Hacker News call it brilliant for nuanced coding tasks but note it's slow in some cases and requires strict rules to avoid deviation.
  • Early testers report shocking speed enabling full-stack apps in under a minute, though one YouTube review finds it disappointing compared to GPT-5 in non-coding areas.

What's the difference between Claude Sonnet 4.5 and Haiku 4.5?

Sonnet 4.5 is Anthropic's more capable model for complex reasoning and multi-step tasks, while Haiku 4.5 prioritizes speed and cost efficiency. Haiku 4.5 approaches Sonnet's coding performance while running significantly faster at a lower price, making it better suited for high-volume or latency-sensitive applications.

Is Claude Haiku cheap?

Haiku 4.5 is Anthropic's lowest-cost current-generation model at $1 per million input tokens, roughly one-third the price of Claude Sonnet 4. For voice AI and real-time applications that require sub-second responses, this cost structure makes Haiku a practical choice for production workloads.

Is Claude Haiku 4.5 bad?

Haiku 4.5 is not a weak model. It performs competitively on coding benchmarks and handles most everyday tasks well, according to Anthropic's own benchmarks. Its limitations show on complex reasoning and multi-step analysis, where Sonnet or Opus models are better suited.

How much does Haiku 4.5 cost?

Haiku 4.5 is priced at $1 per million input tokens and $5 per million output tokens through the API. Telnyx offers access to Haiku 4.5 through its inference infrastructure, where co-located processing can reduce overall pipeline latency.