Anthropic’s Claude Haiku 4.5: Sonnet-4 Coding Power at One-Third the Cost and Twice the Speed

What Claude Haiku 4.5 is

Anthropic has released Claude Haiku 4.5, a latency-optimized small model designed for interactive, cost-sensitive workflows. It aims to deliver coding performance comparable to Claude Sonnet 4 while running more than twice as fast and costing about one-third as much. The model is available immediately via Anthropic’s API and through partner catalogs on Amazon Bedrock and Google Cloud Vertex AI.

Target workloads and positioning

Haiku 4.5 is built for real-time assistants, customer-support automations, pair-programming, and other scenarios where latency and throughput are critical. Anthropic positions Haiku 4.5 as a drop-in replacement for Haiku 3.5 and for many use cases where Sonnet 4 would previously be used but cost or latency are limiting factors. Sonnet 4.5 remains the top-tier model for complex multi-step planning and highest-fidelity coding tasks, while Haiku 4.5 offers near-frontier coding performance at much lower cost.

How Anthropic recommends using Haiku 4.5

A recommended architecture is to use a frontier model such as Sonnet 4.5 for planning and orchestration, and to parallelize execution with multiple Haiku 4.5 workers. That planner–executor split lets teams keep heavy reasoning and orchestration on the higher-capability model while benefiting from Haiku 4.5’s lower latency and operating cost for routine or parallelizable jobs.

Availability and pricing

Developers can call the model identifier claude-haiku-4-5 on Anthropic’s API. Anthropic also lists Haiku 4.5 in the model catalogs on Amazon Bedrock and Google Cloud Vertex AI; cloud catalog IDs and regional coverage may vary over time. Pricing at launch is $1 per million input tokens and $5 per million output tokens. Prompt-caching prices are listed at $1.25/MTok for writes and $0.10/MTok for reads.

Benchmarks and methodology

Anthropic published benchmark summaries across several standard and agentic suites and documented methodology details to help qualify the numbers. Representative items include:

Anthropic emphasizes coding parity with Sonnet 4 on these scaffolds and reports gains on computer-use tasks such as GUI and browser manipulation. They note that users should replicate tests with their own orchestration, tool stacks, and thinking budgets before generalizing performance claims.

Key takeaways for developers and teams

For the original launch post and technical details, see Anthropic’s announcement at https://www.anthropic.com/news/claude-haiku-4-5.