Claude 3.7 Sonnet

Claude 3.7 Sonnet was the model that taught Claude to think before it spoke. Released February 25, 2025, it brought hybrid reasoning: a mode where Claude could work through a problem internally, step by step, and then deliver a more accurate answer. This was the last Claude 3.x model, and it laid the groundwork for everything that came in Claude 4.

Key Specs

Spec	Details
API ID	`claude-3-7-sonnet-20250225`
Context window	200K tokens
Input pricing	$3 / 1M tokens
Output pricing	$15 / 1M tokens
Thinking token pricing	Included in output pricing
Max output tokens	64,000 (with extended thinking)
Release date	February 25, 2025

Extended Thinking

The defining feature. When you turn it on, Claude runs an internal reasoning loop before it writes a single output token. The model spends a thinking budget to work the problem, then delivers the answer. For math proofs, multi-step code logic, scientific work, and planning tasks, this produced a lot better results.

API users got fine-grained control of the budget. Set it low for quick questions. Set it high for hard problems. Thinking tokens counted toward output pricing, but on the hard tasks the quality jump was worth the cost.

Hybrid Reasoning

One conversation, two modes. Quick answers for simple questions. Slow, step-by-step reasoning for the hard ones. You did not have to pick between a "thinking model" and a "fast model". The same model handled both and switched based on the task.

State-of-the-Art Agentic Coding

Claude 3.7 Sonnet set new highs on SWE-bench Verified, the benchmark that tests real GitHub issues (not synthetic problems). It could read a bug report, work through the codebase, find the root cause, and ship a working fix more reliably than any Claude before it.

Instruction-Following and Multimodal

Building on the 3.5 Sonnet v2 gains, 3.7 Sonnet got better at following long, multi-constraint instructions. Images, charts, and mixed media inputs all came back with higher accuracy too.

How Extended Thinking Worked in Practice

The pattern was simple:

Send a complex prompt (code review, math proof, architectural call)
Claude spends its thinking budget to reason it through internally
The answer comes back with higher accuracy and fewer logical mistakes

The biggest wins landed on math, science, and multi-file code changes. Tasks that used to need several back-and-forth corrections often came back right on the first try.

For a deeper look at getting the most out of it, see the deep thinking techniques guide.

Pricing and Output Size

Same $3/$15 per million tokens. Same 200K context window. But meaningfully better reasoning, coding, and instruction-following. The max output token limit jumped to 64,000 when extended thinking was on (up from 8,192), which made it real for generating long code, docs, or analysis in a single response.

The most important difference was qualitative: Claude 3.7 Sonnet made fewer reasoning mistakes on hard tasks. Extended thinking gave it a way to "show its work" internally, catching errors before they reached the output.

Status

Model	Status
Claude 3.7 Sonnet	Superseded by Claude 4 generation

Claude 3.7 Sonnet was the bridge between 3.x and 4.x. The hybrid reasoning and extended thinking ideas it pioneered became standard in Claude 4 and every model that followed.

All Claude Models for the full model index
Claude 3.5 Sonnet v2, the October 2024 predecessor
Claude 4, the next generation
Deep thinking techniques for getting the most out of extended thinking
Model selection strategies for choosing between Claude models

Claude 3.7 Sonnet

On this page