Build This Now
Build This Now
Claude Code ModelsOpus 4.8 CheatsheetDeepSeek V4: Pricing, Context, and MigrationClaude Code Quality Regression: What Actually HappenedClaude Opus 4.7 vs GPT-5.5Claude Opus 4.7 vs Other AI ModelsClaude Mythos: The Model That Thinks in LoopsClaude Opus 4.5 in Claude CodeClaude Opus 4.7Claude Opus 4.7 vs 4.6Claude Opus 4.7 Use CasesClaude Opus 4.6Claude Sonnet 4.6Claude Opus 4.5Claude Sonnet 4.5Claude Haiku 4.5Claude Opus 4.1Claude 4Claude 3.7 SonnetClaude 3.5 Sonnet v2 and Claude 3.5 HaikuClaude 3.5 SonnetClaude 3Every Claude Model
speedy_devvkoen_salo
Blog/Model Picker/Claude 3.7 Sonnet

Claude 3.7 Sonnet

Claude 3.7 Sonnet shipped February 2025 with hybrid reasoning and extended thinking. 64K output, thinking-budget control, SWE-bench coding gains at $3/$15.

Stop configuring. Start building.

SaaS builder templates with AI orchestration.

Published Feb 21, 2026Model Picker hub

Claude 3.7 Sonnet was the model that taught Claude to think before it spoke. Released February 25, 2025, it brought hybrid reasoning: a mode where Claude could work through a problem internally, step by step, and then deliver a more accurate answer. This was the last Claude 3.x model, and it laid the groundwork for everything that came in Claude 4.

Key Specs

SpecDetails
API IDclaude-3-7-sonnet-20250225
Context window200K tokens
Input pricing$3 / 1M tokens
Output pricing$15 / 1M tokens
Thinking token pricingIncluded in output pricing
Max output tokens64,000 (with extended thinking)
Release dateFebruary 25, 2025

Extended Thinking

The defining feature. When you turn it on, Claude runs an internal reasoning loop before it writes a single output token. The model spends a thinking budget to work the problem, then delivers the answer. For math proofs, multi-step code logic, scientific work, and planning tasks, this produced a lot better results.

API users got fine-grained control of the budget. Set it low for quick questions. Set it high for hard problems. Thinking tokens counted toward output pricing, but on the hard tasks the quality jump was worth the cost.

Hybrid Reasoning

One conversation, two modes. Quick answers for simple questions. Slow, step-by-step reasoning for the hard ones. You did not have to pick between a "thinking model" and a "fast model". The same model handled both and switched based on the task.

State-of-the-Art Agentic Coding

Claude 3.7 Sonnet set new highs on SWE-bench Verified, the benchmark that tests real GitHub issues (not synthetic problems). It could read a bug report, work through the codebase, find the root cause, and ship a working fix more reliably than any Claude before it.

Instruction-Following and Multimodal

Building on the 3.5 Sonnet v2 gains, 3.7 Sonnet got better at following long, multi-constraint instructions. Images, charts, and mixed media inputs all came back with higher accuracy too.

How Extended Thinking Worked in Practice

The pattern was simple:

  1. Send a complex prompt (code review, math proof, architectural call)
  2. Claude spends its thinking budget to reason it through internally
  3. The answer comes back with higher accuracy and fewer logical mistakes

The biggest wins landed on math, science, and multi-file code changes. Tasks that used to need several back-and-forth corrections often came back right on the first try.

For a deeper look at getting the most out of it, see the deep thinking techniques guide.

Pricing and Output Size

Same $3/$15 per million tokens. Same 200K context window. But meaningfully better reasoning, coding, and instruction-following. The max output token limit jumped to 64,000 when extended thinking was on (up from 8,192), which made it real for generating long code, docs, or analysis in a single response.

The most important difference was qualitative: Claude 3.7 Sonnet made fewer reasoning mistakes on hard tasks. Extended thinking gave it a way to "show its work" internally, catching errors before they reached the output.

Status

ModelStatus
Claude 3.7 SonnetSuperseded by Claude 4 generation

Claude 3.7 Sonnet was the bridge between 3.x and 4.x. The hybrid reasoning and extended thinking ideas it pioneered became standard in Claude 4 and every model that followed.

Related Pages

  • All Claude Models for the full model index
  • Claude 3.5 Sonnet v2, the October 2024 predecessor
  • Claude 4, the next generation
  • Deep thinking techniques for getting the most out of extended thinking
  • Model selection strategies for choosing between Claude models

More in Model Picker

  • Claude Mythos: The Model That Thinks in Loops
    Claude Mythos is suspected to use recurrent-depth architecture: one shared layer looped N times, with ACT halting so hard questions get more passes and easy ones stop early.
  • Claude Opus 4.7 vs Other AI Models
    Claude Opus 4.7, GPT-5.4, Kimi K2.6, Gemini 3.1 Pro, DeepSeek V3.2: benchmarks, context windows, agent reliability, and cost, so you reach for the right one.
  • DeepSeek V4: Pricing, Context, and Migration
    DeepSeek V4 ships two models: V4-Flash at $0.28/M output and V4-Pro at $3.48/M. Both carry a genuine 1M context window and drop into any Anthropic-compatible SDK with one line changed.
  • Every Claude Model
    Every Claude model on one page: Claude 3, 3.5, 3.7, 4, Opus 4.1 to 4.6, Sonnet 4.5 and 4.6, Haiku 4.5. Specs, pricing, benchmarks, and when to use each.
  • Claude 3.5 Sonnet v2 and Claude 3.5 Haiku
    Claude 3.5 Sonnet v2 and 3.5 Haiku launched October 2024 with Computer Use beta, cursor control, upgraded coding and tool use, and cheaper Haiku at $0.80/$4.
  • Claude 3.5 Sonnet
    Claude 3.5 Sonnet launched June 2024 at $3/$15, beating Claude 3 Opus on MMLU, GPQA, HumanEval at a fifth of the cost. Specs, benchmarks, and code gains.

Stop configuring. Start building.

SaaS builder templates with AI orchestration.

On this page

Key Specs
Extended Thinking
Hybrid Reasoning
State-of-the-Art Agentic Coding
Instruction-Following and Multimodal
How Extended Thinking Worked in Practice
Pricing and Output Size
Status
Related Pages

Stop configuring. Start building.

SaaS builder templates with AI orchestration.