Build This Now
Build This Now
Echte BuildsState of Claude Code 2026: What 2,500 Public Repos RevealBauen ist nicht mehr der FlaschenhalsDistribution ist der neue BurggrabenWarum QA das eigentliche Nadelöhr in der KI-Entwicklung istFirst Principles im Zeitalter der 24-Stunden-MVPsDie Autonomie-Kurve: Wie viel Freiheit darfst du einem KI-Agenten geben?Von der Idee zum SaaSGAN LoopSelf-Evolving HooksTrace to SkillDistribution AgentsKI-Sicherheits-AgentsAutonomer KI-SchwarmKI-E-Mail-SequenzenKI räumt sich selbst aufAgent Swarm OrchestrationEine komplette App mit Claude Code bauen: Echte BeispieleClaude Code für Nicht-Entwickler: Echte BeispieleClaude Code for Freelancers: Ship 3x FasterA Security Update from Build This NowThe AI Agent That Deleted a Production Database in 9 SecondsHow to Build Your Own Claude Code Harness (or Buy One)Run Claude Code on a Cheaper Model: DeepSeek and GLM Cost ArbitrageIs Claude Code Just a Thin Wrapper? Inside the Harness DebateHow Much Does It Really Cost to Build a SaaS with Claude Code?How to Cut Your Claude Code Token Bill in HalfDo I Still Need a Boilerplate If I Use Claude Code?Harness vs Boilerplate vs Framework: The Build-System Stack ExplainedHow Long Does Idea to Production Actually Take with Claude Code?Is Vibe Coding Safe? What the Lovable and Moltbook Breaches TeachOwn Your Vercel Analytics: I Built a Drain-to-Postgres PipelineSpec-Driven Development Explained: Why Pros Stopped Vibe CodingState of Vibe-Coded SaaS Security (2026 Data)From Vibe Coding to Production: The Checklist That Stops Data LeaksVibe Coding vs Vibe Engineering vs Agentic Engineering: The 2026 GlossaryWhat Is an Agent Harness? Why the Harness, Not the Model, Is the 2026 Moat
speedy_devvkoen_salo
Blog/Real Builds/How to Build Your Own Claude Code Harness (or Buy One)

How to Build Your Own Claude Code Harness (or Buy One)

How to build a Claude Code harness from scratch, what it costs in time, and when buying a pre-built kit ships faster.

Hören Sie auf zu konfigurieren. Fangen Sie an zu bauen.

SaaS-Builder-Vorlagen mit KI-Orchestrierung.

Published Jun 26, 20268 min readReal Builds hub

To build a Claude Code harness, you create a .claude/ folder in your project that holds your rules, skills, hooks, and settings so Claude follows your workflow every session without being re-told. Building one that you are happy with takes most developers 3 to 6 months of tuning, so the honest choice is: build it yourself if your stack is unusual or you want total control, or buy a pre-built kit (priced $0 to $149) if you want to start shipping today.


Hören Sie auf zu konfigurieren. Fangen Sie an zu bauen.

SaaS-Builder-Vorlagen mit KI-Orchestrierung.


What a Claude Code harness actually is

A "harness" is just the setup that tells Claude Code how to work on your project. Without one, you re-explain your rules every chat. With one, Claude reads your instructions at the start of every session and follows them.

Anthropic describes a harness as five layers. Here is what each one is and the file it lives in:

  1. Memory. Your standing rules and project facts. Lives in CLAUDE.md, a plain Markdown file Claude reads automatically at session start.
  2. Tools. Extra abilities you give Claude, like a reusable skill or an MCP server (a small program that connects Claude to an outside system). Skills live in a skills/ folder.
  3. Permissions. What Claude is allowed to do without asking. Lives in settings.json.
  4. Hooks. Small scripts that run automatically before or after Claude does something. Hook commands are wired in settings.json and point at your own scripts.
  5. Observability. A record of what happened, so you can debug. Lives in your session logs.

Get those five right and Claude behaves like a teammate who already knows your codebase.

Why "more rules" is the wrong instinct

The tempting move is to dump every rule into CLAUDE.md. That backfires. Developers report that rule-following drops to around 30% once CLAUDE.md grows past roughly 4,000 tokens (about 3,000 words). The model starts skimming. So the fix is not a longer file. The fix is better architecture: short memory, real skills for repeated tasks, and hooks for anything that must never be skipped.

Hooks are the only rule Claude cannot ignore

This is the part most people miss. A line in CLAUDE.md is a suggestion. The model can reason its way around it. A hook is law.

A PreToolUse hook runs right before Claude uses a tool. If that hook exits with code 2, the tool call is blocked. No exceptions, no negotiation. That makes hooks the correct home for quality gates ("never commit without passing tests") and guardrails ("never run a delete command on production"). If a rule truly cannot be broken, it belongs in a hook, not in your Markdown.

The third path: let Claude write its own harness

In 2026 Anthropic added dynamic workflows. For a one-off complex task, Claude can write its own short JavaScript orchestration on the fly instead of relying on your static files. It picks from six patterns: classify-and-act, fan-out-and-synthesize, adversarial verification, generate-and-filter, tournament, and loop-until-done.

What this changes: your static .claude/ directory no longer has to encode every orchestration trick. It is now for the things that persist, like your team conventions and your stack opinions. The throwaway logic for a single hard task, Claude can generate itself.

DIY vs. Pre-built vs. Dynamic Workflow

QuestionBuild from scratchFree open-source starter (MIT, npx)Opinionated paid kit ($29 to $99)Claude dynamic workflows
Time to first working setupWeeksMinutesMinutesSeconds (per task)
Upfront cost$0 (your time)$0$29 to $99$0 (token cost only)
Ongoing maintenanceHigh (you own it)MediumLow (kit ships updates)None
Works for any stackYesYesNo (stack-specific)Yes
Enforces persistent team conventionsYesYesYesNo
Best for one-off complex tasksNoNoNoYes
Skills/rules out of the box0~25 skills, 27 commands, 10 hooksSkills + specialist agentsGenerated per task
Recommended forUnusual stacks, full controlFirst-timers with no setupStandard SaaS stacks shipping fastSingle hard tasks

The buy side, honestly

Three real options, with the tradeoffs stated plainly:

  • Free MIT starter kit (npx). A community starter with about 27 commands, 10 hooks, and 25 skills. Costs nothing and works on any stack. You still tune it yourself.
  • ClaudeKit.cc Engineer Kit, reported at $99. A paid, broad kit for general engineering work.
  • Build This Now Code Kit, the $29 Code Kit. Skills and specialist agents pre-wired for a specific stack: Next.js, Supabase with row-level security on every table, and Stripe payments. It ships fast because it is opinionated. That is also the catch: opinionated kits constrain your stack choice. If you want that exact stack, you save months. If you want something else, you fight the kit.

The rule of thumb: an opinionated kit trades flexibility for speed. On a standard SaaS stack, that trade is usually worth it.

When to spend tokens on a generator-evaluator setup

For high-value work, the best architecture is two separate roles: one Claude agent generates the work, a second Claude agent grades it. This generator-evaluator split beats a single agent critiquing itself, but it costs more tokens.

Anthropic's own retro-game example shows the gap: a solo agent finished in about 20 minutes for roughly $9, while a full multi-agent harness ran about 6 hours for roughly $200. Same goal, very different bill. So save the heavy generator-evaluator setup for tasks where a better result is worth real money. For everyday work, a single agent is fine.

A decision framework

  1. No .claude directory yet? Install the free starter kit today. Zero cost, instant baseline.
  2. Building on a standard SaaS stack and want 48-hour shipping? A $29 to $99 opinionated kit pays for itself in the first session. You skip the skills and hooks you would otherwise spend months writing.
  3. Unusual codebase, or you need control over every rule? Budget about 3 months to build and iterate your own. Keep CLAUDE.md short, put hard rules in hooks, and add skills only for tasks you repeat.
  4. One-off complex task? Skip the static harness and let dynamic workflows generate the orchestration for that single job.

FAQ

How do I make Claude Code follow my rules every session?

Put your rules in a CLAUDE.md file inside a .claude/ directory at your project root. Claude reads it automatically at the start of every session. For rules that must never be bypassed, use a PreToolUse hook that exits with code 2 to block the tool call no matter what.

What is a Claude Code harness?

A Claude Code harness is the .claude/ directory structure (your CLAUDE.md rules, skills, hooks, and settings.json) that enforces your workflow, tech conventions, and quality gates every time Claude Code runs. Anthropic defines it as five layers: Memory, Tools, Permissions, Hooks, and Observability.

How long does it take to build a Claude Code harness?

Developers report that a setup they are happy with takes 3 to 6 months of iteration. A basic CLAUDE.md takes an hour, but tuning rule compliance, wiring hooks, and building skills that hold up across sessions takes sustained effort. A pre-built kit cuts this to a day or less.

Should I build my own Claude Code harness or buy one?

Build it if your stack is non-standard or you want full control. Buy one ($0 to $149) if you are on a common stack and want to ship fast, since pre-built kits include skills and hooks you would otherwise spend months writing. For one-off complex tasks, use Claude Code's dynamic workflows and let Claude generate the orchestration itself.

More in Real Builds

  • KI räumt sich selbst auf
    Drei overnight Claude Code-Workflows, die das Chaos der KI selbst bereinigen: slop-cleaner entfernt toten Code, /heal repariert kaputte Branches, /drift erkennt Pattern-Drift.
  • Agent Swarm Orchestration
    Four infrastructure layers that stop agent swarms from double-claiming tasks, drifting on field names, and collapsing under merge chaos.
  • GAN Loop
    Ein Agent generiert, einer reißt ihn auseinander, sie loopen bis der Score nicht mehr steigt. GAN Loop Implementierung mit Agent-Definitionen und Rubrik-Templates.
  • Die Autonomie-Kurve: Wie viel Freiheit darfst du einem KI-Agenten geben?
    Wie viel Autonomie du einem KI-Agenten geben kannst, hängt an einer einzigen Sache: wie lange ein Modell eine Aufgabe hält, ohne abzudriften. Ein gutes Gerüst plus ein zuverlässiges Modell macht echte Agentenarbeit erst möglich.
  • The AI Agent That Deleted a Production Database in 9 Seconds
    An AI deleted PocketOS's production database and all backups in 9 seconds. Here is why it happened and the guardrails that prevent it.
  • KI-E-Mail-Sequenzen
    Ein Claude Code-Befehl erstellt 17 Lifecycle-E-Mails über 6 Sequenzen, verkabelt Inngest-Verhaltenstrigger und liefert einen verzweigten E-Mail-Funnel bereit zum Deployment.

Hören Sie auf zu konfigurieren. Fangen Sie an zu bauen.

SaaS-Builder-Vorlagen mit KI-Orchestrierung.

The AI Agent That Deleted a Production Database in 9 Seconds

An AI deleted PocketOS's production database and all backups in 9 seconds. Here is why it happened and the guardrails that prevent it.

Run Claude Code on a Cheaper Model: DeepSeek and GLM Cost Arbitrage

Point Claude Code at DeepSeek or GLM to cut your bill 7 to 17x. Setup, what breaks, and the July 2026 model-name change explained.

On this page

What a Claude Code harness actually is
Why "more rules" is the wrong instinct
Hooks are the only rule Claude cannot ignore
The third path: let Claude write its own harness
DIY vs. Pre-built vs. Dynamic Workflow
The buy side, honestly
When to spend tokens on a generator-evaluator setup
A decision framework
FAQ
How do I make Claude Code follow my rules every session?
What is a Claude Code harness?
How long does it take to build a Claude Code harness?
Should I build my own Claude Code harness or buy one?

Hören Sie auf zu konfigurieren. Fangen Sie an zu bauen.

SaaS-Builder-Vorlagen mit KI-Orchestrierung.