How Garry Tan (YC CEO) Uses Claude Code: Inside the 23-Tool gstack Setup
The 23 slash commands, six roles plus a CSO, and one install line behind Garry Tan's Claude Code setup. Plus the criticism.
Pare de configurar. Comece a construir.
Templates SaaS com orquestração de IA.
Problem: You watched Garry Tan's launch tweet, skimmed the TechCrunch piece, maybe caught the SXSW clip, and now you want to know what is actually in gstack. The repo has 97k stars. The README is dense. Half of X is calling it god mode. The other half is calling it a folder of prompts. Neither side is showing you the table.
Quick Win: Clone gstack into your Claude Code skills folder with one line and you get 23 slash commands grouped under a Think to Ship loop:
git clone --single-branch --depth 1 https://github.com/garrytan/gstack.git \
~/.claude/skills/gstack && cd ~/.claude/skills/gstack && ./setupThis post lists every command, names every role, walks the workflow, handles the criticism honestly, and tells you where gstack stops and a production skeleton starts.
What gstack Is
gstack is an open-source pack of opinionated Claude Code skills published by Garry Tan, the President and CEO of Y Combinator. He shipped it on March 12, 2026 under MIT. Every slash command embodies a specialist persona with its own priorities, constraints, and outputs. The work moves through a fixed loop: Think, Plan, Build, Review, Test, Ship, Reflect.
Repo: github.com/garrytan/gstack. As of 2026-05-15, the README ships 23 core skills (the count grew from 6 at launch, to 13 by the TechCrunch coverage on March 17, to 23 now).
Stats today: 97.4k stars, 14.5k forks, 597 watchers, plus 915 stars in the last 24 hours alone. (source)
Why It Matters
YC has funded 4,000+ companies. The CEO running that engine is publishing the exact skill setup he uses to ship code. That is the rarest possible signal for a Claude Code workflow. You can argue with the design. You cannot argue with the source.
Garry stated his own throughput on the launch post:
"I just open-sourced my entire Claude Code setup I used to average 10K LOC and 100 PRs per week in the last 50 days." (LinkedIn, 2026-03-12)
Self-reported numbers. Frame them that way. The shape of the workflow is the part you can copy.
The Role-Based Architecture
gstack groups every skill under a job title. A role owns priorities, constraints, and outputs. Free-form prompting asks Claude to wear many hats inside one message. Roles split the hats and pass work between them.
| Role | What it owns |
|---|---|
| CEO / Founder | Scope, product framing, what to build |
| Designer | Visual system, mockups, AI slop detection |
| Eng Manager | Architecture, data flow, edge cases |
| Release Manager | Sync, test, push, PR, deploy |
| Doc Engineer | Docs in sync with shipped code |
| QA Lead | Real browser testing, bug fixes |
| Chief Security Officer | OWASP Top 10 + STRIDE threat modeling |
The loop reads as one sentence: Think (Office Hours plus CEO review), Plan (Eng plus Design plus DevEx review), Build, Review, Test (QA plus CSO), Ship, Reflect (retro). Source: gstack README.
The 23 Tools
The core inventory, sourced verbatim from the gstack README and docs/skills.md, accessed 2026-05-15.
| # | Slash Command | Role | What it does |
|---|---|---|---|
| 1 | /office-hours | YC Office Hours | Six forcing questions that reframe the product before any code |
| 2 | /plan-ceo-review | CEO / Founder | Scope review across four modes: expansion, selective, hold, reduction |
| 3 | /plan-eng-review | Eng Manager | Architecture, data flow, diagrams, edge cases, test matrix |
| 4 | /plan-design-review | Senior Designer | 0 to 10 ratings per design dimension; flags AI slop |
| 5 | /plan-devex-review | DX Lead | Interactive developer-experience audit, three modes |
| 6 | /design-consultation | Design Partner | End-to-end design system: research plus mockups |
| 7 | /design-shotgun | Design Explorer | 4 to 6 AI mockup variants with taste-memory learning |
| 8 | /design-html | Design Engineer | Mockup converted to production HTML, ~30KB, zero deps |
| 9 | /review | Staff Engineer | Production bug detection with auto-fixes and coverage audit |
| 10 | /investigate | Debugger | Systematic root-cause analysis, traces data flow, three-fix limit |
| 11 | /design-review | Designer Who Codes | Post-ship design audit and atomic-commit auto-fixes |
| 12 | /devex-review | DX Tester | Live onboarding audit with timing and error screenshots |
| 13 | /qa | QA Lead | Browser testing, fixes bugs, generates regression tests |
| 14 | /qa-only | QA Reporter | Bug reports only, no code edits |
| 15 | /cso | Chief Security Officer | OWASP Top 10 + STRIDE; 17 false-positive exclusions |
| 16 | /ship | Release Engineer | Sync, test, audit, push, open PR; bootstraps frameworks |
| 17 | /land-and-deploy | Release Engineer | Merge, run CI, deploy, verify production health |
| 18 | /canary | SRE | Post-deploy monitoring: console errors, perf, failures |
| 19 | /benchmark | Performance Engineer | Core Web Vitals, resource sizes, before-and-after diffs |
| 20 | /document-release | Technical Writer | Auto-updates docs, Diataxis coverage map |
| 21 | /document-generate | Doc Author | Generates missing reference, how-to, and tutorial docs |
| 22 | /retro | Eng Manager | Weekly retro, per-person breakdowns, streak tracking |
| 23 | /browse | QA Engineer | Real Chromium browser, ~100ms per command |
That is the 23-tool count locked to 2026-05-15. The README is moving fast. Pin the commit you cloned if you want a stable reference.
A Real Workflow End to End
A clean session moves through six commands in this order: think, plan, build, review, test, ship.
/office-hours: Six forcing questions reframe the feature. You answer in plain English. Output is a sharper scope./plan-ceo-review: A CEO persona checks scope across four modes (expand, selective, hold, reduce). Often kills work before it starts./plan-eng-review: An Eng Manager persona drafts architecture, data flow, edge cases, and a test matrix.- Implement the work as Claude normally would, against the plan.
/review: A Staff Engineer persona scans for production bugs and applies atomic auto-fixes. Coverage audit runs in the same pass./qa: A QA Lead persona opens a real Chromium browser, runs the flow, fixes bugs it finds, and writes regression tests./ship: A Release Engineer persona syncs, tests, audits, pushes, and opens a PR in one chain.
Each command keeps Claude in one role for the duration of that step. That is the design choice the rest of the system is built on.
The Power Tools Most Posts Skip
Beyond the 23 there is a second tier of utility commands: /codex, /careful, /freeze, /guard, /unfreeze, /open-gstack-browser, /setup-deploy, /gstack-upgrade, /setup-browser-cookies, /setup-gbrain, /sync-gbrain, /autoplan, /pair-agent, /context-restore, /learn. Two CLI binaries also ship with the repo: gstack-model-benchmark and gstack-taste-update.
The big one for teams is /pair-agent. It coordinates Claude Code, Codex, and Hermes against the same task. Garry sits on top of this layer when he runs many sessions at once.
"I regularly run 10-15 parallel sprints. That's the practical max right now." (gstack README)
What Garry Says, in His Own Words
Three quotes worth keeping near the workflow because they shape how to use it.
On the philosophy: "A single builder with the right tooling can move faster than a traditional team." (gstack README)
On parallelism: "gstack is powerful with one sprint. It is transformative with ten running at once." (gstack README)
On the cost: "I sleep, like, four hours a night right now. I have cyber psychosis." (SXSW with Bill Gurley, March 2026)
That last one matters. Throughput at this rate is not the average user's normal day. Treat the headline numbers as a ceiling, not a floor.
The Reception Was Split
The launch trended on Product Hunt and pulled 33k stars in week one (now 97k+). Garry's tweet hit 849k views. A CTO friend called it god mode. (TechCrunch)
The pushback was loud too. Mo Bitar shipped a critique calling gstack "a bunch of prompts in a text file." Sherveen Mashayekhi argued the visibility came from Tan's YC role, not from the artifact's merit. The Hacker News thread has the full debate.
The steelman of the critique is fair. gstack does not run novel infrastructure. Each skill is markdown plus a system prompt. You could write the same files yourself in a weekend.
The counter is also fair. Most people will not write the same files in a weekend. They will write three, ship none, and revert to free-form prompting. A finished, opinionated set, vetted by someone shipping 10K lines a week, is a different artifact than your own half-written prompts.
Both readings are correct. Pick the one your team needs to hear.
Five Lessons for Founders
- Roles beat prompts. Defining a persona with priorities and constraints produces more consistent output than asking one Claude to wear seven hats.
- Forcing functions before code.
/office-hoursexists because reframing the product is cheaper than rewriting it. - AI slop is a real failure mode. A dedicated Senior Designer skill (
/plan-design-review) exists specifically to catch it. - Ship is a workflow, not a button.
/shipand/land-and-deploychain sync, test, audit, push, deploy, verify into one command. - Parallelism multiplies leverage. Garry runs 10 to 15 sprints at once. Solo builders get a structured team. Team leads get a fleet.
gstack vs Default Claude Code
| Default Claude Code | gstack |
|---|---|
| Free-form prompting | Slash commands with defined roles |
| Whatever code style emerges | Architecture and design enforced by /plan-eng-review and /plan-design-review |
| Manual testing | /qa opens a real browser, fixes bugs, writes regression tests |
| Manual ship steps | /ship runs sync, test, audit, push, PR as one chain |
| Memory resets each session | /learn plus GBrain carries patterns across sessions |
| Single agent | /pair-agent coordinates Claude Code, Codex, Hermes |
Clone gstack in 5 Minutes
You need Claude Code, Git, Bun v1.0+, and Node.js if you are on Windows. A Claude Pro subscription ($20/mo) is required for Claude Code itself.
Run the canonical install:
git clone --single-branch --depth 1 https://github.com/garrytan/gstack.git \
~/.claude/skills/gstack && cd ~/.claude/skills/gstack && ./setupThen follow the loop. Start with /office-hours to reframe the product. Move to /plan-ceo-review and /plan-eng-review. Build. Run /review. Run /qa. Ship with /ship. That is the loop Garry is using to move at his stated rate.
Where gstack Stops
gstack gives you the workflow team. Twenty-three roles, a Think to Ship loop, parallel sprints, GBrain memory. It does not give you a SaaS to ship. It assumes you already have a product in front of it.
That gap is where Build This Now fits. Build This Now is the production skeleton the workflow operates on: auth (Google OAuth, email plus OTP, JWT, RLS), payments (Stripe checkout, subscriptions, webhooks, customer portal), database (PostgreSQL via Supabase with RLS on every table), email, file storage, background jobs, design system, landing page, legal pages, edge functions.
Adopt gstack on a blank repo and you spend the first three weeks rebuilding auth and payments before any of the 23 commands does real product work. Start with Build This Now and the skeleton is already there. gstack runs on top of it from day one. The workflow team plus the codebase the team builds on, $79 one-time on the CodeKit tier.
FAQ
What is gstack? An open-source collection of opinionated Claude Code skills built by Garry Tan that turns Claude Code into a virtual engineering team with seven specialist roles.
How many tools does gstack ship? 23 core slash commands as of 2026-05-15, plus around 15 supporting power tools and 2 CLI binaries.
What license? MIT.
Who created it? Garry Tan, President and CEO of Y Combinator. Open-sourced March 12, 2026.
How do I install it? git clone --single-branch --depth 1 https://github.com/garrytan/gstack.git ~/.claude/skills/gstack && cd ~/.claude/skills/gstack && ./setup
Does it require Claude Pro? Yes. Claude Code requires a Claude Pro subscription ($20/mo) or Team plan.
What is the role-based architecture? Each slash command embodies a specialist persona (CEO, Designer, Eng Manager, Release Manager, Doc Engineer, QA Lead, Chief Security Officer) with its own priorities, constraints, and outputs. Work moves through Think to Plan to Build to Review to Test to Ship to Reflect.
The CEO of YC published his exact Claude Code stack. The 23 commands are listed. The criticism is real. Pick the parts that fit your team and leave the rest.
Posted by @speedy_devv
Pare de configurar. Comece a construir.
Templates SaaS com orquestração de IA.
Padrões de Agentes
Orchestrator, fan-out, cadeia de validação, routing especializado, refinamento progressivo e watchdog. Seis formas de orquestração para ligar sub-agentes no Claude Code.
Agentes com Comportamento Humano
Padrões de personalidade para agentes do Claude Code: raciocinar em voz alta, admitir incerteza, avaliar trade-offs, fazer perguntas de acompanhamento. Um bloco para o CLAUDE.md que podes colar.