Skip to content
Back to all posts
January 14, 2026·10 min read

The "Creator’s Stack": The Definitive Guide to How Boris (Anthropic) Actually Uses Claude Code

A deep dive into the "vanilla" but highly disciplined setup used by the creator of Claude Code. Learn why verification loops and strict context management—not dangerous permissions—are the keys to scaling AI development in production.

LS
Lakshya Soni
The "Creator’s Stack": The Definitive Guide to How Boris (Anthropic) Actually Uses Claude Code

When a tool like Claude Code is released, the developer community immediately floods the internet with "hacks," complex agent chains, and experimental shortcuts. Everyone is looking for the "God Mode" prompt that automates their entire job.

But recently, we received something far more valuable than a hack: a detailed breakdown of the actual daily workflow of Boris, the creator of Claude Code at Anthropic.

His setup is not flashy. It isn't filled with dangerous permissions or chaotic agent swarms. It is surprisingly "vanilla." But in the world of high-stakes software engineering, "vanilla" is code for robust, scalable, and safe.

At EchoPulse, we analyzed his workflow and found that what looks simple on the surface is actually a highly disciplined system of constraints, verification loops, and context management. If you are trying to integrate Claude Code into a serious production environment, stop looking for magic tricks and start adopting the "Creator’s Stack."

Here is the definitive, deep-dive breakdown of how the tool is meant to be used at the highest level.

The Core Philosophy: Verification-Led Development

The single biggest mistake developers make with Claude Code is treating it like a magic wand. They give it a vague, high-level task—"Build me a snake game" or "Refactor this API"—and then get frustrated when the snake runs into a wall or the API throws a 500 error.

Boris explains this failure mode with a simple but profound analogy: Humans don't work that way, so why should AI?

If you asked a junior developer to ship a complex feature without testing it, they would fail too. The core of Boris’s workflow is Verification-Led Development. This goes beyond simply "checking the work" after it is done. It involves architecting a loop where Claude cannot proceed without proving it is right.

The "Test-First" Protocol If you are asking Claude to write code (e.g., Python or TypeScript), you don't just ask for the function. You explicitly instruct it to write the tests for that function first.

Automated Self-Verification Boris emphasizes that you don't even need to be the one coming up with the tests. You can automate this requirement in your configuration.

Domain-Specific Verification Verification isn't just for code. Boris highlights that verification changes based on the domain:

The Context Engine (The claude.md Strategy)

The Project Brain: Mastering the claude.md File

If Verification is the engine of this workflow, the claude.md file is the steering wheel.

Most users treat this file as a generic "ReadMe," throwing in a few links and forgetting about it. For the Anthropic team, this file is a living, breathing document that defines the intelligence of the session. It is the single most important asset in your repository.

Operational Modes & The Speed Paradox

Strategic Modes: Why Experts Don't Use "God Mode"

There is a feature in Claude Code called "dangerously skip permissions." To the uninitiated, this sounds like the ultimate efficiency hack. It allows the AI to execute any terminal command—file deletion, git pushes, deployment triggers—without asking for human confirmation. It sounds fast. It feels frictionless.

Boris does not use it.

Why would the creator of the tool refuse to use its fastest feature? Because in a production environment, speed is secondary to safety. A single hallucinated rm -rf command, a misguided database migration, or an accidental push to main can cost a company millions in downtime or data loss.

Instead of "God Mode," Boris utilizes a granular permission structure via settings.json.

The Speed Paradox: Why Slower Models Win

In the race for AI efficiency, everyone is obsessed with latency. "Haiku" is instant. "Sonnet" is quick. Yet, Boris uses Opus 4.5 with Thinking Enabled for almost everything.

To the average user, this makes no sense. Opus 4.5 is slower. It takes time to "think" (Chain of Thought). It costs more tokens. But Boris argues that we are measuring speed incorrectly. We shouldn't measure "Tokens Per Second"; we should measure "Time To Completion."

For complex orchestration, deep reasoning, and architectural decisions, raw speed is a trap.First-Time Accuracy is the only metric that matters.

What Belongs in claude.md? Boris describes this file not as documentation, but as a set of boundaries. To be effective, it must contain:

  1. The Tech Stack: Explicit versions (e.g., "We use Next.js 14, not 15") and libraries.
  2. The Directory Structure: A map of where key components live, so the agent doesn't get lost.
  3. The "Anti-Patterns": This is the most critical part. It shouldn't just list what to do; it must list what not to do.

The "Living Document" Habit Boris has operationalized this with his team. It is a habit at Anthropic to update the claude.md file multiple times a week.

Context Sharding for Scale Furthermore, Boris advocates for Context Sharding. In a massive full-stack application, you shouldn't have one massive prompt file that confuses the model.

Advanced Orchestration & The "Junior Dev" Mental Model

Orchestration at Scale: The Parallel Workflow

Boris doesn't just stare at one loading bar waiting for a response. He runs the tool like a Mission Control Center. He typically runs five Claude Code sessions in parallel.

Web Sessions: The "Fire and Forget" Strategy He heavily utilizes Web Sessions for asynchronous workflows.

Tool Integration: The Agent as Orchestrator Boris proves that Claude Code isn't just for writing code; it's for controlling your entire dev stack. He uses the agent to interact with Slack, BigQuery, and Sentry directly via their CLIs. Instead of context-switching between four different dashboards to debug an error, he asks Claude:"Check Sentry for the latest error logs, query BigQuery for the affected user IDs, and post a summary to the Slack engineering channel." The agent performs the orchestration; Boris makes the decision.

Conclusion: The "Junior Developer" Mental Model

The ultimate takeaway from Boris’s workflow is a fundamental shift in perspective. He does not treat Claude Code as a text generator or a search engine. He treats it as a Junior Developer.

Think about how you successfully manage a Junior Developer:

  1. You give them a Plan: You don't just say "build this." You write a spec (Plan Mode).
  2. You give them Rules: You give them an onboarding doc that explains how things are done (claude.md).
  3. You don't give them Root Access: You limit their permissions until they earn trust (settings.json).
  4. You ask for Tests: You refuse to merge their Pull Request until they prove it works (Verification Loops).

This is exactly how EchoPulse approaches AI architecture. We don't look for the magic button. We build the systems, the constraints, and the workflows that allow AI to function as a reliable, high-velocity part of your engineering team. The tool is powerful, but only if you respect the protocol.