Claude Code is powerful. Pilot makes it reliable.

Start a task, grab a coffee, come back to production-grade code. Tests enforced. Context preserved. Quality automated.

⭐ Star this repo · 🌐 Website · 🔔 Follow for updates · 📋 Changelog · 📄 License

curl -fsSL https://raw.githubusercontent.com/maxritter/claude-pilot/main/install.sh | bash

Works on macOS, Linux, and Windows (WSL2).

Why I Built This

I'm a senior IT freelancer from Germany. My clients hire me to ship production-quality code — tested, typed, formatted, and reviewed. When something goes into production under my name, quality isn't optional.

Claude Code writes code fast. But without structure, it skips tests, loses context, and produces inconsistent results — especially on complex, established codebases where there are real conventions to follow and real regressions to catch. I tried other frameworks — they burned tokens on bloated prompts without adding real value. Some added process without enforcement. Others were prompt templates that Claude ignored when context got tight. None made Claude reliably produce production-grade code.

So I built Pilot. Instead of adding process on top, it bakes quality into every interaction. Linting, formatting, and type checking run as enforced hooks on every edit. TDD is mandatory, not suggested. Context is monitored and preserved across sessions. Every piece of work goes through verification before it's marked done.

Demo

A full-stack project — created from scratch with a single prompt, then extended with 3 features built in parallel using /spec. Every line of code was planned, implemented, tested, and verified entirely by AI. Zero manual code edits, zero bug fixes by a human.

Watch the full demo on YouTube · Browse the demo repository

Each /spec prompt one-shotted a complete feature — plan, TDD implementation, automated verification, and squash merge — all running simultaneously in isolated git worktrees.

Before & After

| Without Pilot | With Pilot | | --------------------------- | --------------------------------------------------------------- | | Writes code, skips tests | TDD enforced — RED, GREEN, REFACTOR on every feature | | No quality checks | Hooks auto-lint, format, type-check on every file edit | | Context degrades mid-task | Hooks preserve and restore state across compaction cycles | | Every session starts fresh | Persistent memory across sessions via Pilot Console | | Hope it works | Verifier sub-agents perform code review before marking complete | | No codebase knowledge | Production-tested rules loaded into every session | | Generic suggestions | Coding standards activated conditionally by file type | | Changes mixed into branch | Isolated worktrees — review and squash merge when verified | | Manual tool setup | MCP servers + language servers pre-configured and ready | | Requires constant oversight | Start a task, grab a coffee, come back to verified results |

Why This Approach Works

There are other AI coding frameworks out there. I tried them. They add complexity — dozens of agents, elaborate scaffolding, thousands of lines of instruction files — but the output doesn't improve proportionally. More machinery burns more tokens, increases latency, and creates more failure modes. Complexity is not a feature.

Pilot optimizes for output quality, not system complexity. The rules are minimal and focused. There's no big learning curve, no project scaffolding to set up, no state files to manage. You install it in any existing project — no matter how complex — run pilot, then /sync to learn your codebase, and the quality guardrails are just there — hooks, TDD, type checking, formatting — enforced automatically on every edit, in every session.

This isn't a vibe coding tool. It's built for developers who ship to production and need code that actually works. Every rule in the system comes from daily professional use: real bugs caught, real regressions prevented, real sessions where the AI cut corners and the hooks stopped it. The rules are continuously refined based on what measurably improves output.

The result: you can actually walk away. Start a /spec task, approve the plan, then go grab a coffee. When you come back, the work is done — tested, verified, formatted, and ready to ship. Hooks preserve state across compaction cycles, persistent memory carries context between sessions, quality hooks catch every mistake along the way, and verifier agents review the code before marking it complete. No babysitting required.

The system stays fast because it stays simple. Quick mode is direct execution with zero overhead — no sub-agents, no plan files, no directory scaffolding. You describe the task and it gets done. /spec adds structure only when you need it: plan verification, TDD enforcement, independent code review, automated quality checks. Both modes share the same quality hooks. Both modes benefit from persistent memory and hooks that preserve state across compaction.

Getting Started

Prerequisites

Claude Subscription: Solo developers should choose Max 5x for moderate usage or Max 20x for heavy usage. Teams and companies should use Team Premium which provides 6.25x usage per member plus SSO, admin tools, and billing management. Using the API instead may lead to much higher costs.

Installation

Works with any existing project. Pilot doesn't scaffold or restructure your code — it installs alongside your project and adapts to your conventions. cd into your project folder, then run:

curl -fsSL https://raw.githubusercontent.com/maxritter/claude-pilot/main/install.sh | bash

Choose your environment:

Local Installation — Install directly on your system using Homebrew. Works on macOS, Linux, and Windows (WSL2).
Dev Container — Pre-configured, isolated environment with all tools ready. No system conflicts and works on any OS.

After installation, run pilot or ccp in your project folder to start Claude Pilot.

8-step installer with progress tracking, rollback on failure, and idempotent re-runs:

Prerequisites — Checks Homebrew, Node.js, Python 3.12+, uv, git
Dependencies — Installs Vexor, playwright-cli, Claude Code
Shell integration — Auto-configures bash, fish, and zsh with pilot alias
Config & Claude files — Sets up .claude/ plugin, rules, commands, hooks, MCP servers
VS Code extensions — Installs recommended extensions for your stack
Dev Container — Auto-setup with all tools pre-configured
Automated updater — Checks for updates on launch with release notes and one-key upgrade
Cross-platform — macOS, Linux, Windows (WSL2)

Installing a Specific Version

If the current version has issues, you can install a specific stable version (see releases):

export VERSION=6.10.3
curl -fsSL https://raw.githubusercontent.com/maxritter/claude-pilot/main/install.sh | bash

Uninstalling

To completely remove Claude Pilot:

curl -fsSL https://raw.githubusercontent.com/maxritter/claude-pilot/main/uninstall.sh | bash

This removes the Pilot binary, plugin files, managed commands/rules, settings and shell aliases.

How It Works

/sync — Sync Rules & Standards

Run /sync to learn your existing codebase and sync rules with it. Explores your project structure, builds a semantic search index, discovers your conventions and undocumented patterns, updates project documentation, and creates new custom skills. This is how Pilot adapts to your project — not the other way around. Run it once initially, then anytime again:

pilot
> /sync

| Phase | Action | | ----- | --------------------------------------------------------------- | | 0 | Load reference guidelines, output locations, error handling | | 1 | Read existing rules and standards from .claude/ | | 2 | Build Vexor semantic search index (first run may take 5-15 min) | | 3 | Explore codebase with Vexor/Grep to find patterns | | 4 | Compare discovered vs documented patterns | | 5 | Sync/update project.md with tech stack and commands | | 6

claude-pilot

Related Skills