sd0x-dev-flow

The harness layer for Claude Code.

Quality gates that AI can't skip. A reference implementation of AI Agent Harness Engineering for Claude Code — hook-enforced dual review, state-machine gates that survive context compaction, and fail-closed safety where it counts.

96 skills · 15 agents — ~4% of Claude's context window

What This Harness Does

Harness engineering is the discipline of engineering everything around the LLM — tool loops, context management, hooks, state machines, safety layers — as opposed to training the model itself. Mitchell Hashimoto coined the term in Feb 2026; Anthropic engineering and Martin Fowler have published on it; arXiv 2603.05344 formalizes it.

sd0x-dev-flow is a reference implementation. Each row below maps a canonical harness sub-problem to concrete code you can study:

| # | Harness sub-problem | sd0x-dev-flow implementation | Code evidence | |---|---------------------|------------------------------|---------------| | 1 | | → auto-loop with sentinel-driven transitions | + | | 2 | | / / gate markers parsed into durable state | (producer) + (parser) | | 3 | | stdout injection after SessionStart(compact) | | | 4 | | 5 hook event types dispatched to 8 scripts: PreToolUse / PostToolUse / Stop / SessionStart / UserPromptSubmit | (8 scripts) + | | 5 | | Skill frontmatter — e.g., has no Edit/Write | 86 of 95 public skills declare | | 6 | | 5 layers: pre-edit-guard → commit-msg-guard → pre-push-gate → stop-guard → sidecar fail-closed marker | + + | | 7 | | Dual review: Codex (primary) + Claude (secondary) dispatched in parallel on every review cycle | + (Dual Review Mode) | | 8 | | + + convergence plateau detection | (exit conditions + strategic reset) | | 9 | | confirmation + for destructive ops | + | | 10 | | Correction → record lesson → promote to rule after 3+ recurrences | |

sd0x-dev-flow

sd0x-dev-flow

What This Harness Does

Related Skills

Why sd0x-dev-flow?

Quick Start

How It Works

Feature Spotlight: Dual-Reviewer Architecture

How We Compare

When to Use

Install

Codex CLI / Other AI Agents

Workflow Tracks