dryforge

Name: dryforge
Author: fn-opt

Pending

Dryforge : New generation of harness engineering - Claude Code & Codex Plugin.

64stars

8forks

Shell

Installation

# Add to your Claude Code skills
git clone https://github.com/fn-opt/dryforge

Getting Started

Guides for using ai agents skills like dryforge.

Caveman: Cut Claude Token Use by 65%
How agent-side prompt compression works, when to use it, and when not to.
What is an AI Skills Marketplace?
Definitions, how marketplaces work, and how to choose between them in 2026.
Getting Started with AI Skills

README.md

Frequently Asked Questions

What is dryforge?

dryforge is an open-source ai agents skill for AI coding assistants such as Claude Code, Codex CLI, and ChatGPT, built by fn-opt. Dryforge : New generation of harness engineering - Claude Code & Codex Plugin. It has 64 GitHub stars.

Is dryforge safe to use?

dryforge's catalog security scan is still queued. You can run an instant dependency and prompt-injection check now with the "Scan for vulnerabilities" button above.

How do I install dryforge?

Clone the repository with "git clone https://github.com/fn-opt/dryforge" and add it to your Claude Code skills directory (see the Installation section above).

What programming language is dryforge written in?

dryforge is primarily written in Shell. It is open-source under fn-opt on GitHub, so you can review or fork the full source.

Are there alternatives to dryforge?

Yes. SkillsLLM lists many other AI Agents skills you can browse and compare side by side. Open the AI Agents category from the badge at the top of this page, or use the Related Skills and comparison links further down to weigh dryforge against similar tools.

Agentic AI for Beginners

Build your first AI agent from scratch - tool use, ReAct pattern, memory, deployment

41 minBeginner

Comments (0)

to leave a comment.

No comments yet. Be the first to share your thoughts!

Related Skills

superpowers

by obra

An agentic skills framework & software development methodology that works.

234,966

build-ai-agent-platform hermes-desktop

dryforge

A plugin harness for Claude Code & Codex.

Your agent works like a senior developer.

Zero-ceiling architecture, uncapped reasoning. Exhaustive elicitation, lossless intent. Bounded autonomy, zero-drift execution.

Website

Install — Claude Code

/plugin marketplace add fn-opt/dryforge
/plugin install dryforge

Codex

codex plugin marketplace add fn-opt/dryforge
codex plugin add dryforge@dryforge

Requires git and Claude Code or Codex.

Update — Codex picks up new releases automatically at startup. Claude Code: enable auto-update for the dryforge marketplace in /plugin → Marketplaces (off by default for community marketplaces), or update manually:

# Claude Code
/plugin marketplace update dryforge
/plugin update dryforge@dryforge

# Codex (immediately, without restarting)
codex plugin marketplace upgrade dryforge

Every prompt is underspecified

"Build a booking system" reads like a requirement. It's a goal statement — the requirement-level decisions are all still open: booking-to-service cardinality, the lifecycle of bookings whose service is retired, whether cancellation releases the slot or holds it.

An agent does not stop at those gaps. It resolves each one inline with a plausible default and ships code that compiles, runs, and demos cleanly. One booking = one service is now load-bearing schema with no decision record behind it; the cost surfaces months later as a migration, the day packages become a feature.

Prompts carry intent. Implementations are fixed by decisions. An unsupervised agent makes every decision you never saw — and reports none of them.

The second failure compounds the first: nothing persists. Whatever was decided lives in the session transcript, and the transcript dies with the session. The next session re-derives project state from code alone — and code encodes outcomes, not rationale. The result is structural drift: settled questions re-litigated, invariants re-implemented incompatibly, conventions diverging module by module.

dryforge intervenes at both points: the open decisions are enumerated before code exists, and every resolution is recorded with its rationale at the path every future session reads first.

Three commands

  /ready <INPUT>  ──▶  /go      ──▶  working software + the project harness

  # <INPUT> — an idea, a spec, scattered notes: anything, or nothing
  # short for /dryforge:ready · /dryforge:go · /dryforge:migration


  Already have running code?
  /migration  grafts the harness onto it   (one-time; afterwards /ready → /go)

Command	Consumes	Produces
`/dryforge:ready`	anything — one line to full documents	a reviewed, executable design contract
`/dryforge:go`	the approved contract	verified code + the project harness
`/dryforge:migration`	an existing codebase	the project harness (one-time)

`/ready` — from intent to executable spec

/ready accepts arbitrary input — a one-line idea, a requirements document generated by another tool, scattered notes, or nothing but a hunch. All of it enters with the same status: material under challenge, not ground truth. A document's existence is no evidence of the design conversation behind it. Conflicts inside the input become questions, not silent picks; embedded code fragments are reduced to the behavioral contract they encode — inputs, outputs, invariants — rather than carried forward as implementation.

From that material, /ready enumerates the decisions the design is obligated to answer — stated or not — and drives each to one of two terminal states: derived from your expressed intent, or asked. Silent defaulting is not a terminal state.

Question volume is bounded by construction:

Derivable from previous answers → never asked.
A tuning value inside a confirmed mechanism → defaulted, recorded as tunable.
Every question leads with a recommendation — accepting it is one keypress.
Domain questions carry an open "none of these" — your domain knowledge outranks the option list.

On a project's first cycle, /ready additionally runs the conversation that precedes any serious build: what the project is and is not, the domain model — entities, lifecycles, rules — and the stack, recommended with explicit trade-offs when you bring none. No technical direction is fixed by silence.

Before the result reaches you, it passes independent review by an agent that did not author it — checking that nothing you said was dropped or distorted, and that the artifact is executable as written. Your approval is the only event that makes it final.

`/go` — execution that only passes on evidence

/go consumes the approved contract and owns all git state from that point.

Scheduling. The plan carries an explicit dependency graph. /go validates it — cycles, dangling references, coverage gaps — before any git mutation; a malformed graph fails fast as a producer defect, not something to patch at execution time. Independent tasks execute concurrently, each implementer in an isolated git worktree, and re-enter through a merge gate that verifies the branch actually advanced and the diff touches its declared surface. Implementer self-reports carry no weight.

Verification is risk-proportional. A mechanical rename and a payment-path change do not get the same ceremony. High-risk tasks receive independent review against the spec slice and the raw diff — never the implementer's summary, which is how reviewers get anchored.

Gates, end to end.

Each parallel wave ends with the project's verification suite running against the merged base — the first point at which cross-task interactions exist.
Completion re-runs full verification and adds runtime smoke: a spec-declared service must boot and answer a live request. Compiles and works are different claims.
A verification that cannot be evaluated — the command died before asserting anything — is a failure, not an inferred pass.

Escalation is synchronous. A blocked task halts and waits for your answer. It does not assume one and build on top of it.

Git stays yours. Existing projects execute on a feature branch; main is never written directly, and final integration — merge, PR, manual — is never autonomous. A dirty working tree or unpushed commits abort the run before it starts.

When everything passes, /go writes or updates the project harness, runs one final independent review across the full change, and archives the design contract.

`/migration` — onboarding an existing codebase

A one-time conversion, not a task runner. /migration scans the codebase, then elicits precisely what code cannot attest: a code path shows what an auth check does, not whether that is the entire policy. The elicitation is risk-weighted — what is inferable and cheap to get wrong is inferred; what is inferable but expensive to get wrong is confirmed with you: domain invariants, security boundaries, the business model behind the checks.

Questions arrive in plain language — "if this changes, must that change with it?" — and answers are compiled back into precise rules. It generates the full harness, leaves the commit to you, and exits. From then on, the project runs on /ready → /go.

Anatomy of a cycle

/ready → /go is one pipeline with exactly two approval points — both yours. Everything between them runs autonomously.

/ready    decompose input → resolve every open decision → write the contract
          → independent review → ▶ your approval

/go       validate the graph → execute in parallel waves → integration gates
          → runtime smoke → write/update the harness → final review
          → ▶ your approval → archive the contract

What /ready leaves on disk is a three-document design contract in .dryforge/:

spec — the authority on what: behavior rules, invariants, API surface, every edge case with an explicit disposition, and the verifications the result must pass.
plan — the blueprint for how: per-task behavior contracts and the dependency graph /go schedules from.
handoff — the governing document: how the three relate, and the hard gates no step may cross.

The contract is self-contained by construction — written so a future agent can act on it without the conversation that produced it. Decisions that cannot be re-derived from code carry their reasoning inline. The authority hierarchy is explicit: when spec and code disagree, spec wins. When the spec itself looks wrong, the agent does not patch it — it comes back to you.

After /go completes, the contract is archived under .dryforge/, cycle by cycle — a durable record of what was decided, when, and why.

What persists — the project harness

your-project/
├── CLAUDE.md                  # entry point for Claude Code — identity + work rules
├── AGENTS.md                  # entry point for Codex — identical content
├── docs/
│   ├── architecture.md        # composition: components, flow, dependencies
│   ├── business-rules.md      # domain logic: entities, invariants, edge cases
│   ├── security.md            # policy: protected assets, access, audit
│   ├── standards.md           # the rules: hard gates, conventions, boundaries
│   ├── engineering-notes.md   # hard-won knowledge: traps, mechanisms, checklists
│   ├── operations.md          # how to run it: setup, build, deploy
│   ├── contracts.md           # external interface contracts
│   └── tracking/
│       ├── status.md          # where the project stands vs. its full scope
│       ├── decisions/         # decision records — what was chosen, and why
│       └── findings.md        # k

dryforge

Frequently Asked Questions

What is dryforge?

Is dryforge safe to use?

How do I install dryforge?

What programming language is dryforge written in?

Are there alternatives to dryforge?

Related Skills

dryforge

A plugin harness for Claude Code & Codex.

Every prompt is underspecified

Three commands

/ready — from intent to executable spec

/go — execution that only passes on evidence

/migration — onboarding an existing codebase

Anatomy of a cycle

What persists — the project harness

`/ready` — from intent to executable spec

`/go` — execution that only passes on evidence

`/migration` — onboarding an existing codebase