by a5c-ai
Babysitter enforces obedience to agentic workforces and enables them to manage extremely complex tasks and workflows through deterministic, hallucination-free self-orchestration
# Add to your Claude Code skills
git clone https://github.com/a5c-ai/babysitterhttps://a5c.ai
Enforce obedience to agentic workforces. Manage extremely complex workflows through deterministic, hallucination-free self-orchestration.
Getting Started | Documentation | Community
https://github.com/user-attachments/assets/8c3b0078-9396-48e8-aa43-5f40da30c20b
No comments yet. Be the first to share your thoughts!
Babysitter enforces obedience to agentic workforces, enabling them to manage extremely complex tasks and workflows through deterministic, hallucination-free self-orchestration. Define your workflow in code - Babysitter enforces every step, ensures quality gates pass before progression, requires human approval at breakpoints, and records every decision in an immutable journal. Your agents do exactly what the process permits, nothing more.
Babysitter supports multiple AI coding harnesses. Install the plugin for your harness of choice:
Native marketplace install:
claude plugin marketplace add a5c-ai/babysitter
claude plugin install --scope user babysitter@a5c.ai
Restart Claude Code, then type /skills to verify "babysit" appears.
After cloning this repo, From within the Codex CLI:
codex
> /plugins
Navigate to the "babysitter" entry and select "Install".
Via the Cursor marketplace or npm:
npm install -g @a5c-ai/babysitter-cursor
npm install -g @a5c-ai/babysitter-gemini
babysitter-gemini install --global
Via the GitHub Copilot CLI marketplace, or:
npm install -g @a5c-ai/babysitter-github
Native Pi plugin install:
pi install npm:@a5c-ai/babysitter-pi
Native omp plugin install:
omp plugin install @a5c-ai/babysitter-omp
npm install -g @a5c-ai/babysitter-opencode
The postinstall script copies the plugin to .opencode/plugins/babysitter/ automatically.
Babysitter ships with a built-in internal harness that runs processes programmatically without any external AI coding agent. This is useful for CI/CD pipelines, scripts, automated testing, and headless orchestration:
npm install -g @a5c-ai/babysitter-sdk
# Run a process definition using the internal harness
babysitter harness:call --harness internal --process .a5c/processes/my-process.js#process --workspace .
# Or run a free-form prompt
babysitter harness:call --harness internal --prompt "run lint and tests" --workspace .
The internal harness uses the SDK's built-in Pi execution engine directly. It supports all capabilities (Programmatic, SessionBinding, StopHook, HeadlessPrompt) and requires no external CLI.
During process execution, the internal harness can delegate tasks to any discovered installed harness via the invoker. A process running under --harness internal can spawn subagent tasks that execute through Claude Code, Codex, Gemini CLI, or any other harness found on the system -- the SDK discovers available harness CLIs at runtime and routes task execution accordingly. This means you can orchestrate a multi-agent workflow from a single headless entry point, with different tasks delegated to whichever harness is best suited for them.
Babysitter has its own plugin system -- and it works differently from what you might expect. A babysitter plugin is not a code module with extension points. It's a set of natural language instructions (markdown files) or deterministic coded processes (JS files) that an AI agent reads and executes. The SDK stores, versions, and distributes the instructions. The AI agent is the runtime.
This means a plugin can do anything an AI agent can do: install npm packages, generate CI/CD pipelines, set up git hooks, create Terraform configs, modify your linter rules, copy babysitter processes into your project, and interview you about your preferences along the way.
The official marketplace includes plugins for security (gitleaks, ESLint security rules, audit processes), testing (Vitest/Playwright/pytest setup, coverage gates, TDD processes), deployment (Terraform, Helm, Dockerfiles, multi-environment pipelines), themes (sound effects, design systems, conversational personality), CI/CD (GitHub Actions workflows), and rate limiting (exponential backoff hooks).
To manage plugins, use the /babysitter:plugins command inside your harness (or babysitter harness:plugins from the CLI). The agent reads the plugin's install instructions, interviews you, analyzes your project, and executes the setup -- all within a babysitter orchestration run.
See the full Plugins documentation for details on how installs work, the marketplace format, creating your own plugins, and the migration system.
After installation, set up your environment:
/babysitter:user-install
This creates your personal profile with:
/babysitter:project-install
This analyzes your codebase and configures:
/babysitter:doctor
Run diagnostics to confirm everything is working.
claude "/babysitter:call implement user authentication with TDD"
Or in natural language:
Use the babysitter skill to implement user authentication with TDD
Claude will create an orchestration run, execute tasks step-by-step, handle quality checks and approvals, and continue until completion.
| Mode | Command | When to Use |
|------|---------|-------------|
| Interactive | /babysitter:call | Learning, critical workflows - pauses for approval |
| Autonomous | /babysitter:yolo | Trusted tasks - full auto, no breakpoints |
| Planning | /babysitter:plan | Review process before executing |
| Continuous | /babysitter:forever | Monitoring, periodic tasks - runs indefinitely |
| Command | Purpose |
|---------|----------|
| /babysitter:doctor | Diagnose run health and issues |
| /babysitter:observe | Launch real-time monitoring dashboard |
| /babysitter:resume | Continue an interrupted run |
| /babysitter:help | Documentation and usage help |
Beyond the in-session skill commands (/babysitter:call, etc.), the Babysitter SDK provides harness:* CLI commands that let you create, run, and manage orchestration sessions from the terminal. These commands work with any installed harness.
# Run a process interactively via Claude Code (pauses at breakpoints)
babysitter harness:call --harness claude-code --prompt "implement user authentication with TDD" --workspace .
# Run fully autonomous (no breakpoints)
babysitter harness:yolo --harness claude-code --prompt "add pagination to the API" --workspace .
# Plan only (stops after Phase 1)
babysitter harness:plan --harness claude-code --prompt "implement feature X"
# Run with the internal harness (no external AI agent needed)
babysitter harness:call --harness internal --prompt "run lint and tests" --workspace .
# Resume an interrupted run
babysitter harness:resume --run-id <runId> --harness claude-code --workspace .
# Diagnose run health
babysitter harness:doctor --run-id <runId>
# Analyze past runs for insights
babysitter harness:retrospect --all --harness claude-code --workspace .
# Clean up old runs
babysitter harness:cleanup --keep-days 7 --harness claude-code --workspace .
# See which harness CLIs are installed on your system
babysitter harness:discover
# Install a harness CLI
babysitter harness:install claude-code
# Install a harness plugin
babysitter harness:install-plugin claude-code