ralph

Name: ralph
Author: SantanderAI

Pending

A configurable Bash/PowerShell loop that runs an AI coding CLI with a fresh session each iteration

55stars

14forks

PowerShell

Installation

# Add to your Claude Code skills
git clone https://github.com/SantanderAI/ralph

Getting Started

Guides for using ai agents skills like ralph.

Caveman: Cut Claude Token Use by 65%
How agent-side prompt compression works, when to use it, and when not to.
What is an AI Skills Marketplace?
Definitions, how marketplaces work, and how to choose between them in 2026.
Getting Started with AI Skills

README.md

Frequently Asked Questions

What is ralph?

ralph is an open-source ai agents skill for AI coding assistants such as Claude Code, Codex CLI, and ChatGPT, built by SantanderAI. A configurable Bash/PowerShell loop that runs an AI coding CLI with a fresh session each iteration. It has 55 GitHub stars.

Is ralph safe to use?

ralph's catalog security scan is still queued. You can run an instant dependency and prompt-injection check now with the "Scan for vulnerabilities" button above.

How do I install ralph?

Clone the repository with "git clone https://github.com/SantanderAI/ralph" and add it to your Claude Code skills directory (see the Installation section above).

What programming language is ralph written in?

ralph is primarily written in PowerShell. It is open-source under SantanderAI on GitHub, so you can review or fork the full source.

Are there alternatives to ralph?

Yes. SkillsLLM lists many other AI Agents skills you can browse and compare side by side. Open the AI Agents category from the badge at the top of this page, or use the Related Skills and comparison links further down to weigh ralph against similar tools.

Agentic AI for Beginners

Build your first AI agent from scratch - tool use, ReAct pattern, memory, deployment

41 minBeginner

Comments (0)

to leave a comment.

No comments yet. Be the first to share your thoughts!

Related Skills

superpowers

by obra

An agentic skills framework & software development methodology that works.

234,966

not-fade-away orangeo-ai-visibility-skill

ralph

Open source by Santander AI Lab. A dependency-free Bash / PowerShell developer tool that runs an AI coding CLI / LLM agent in a loop, starting a fresh agent session on every iteration for long unattended runs.

Part of Santander AI Open Source — open source AI projects from Banco Santander (santander.com).

ralph runs an AI coding CLI in a loop, starting a fresh session on every iteration and feeding it the same prompt. It is a thin, dependency-free Bash wrapper (ralph-loop.sh) around the CLIs you already have installed — Codex, Claude Code, Gemini CLI, and Devin CLI — so you can drive long, unattended "keep working on this until it's done" runs.

The name comes from the "Ralph Wiggum" technique: run the same prompt against a clean agent over and over, letting the work accumulate in the repository between runs.

How it works

Each iteration the script:

Checks for a stop signal (stop.md) and exits cleanly if found.
Reloads configuration (see Live reload).
Launches the selected CLI as a brand-new session with your prompt piped to it, running in the directory where you invoked the script.
Writes a timestamped log to .ralph/logs/ and rotates old logs.

Because every iteration is a fresh session, all continuity has to live in the workspace itself: the files the agent edits, a plan, notes, etc. The prompt should tell the agent to read that state and make incremental progress.

Requirements

Bash.
At least one of the supported CLIs on your PATH: codex, claude, gemini, or devin.
Optionally just for the install recipe.

Installation

Install the script to ~/.local/bin:

just install

Or just copy ralph-loop.sh somewhere on your PATH and make it executable.

Usage

ralph-loop.sh MAX_ITERATIONS PROMPT_FILE

MAX_ITERATIONS — positive integer; the maximum number of loops to run.
PROMPT_FILE — a file whose contents are sent to the CLI as the prompt.

Example — run Claude with a high-capability model up to 25 times. Configuration lives in .ralph/.env (created with defaults on first run), so edit it and then launch:

# .ralph/.env
RALPH_TOOL=claude
RALPH_MODEL_CAPABILITY=high

ralph-loop.sh 25 prompt.md

The directory you invoke the script from is treated as the workspace. Run ralph-loop.sh with the wrong number of arguments to print full inline help.

Configuration

Behavior is controlled entirely through .ralph/.env in your workspace, created with these defaults the first time you run the loop. It is the only configuration source — the shell environment is ignored. Quote values that contain spaces. The most common keys:

Variable	Values	Default	Purpose
`RALPH_TOOL`	`codex`, `claude`, `gemini`, `devin`	`codex`	Which CLI to run.
`RALPH_MODEL_CAPABILITY`	`low`, `med`, `high`	`med`	Normalized model tier, mapped per tool.
`RALPH_THINKING`	`true`, `false`	`false`	Best-effort reasoning/thinking toggle.
`RALPH_SWITCH_ON_EXHAUSTION`	`true`, `false`	`true`	On a failed iteration, auto-switch agent if it ran out of tokens.
`RALPH_MEMORY_MAX`	systemd memory value (`8G`, `512M`, bytes, `%`) or empty	`8G`	Hard RAM limit for the agent, kernel-enforced via a systemd user scope; the agent is OOM-killed if it exceeds it. Empty disables.
`RALPH_LOOP_MAX_LOGS`	positive integer	`min(MAX_ITERATIONS, 50)`	Logs to retain in `.ralph/logs`.

Each tool also has overrides for its command, flags, and the model used for each capability tier, e.g. RALPH_CLAUDE_COMMAND, RALPH_CLAUDE_FLAGS, RALPH_CLAUDE_MODEL_HIGH, and the equivalents for CODEX, GEMINI, and DEVIN. The capability tier (low/med/high) is translated into the right per-tool model name and reasoning effort / thinking budget automatically.

Devin is special in two ways: it has no reasoning/thinking knob (so RALPH_THINKING is ignored for it and the tier only selects the model), and it reads the prompt from --prompt-file rather than stdin. Its RALPH_DEVIN_MODEL_* defaults point at Devin's Claude models (claude-haiku-4.5, claude-sonnet-4.6, claude-opus-4.8); run devin --model x to print the full list of valid identifiers. On the Devin Free tier, override all three tiers with swe-1.6-slow, the only model that plan can access.

See the inline help in ralph-loop.sh for the complete list.

Live reload of configuration

.ralph/.env is sourced before every iteration, so the loop can be reconfigured while it is running — no restart needed. Edit the file mid-run and the change takes effect on the next iteration.

# .ralph/.env
RALPH_TOOL=claude
RALPH_MODEL_CAPABILITY=high
RALPH_THINKING=true

Notes:

Plain KEY=value lines work; quote values that contain spaces.
An invalid value is reported and the previous good configuration is kept, so a typo will not abort the loop.
The file is sourced as plain shell variables (not exported), so the shell environment is never a configuration channel. RALPH_LOCAL_DIR (the script-managed .ralph directory) is the only exported variable and cannot be moved by the file.

Stopping the loop

Create a stop.md file to stop before the next iteration:

stop.md in the invocation directory, or
any stop.md anywhere inside the plan/ subtree.

The script checks for it before each iteration and exits cleanly without deleting the file. If a matching stop.md already exists at startup, the loop exits immediately. This lets the agent itself signal "I'm done" by creating the file as part of its work.

Auto-switching agent on token exhaustion

When an iteration exits non-zero, the loop can detect that the current agent ran out of tokens (usage/quota/credits or rate limit) and switch to a different agent automatically. This is on by default (RALPH_SWITCH_ON_EXHAUSTION=true).

Agent rotation

Switching follows a fixed cycle, where each agent's successor is both the detector consulted on failure and the agent switched to:

codex → claude → gemini → devin → codex

Failed agent	Detector / next agent
`codex`	`claude`
`claude`	`gemini`
`gemini`	`devin`
`devin`	`codex`

The successor is always a different agent from the one that failed. Because each exhaustion advances one step, repeated token exhaustion walks the full cycle (e.g. codex exhausted → claude; if claude later exhausts → gemini, and so on, wrapping back to codex).

How it works

After a non-zero iteration, the next agent in the rotation is launched as a cheap, low-capability "detector". It reads the tail of the failed iteration's log.
The detector answers a single structured line — TOKENS_EXHAUSTED=true or TOKENS_EXHAUSTED=false.
If true, RALPH_TOOL is rewritten in .ralph/.env to that next agent, so the next iteration runs with it (the failed iteration is not retried). If false, nothing changes.

If the next agent's CLI is not installed, the check is skipped and no switch happens. The whole detection (detector agent, its output, and the decision) is appended to the iteration's log. Set RALPH_SWITCH_ON_EXHAUSTION=false to disable.

Hard RAM limit

Agents can leak or balloon in memory over long autonomous runs. RALPH_MEMORY_MAX (default 8G) caps the RAM available to the agent process. The limit is enforced by the Linux kernel: each iteration runs the agent inside a transient systemd user scope created with

systemd-run --user --scope -p MemoryMax=<RALPH_MEMORY_MAX> -p MemorySwapMax=0 -- <agent>

MemorySwapMax=0 keeps the cap on real RAM rather than letting it spill to swap. If the agent exceeds the limit it is OOM-killed, the iteration exits non-zero (and is then subject to the usual exhaustion check / agent switch). The value accepts any systemd memory format (8G, 512M, raw bytes, or a percentage of total RAM); set it empty to disable the limit.

If systemd-run user scopes are not available on the host, the agent runs without a limit and a one-time warning is printed.

Skills

This repo also distributes the skills that ralph itself relies on, so they travel with the project instead of living only under ~/.claude / ~/.codex. The same approach as rolemaster: the source of truth is skills/<name>/SKILL.md, and just recipes copy them into each tool's user-level skills directory.

Skill	What it does
`juez`	Independent reviewer of `plan/task/*.md`: inserts `[juez]` checkpoints, judges reproducible evidence, unblocks stuck blocks, and (only on `/juez auditar`) audits global progress in read-only mode.
`maestro`	Curator of the project's local skills: after each task its `review` action reads the plan and loop logs and creates, extends, or deletes local skills (methodologies, pitfalls, decisions) under `skills/` so future planning and executi