atomic-agent

Name: atomic-agent
Author: AtomicBot-ai

Local-first AI agent. Optimized for local AI models. Long context window. Proper tool calling. Runs privately on your device.

1,091 stars

153 forks

TypeScript

Installation

# Add to your Claude Code skills
git clone https://github.com/AtomicBot-ai/atomic-agent

Security Report

Last scanned: 6/17/2026

{
  "issues": [
    {
      "type": "npm-audit",
      "message": "@vitest/mocker: Vulnerability found",
      "severity": "medium"
    },
    {
      "type": "npm-audit",
      "message": "esbuild: esbuild enables any website to send any requests to the development server and read the response",
      "severity": "high"
    },
    {
      "type": "npm-audit",
      "message": "exceljs: Vulnerability found",
      "severity": "medium"
    },
    {
      "type": "npm-audit",
      "message": "fast-xml-builder: fast-xml-builder allows attribute values with unwanted quotes to bypass malicious or unwanted attributes",
      "severity": "high"
    },
    {
      "type": "npm-audit",
      "message": "hono: hono: Path traversal in `serve-static` on Windows via encoded backslash (`%5C`)",
      "severity": "high"
    },
    {
      "type": "npm-audit",
      "message": "node-notifier: Vulnerability found",
      "severity": "medium"
    },
    {
      "type": "npm-audit",
      "message": "shell-quote: shell-quote quote() does not escape newlines in object .op values",
      "severity": "critical"
    },
    {
      "type": "npm-audit",
      "message": "tmp: tmp has Path Traversal via unsanitized prefix/postfix that enables directory escape",
      "severity": "high"
    },
    {
      "type": "npm-audit",
      "message": "tsx: Vulnerability found",
      "severity": "high"
    },
    {
      "type": "npm-audit",
      "message": "uuid: uuid: Missing buffer bounds check in v3/v5/v6 when buf is provided",
      "severity": "medium"
    },
    {
      "type": "npm-audit",
      "message": "vite: Vite Vulnerable to Path Traversal in Optimized Deps `.map` Handling",
      "severity": "high"
    },
    {
      "type": "npm-audit",
      "message": "vite-node: Vulnerability found",
      "severity": "medium"
    },
    {
      "type": "npm-audit",
      "message": "vitest: Vulnerability found",
      "severity": "critical"
    },
    {
      "type": "npm-audit",
      "message": "ws: ws: Uninitialized memory disclosure",
      "severity": "high"
    },
    {
      "file": "README.md",
      "line": 224,
      "type": "secret-exfiltration",
      "message": "Instruction appears to send credentials/secrets to an external endpoint",
      "severity": "medium"
    },
    {
      "file": "README.md",
      "line": 79,
      "type": "remote-install",
      "message": "Install command (remote install script piped to a shell — review the source before running): \"curl -fsSL https://api.atomicbot.ai/agent-install | sh\"",
      "severity": "low"
    },
    {
      "file": "starter-skills/notion/SKILL.md",
      "line": 64,
      "type": "secret-exfiltration",
      "message": "Instruction appears to send credentials/secrets to an external endpoint",
      "severity": "medium"
    }
  ],
  "status": "FAILED",
  "scannedAt": "2026-06-17T09:04:49.588Z",
  "npmAuditRan": true,
  "pipAuditRan": true,
  "promptInjectionRan": true
}
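A report like the one above is easier to triage once it is grouped by severity. The following is a minimal sketch, not part of SkillsLLM or atomic-agent: the field names follow the JSON above, while the type names `Severity` and `ScanIssue` and the helper `countBySeverity` are assumptions made for illustration.

```typescript
// Field names follow the report above; the type names are illustrative.
type Severity = "low" | "medium" | "high" | "critical";

interface ScanIssue {
  type: string;
  message: string;
  severity: Severity;
  file?: string; // present only on file-level findings
  line?: number;
}

// Count findings per severity so the worst ones can be triaged first.
function countBySeverity(issues: ScanIssue[]): Record<Severity, number> {
  const counts: Record<Severity, number> = { low: 0, medium: 0, high: 0, critical: 0 };
  for (const issue of issues) {
    counts[issue.severity] += 1;
  }
  return counts;
}

// Three entries copied from the report; pass the full "issues" array in practice.
const sampleIssues: ScanIssue[] = [
  { type: "npm-audit", message: "vitest: Vulnerability found", severity: "critical" },
  { type: "npm-audit", message: "ws: ws: Uninitialized memory disclosure", severity: "high" },
  {
    file: "README.md",
    line: 224,
    type: "secret-exfiltration",
    message: "Instruction appears to send credentials/secrets to an external endpoint",
    severity: "medium",
  },
];

console.log(countBySeverity(sampleIssues));
```

Applied to the full report above, the same tally comes to 2 critical, 7 high, 7 medium, and 1 low finding across 17 entries.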

Frequently Asked Questions

What is atomic-agent?

atomic-agent is an open-source AI agent skill for AI coding assistants such as Claude Code, Codex CLI, and ChatGPT, built by AtomicBot-ai. It is a local-first AI agent, optimized for local AI models, with a long context window and proper tool calling, and it runs privately on your device. It has 1,091 GitHub stars.

Is atomic-agent safe to use?

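atomic-agent failed SkillsLLM's automated security scan (status FAILED, last scanned 6/17/2026). The scan flagged 2 critical and 7 high-severity findings, all of them npm-audit results for dependencies, plus several medium and low findings. Review the Security Report section carefully before using it.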

How do I install atomic-agent?

Clone the repository with "git clone https://github.com/AtomicBot-ai/atomic-agent" and add it to your Claude Code skills directory (see the Installation section above).

What programming language is atomic-agent written in?

atomic-agent is primarily written in TypeScript. It is open-source under AtomicBot-ai on GitHub, so you can review or fork the full source.

Are there alternatives to atomic-agent?

Yes. SkillsLLM lists many other AI Agents skills you can browse and compare side by side. Open the AI Agents category from the badge at the top of this page, or use the Related Skills and comparison links further down to weigh atomic-agent against similar tools.


Related Skills

ECC

by affaan-m

The agent harness performance optimization system. Skills, instincts, memory, security, and research-first development for Claude Code, Codex, Opencode, Cursor and beyond.

236,687

ai-prompts mcp-searxng

Atomic Agent

A local-first AI agent that runs on your machine, with local or cloud models.

Drives your browser, edits files, runs approved commands, and remembers context across sessions. Open source, running on our TurboQuant llama.cpp for +30-50% throughput on small local models.

Quick Install · Benchmarks · Why Local-First · Ways to Use It · Docs

atomic-agent terminal demo

Atomic Agent is a local-first AI agent that keeps the control loop and all state on your machine. It drives your desktop: it browses, reads and edits files, runs approved shell commands, inspects documents, remembers context across sessions, schedules follow-ups, and calls external tools over MCP. You can embed it in your own apps over HTTP or as a Tauri sidecar. It is llama.cpp first, so small quantized models stay useful for long, multi-step work on consumer hardware.

Quick Install

macOS / Linux:

curl -fsSL https://atomicagent.io/install | sh

Windows (PowerShell):

irm https://atomicagent.io/install.ps1 | iex

The installer downloads the release archive, verifies the checksum, and installs the CLI plus support assets (grammars/, native prebuilds, and bundled ripgrep). Atomic Agent updates itself in place; after an update the TUI prompts you to restart.

[!NOTE] Developer preview. APIs, commands, config, and behavior are still moving, so pin a release if you need a stable integration point. Current builds: macOS, Linux x64, and Windows x64.

Run

atomic-agent

[!TIP] Coming from Hermes or OpenClaw? Run /import in the TUI for a one-shot migration: sessions, cron jobs, and optionally your provider keys.

Troubleshooting

If something isn't working:

Copy your error logs and system specs.
Open an issue on GitHub.
Or ask for help in our Discord.

Benchmarks

On the public GAIA validation Level 1 split (53 tasks), Atomic Agent and Hermes drove the same local qwen-3.6-35b-a3b (llama-server, UD-Q4_K_XL), with the same step budget and timeout. The only variable is the agent loop.

GAIA Level 1 benchmark: atomic-agent 69.8% vs Hermes 58.5%

Metric	Atomic Agent	Hermes
Accuracy	37/53 = 69.8%	31/53 = 58.5%
Avg wall / task	~217 s	~351 s
Head-to-head wins	+15 atomic-only	+9 Hermes-only

%%{init: {"themeVariables": {"xyChart": {"backgroundColor": "transparent", "titleColor": "#0b63f6", "plotColorPalette": "#0b63f6"}}}}%%
xychart-beta
    title "GAIA L1 accuracy (higher is better, %)"
    x-axis ["Atomic Agent", "Hermes"]
    y-axis "Accuracy (%)" 0 --> 100
    bar [69.8, 58.5]

%%{init: {"themeVariables": {"xyChart": {"backgroundColor": "transparent", "titleColor": "#0b63f6", "plotColorPalette": "#0b63f6"}}}}%%
xychart-beta
    title "Avg wall time per task (lower is better, s)"
    x-axis ["Atomic Agent", "Hermes"]
    y-axis "Seconds / task" 0 --> 400
    bar [217, 351]
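The head-to-head figures above can be cross-checked with a few lines of arithmetic. This sketch only reuses numbers from the table; it assumes that "atomic-only" and "Hermes-only" count tasks solved by one agent but not the other, and the wall times are the rounded "~" values, so the ratio is approximate.

```typescript
// Figures copied from the table above; wall times are the rounded "~" values.
const totalTasks = 53;
const atomic = { solved: 37, avgWallSeconds: 217, exclusiveWins: 15 };
const hermes = { solved: 31, avgWallSeconds: 351, exclusiveWins: 9 };

// Accuracy as a percentage with one decimal place.
function accuracyPercent(solved: number, total: number): string {
  return ((solved / total) * 100).toFixed(1);
}

const atomicAccuracy = accuracyPercent(atomic.solved, totalTasks); // "69.8"
const hermesAccuracy = accuracyPercent(hermes.solved, totalTasks); // "58.5"

// Tasks solved by both agents cancel out, so the gap in solved tasks
// must equal the gap in exclusive wins: 37 - 31 = 15 - 9 = 6.
const solvedGap = atomic.solved - hermes.solved;
const exclusiveGap = atomic.exclusiveWins - hermes.exclusiveWins;

// Tasks solved by both agents: 37 - 15 = 31 - 9 = 22.
const solvedByBoth = atomic.solved - atomic.exclusiveWins;

// Ratio of average wall time per task (Hermes / Atomic Agent), about 1.6.
const wallTimeRatio = hermes.avgWallSeconds / atomic.avgWallSeconds;

console.log({ atomicAccuracy, hermesAccuracy, solvedGap, exclusiveGap, solvedByBoth, wallTimeRatio });
```

Under that reading, the two agents overlap on 22 tasks, and Hermes takes roughly 1.6 times as long per task on average.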

Model Scaling

The same loop holds up as the local model shrinks. Same GAIA L1 split, Atomic Agent alone:

Chat model	Accuracy	Avg wall / task
`qwen-3.6-35b-a3b` (UD-Q4_K_XL)	37/53 = 69.8%	~217 s
`qwen-3.5-9b` (Q4_K_M)	28/53 = 52.8%	~152 s
`gemma-4-12b` (it-qat UD-Q4_K_XL)	24/53 = 45.3%	~423 s

Even a 9B model clears half of GAIA L1 through the same context-frugal loop. (Different Atomic Agent versions per row; see the write-up for provenance.)

Full reproducible write-up: GAIA-L1-EXPERIMENT.md · Raw artifacts (matrices, NDJSON traces, logs): gaia-l1-eval-2026-06-11 release.

Why Local-First

The control loop and all state run on your machine, not a hosted service:

Your data never leaves. Sessions, memory, tasks, traces, skills, browser profile, and config live under <stateDir> on disk. Nothing leaves unless you configure it to.
No API costs. Run quantized models locally through llama.cpp. Bring your own llama-server or let the CLI manage one.
Nothing is hidden. Inspect the prompt, replay trace drift, edit skills, and swap parts without waiting for a vendor. Plain local models, SQLite files, and NDJSON traces.
Runs on your hardware. Small quantized models run on everyday consumer GPUs and CPUs, no datacenter needed.

Core Idea

How the Agent Loop Works

An agent is a loop: the model picks an action, something runs it, the result feeds back in, and it repeats until the job is done. The catch is cost. Every turn re-sends the growing context through the model, so a naive loop gets slower and pricier each pass, and small local models choke on it fastest.

Atomic Agent keeps the loop cheap. One inference produces one JSON array of tool calls, and it runs them without re-encoding the whole world every turn:

flowchart LR
    A[Prompt] --> B[Decide]
    B --> C[Run]
    C --> D[Compress]
    D -->|not done| A
    D -->|done| E[Reply]

Prompt: a compact prompt goes to the local model.
Decide: the model returns one JSON array of tool calls, grammar-checked so the format is always valid.
Run: the core executes them; independent reads run in parallel, risky actions ask first.
Compress: results and state are summarized, not pasted back in full.
Repeat: loop again until reply, finish, cancel, or a max-step limit.

The model chooses actions. Atomic Agent owns the loop, the state, the approvals, the traces, the stop conditions, and the failure boundaries.
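The steps above can be sketched as a small loop. This is a minimal illustration under stated assumptions, not Atomic Agent's actual code: every name here (`ToolCall`, `Tool`, `Model`, `Approve`, `compress`, `runLoop`), the `reply` tool convention, and the character-based truncation used as "compression" are hypothetical.

```typescript
// Hypothetical types for illustration; none of these names come from Atomic Agent.
interface ToolCall {
  tool: string;
  args: Record<string, string>;
}

interface Tool {
  readOnly: boolean; // read-only tools run in parallel and skip approval
  run: (args: Record<string, string>) => Promise<string>;
}

type Model = (prompt: string) => Promise<string>; // returns a JSON array of tool calls
type Approve = (call: ToolCall) => Promise<boolean>;

// Stand-in for summarization: keep a short prefix instead of the full result.
function compress(result: string, maxChars = 200): string {
  return result.length <= maxChars ? result : result.slice(0, maxChars) + " [truncated]";
}

async function runLoop(
  task: string,
  model: Model,
  tools: Record<string, Tool>,
  approve: Approve,
  maxSteps = 8,
): Promise<string> {
  const notes: string[] = [];
  for (let step = 0; step < maxSteps; step++) {
    // Prompt: the task plus compressed notes from earlier steps.
    const prompt = [task, ...notes].join("\n");
    // Decide: one inference yields one JSON array of tool calls.
    const calls = JSON.parse(await model(prompt)) as ToolCall[];

    // A "reply" call ends the loop.
    const reply = calls.find((c) => c.tool === "reply");
    if (reply) return reply.args.text ?? "";

    const execute = async (call: ToolCall): Promise<string> => {
      const tool = tools[call.tool];
      if (!tool) return `${call.tool}: unknown tool`;
      if (!tool.readOnly && !(await approve(call))) return `${call.tool}: denied`;
      return `${call.tool}: ${compress(await tool.run(call.args))}`;
    };

    // Run: independent reads go out together; everything else runs one at a time.
    const isRead = (c: ToolCall): boolean => tools[c.tool]?.readOnly === true;
    const readResults = await Promise.all(calls.filter(isRead).map(execute));
    const otherResults: string[] = [];
    for (const call of calls.filter((c) => !isRead(c))) {
      otherResults.push(await execute(call));
    }

    // Compress: only the short summaries feed the next prompt.
    notes.push(...readResults, ...otherResults);
  }
  return "stopped: max steps reached";
}
```

The point of the design is that the loop, not the model, decides what runs in parallel, what needs approval, and when to stop; the model only emits the array of calls.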

Built to Make Local Models Work

We run local models on our own TurboQuant llama.cpp (AtomicBot-ai/atomic-llama-cpp-turboquant):

TurboQuant KV-cache: WHT-rotated low-bit quantization compresses the KV-cache up to ~6.4× versus F16, with a fused Metal decode kernel, so long-context sessions fit in far less memory.
TurboQuant weights: Lloyd-Max weight quantization with WHT rotation and fused Metal/Vulkan kernels keeps quality high while small models fit on consumer hardware.
Custom speculative decoding: purpose-built Gemma 4 MTP and Qwen 3.6 NextN heads reuse the loaded model (no second context, tokenizer, or model load) for +30-50% throughput.
Curated quantized models: hand-picked GGUF quants that keep quality high while fitting real VRAM budgets.
Managed mode: the CLI downloads, pins, and runs the backend and models for you, no manual llama.cpp setup.
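To make the KV-cache figure concrete: a ~6.4× reduction from 16-bit values works out to roughly 16 / 6.4 = 2.5 bits per stored value on average. The sketch below illustrates the general idea of rotating a vector with a Walsh-Hadamard transform before low-bit quantization. It is a generic illustration, not TurboQuant's kernels: it uses plain uniform quantization rather than Lloyd-Max codebooks, and all function names are hypothetical.

```typescript
// Fast Walsh-Hadamard transform, scaled by 1/sqrt(n) so that applying it
// twice returns the original vector. Length must be a power of two.
function fwht(v: number[]): number[] {
  const n = v.length;
  if (n === 0 || (n & (n - 1)) !== 0) throw new Error("length must be a power of two");
  const out = v.slice(); // work on a copy; the input is left unchanged
  for (let h = 1; h < n; h *= 2) {
    for (let i = 0; i < n; i += 2 * h) {
      for (let j = i; j < i + h; j++) {
        const a = out[j];
        const b = out[j + h];
        out[j] = a + b;
        out[j + h] = a - b;
      }
    }
  }
  const scale = 1 / Math.sqrt(n);
  return out.map((x) => x * scale);
}

// Symmetric uniform quantization to signed integer codes of the given
// bit width, with one shared scale per block.
function quantize(block: number[], bits: number): { codes: number[]; scale: number } {
  const maxCode = 2 ** (bits - 1) - 1; // 3 for 3-bit codes in [-3, 3]
  const maxAbs = Math.max(...block.map((x) => Math.abs(x)));
  const scale = maxAbs === 0 ? 1 : maxAbs / maxCode;
  const codes = block.map((x) => Math.round(x / scale));
  return { codes, scale };
}

function dequantize(codes: number[], scale: number): number[] {
  return codes.map((c) => c * scale);
}

// Rotate, quantize, then undo both steps to inspect the reconstruction.
function roundTrip(v: number[], bits: number): number[] {
  const { codes, scale } = quantize(fwht(v), bits);
  return fwht(dequantize(codes, scale));
}

console.log(roundTrip([8, 0.5, -0.5, 0.25, 1, -1, 0, 2], 3));
```

The rotation spreads a single large outlier across all coordinates, so one shared scale per block wastes fewer quantization levels on it; the transform is its own inverse, which keeps decoding cheap.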

Tuned for Small Local Models

Atomic Agent's prompt is engineered so a small model never wastes tokens or breaks format:

Stable prefix: persona, rules, tools, skills, capabilities, and instructions stay byte-stable inside a session so cache_prompt and slot_id can reuse KV-cache instead of re-encoding the prompt every turn.
Bounded tail: conversation, memory, world state, recalled notes, lessons, procedures, and loaded skill bodies are clipped into a predictable prompt budget.
Externalized state: sessions, memory, tasks, skills, traces, browser snapshots, and model config live outside the prompt.
GBNF tool calls: completions are constrained into a JSON array of tool calls, including the solo case [{...}].
Parallel read batches: independent read-only calls can run concurrently after a single inference; dangerous actions remain approval-gated.
Compact browser view: ordinary web operation uses accessibility / ARIA snapshots instead of screenshot-heavy page dumps.

This is why small local models can stay useful across long, tool-heavy work.
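The stable-prefix and bounded-tail ideas above can be sketched as a small prompt builder. This is a hypothetical illustration, not Atomic Agent's prompt code: the names `PromptParts`, `clipTail`, and `buildPrompt` are invented here, and the budget is counted in characters as a stand-in for tokens.

```typescript
// Hypothetical prompt builder; names and the character budget are illustrative.
interface PromptParts {
  prefix: string; // persona, rules, tool list: identical on every turn of a session
  tail: string[]; // conversation turns, recalled notes, etc., oldest first
}

// Keep the newest tail entries that fit in the budget, preserving their order.
function clipTail(tail: string[], budgetChars: number): string[] {
  const kept: string[] = [];
  let used = 0;
  for (let i = tail.length - 1; i >= 0; i--) {
    const cost = tail[i].length + 1; // +1 for the joining newline
    if (used + cost > budgetChars) break;
    kept.unshift(tail[i]);
    used += cost;
  }
  return kept;
}

// The prefix is emitted unchanged, so every prompt in a session starts
// with the same bytes and a prompt cache can reuse that part.
function buildPrompt(parts: PromptParts, budgetChars: number): string {
  return [parts.prefix, ...clipTail(parts.tail, budgetChars)].join("\n");
}

const sessionPrefix = "You are an agent.\nTools: read_file";
const firstTurn = buildPrompt({ prefix: sessionPrefix, tail: ["user: hi"] }, 40);
const laterTurn = buildPrompt(
  { prefix: sessionPrefix, tail: ["user: hi", "tool: " + "x".repeat(30), "user: next"] },
  40,
);
console.log(firstTurn.startsWith(sessionPrefix), laterTurn.startsWith(sessionPrefix));
```

Because only the tail changes between turns and its size is capped, the prompt length stays predictable no matter how long the session runs.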

What It Can Do

Atomic Agent drives a full desktop tool surface. Dangerous actions are routed through approvals; independent read-only calls run in parallel.

Area	Capabilities
Browser	Navigate, click, type, search, manage tabs, scroll, and read compact ARIA state via `playwright-core` (Chrome / Edge / Chromium).
Web & HTTP	Web search with configurable providers (Exa), fetch and extract pages (SSRF-guarded), and make arbitrary HTTP requests, separate from the browser.
Filesystem & shell	Read, write, edit, patch, glob, grep, diff, watch, hash, list, archi