claude-context-os

Name: claude-context-os
Author: Arkya-AI

Warning

A complete operating system for Claude that prevents context loss and makes multi-session work reliable

299stars

24forks

Added 2/11/2026

Installation

# Add to your Claude Code skills
git clone https://github.com/Arkya-AI/claude-context-os

Getting Started

Guides for using ai agents skills like claude-context-os.

Caveman: Cut Claude Token Use by 65%
How agent-side prompt compression works, when to use it, and when not to.
What is an AI Skills Marketplace?
Definitions, how marketplaces work, and how to choose between them in 2026.
Getting Started with AI Skills

Security ReportWarning

Last scanned: 5/30/2026

{
  "issues": [
    {
      "type": "clone-failed",
      "message": "Could not clone repository",
      "severity": "medium"
    }
  ],
  "status": "WARNING",
  "scannedAt": "2026-05-30T15:05:23.686Z",
  "npmAuditRan": false,
  "pipAuditRan": false
}

README.md

Frequently Asked Questions

What is claude-context-os?

claude-context-os is an open-source ai agents skill for AI coding assistants such as Claude Code, Codex CLI, and ChatGPT, built by Arkya-AI. A complete operating system for Claude that prevents context loss and makes multi-session work reliable. It has 299 GitHub stars.

Is claude-context-os safe to use?

claude-context-os returned warnings in SkillsLLM's automated security scan. It has no critical vulnerabilities, but review the flagged issues in the Security Report section before adding it to your workflow.

How do I install claude-context-os?

Clone the repository with "git clone https://github.com/Arkya-AI/claude-context-os" and add it to your Claude Code skills directory (see the Installation section above).

Are there alternatives to claude-context-os?

Yes. SkillsLLM lists many other AI Agents skills you can browse and compare side by side. Open the AI Agents category from the badge at the top of this page, or use the Related Skills and comparison links further down to weigh claude-context-os against similar tools.

Agentic AI for Beginners

Build your first AI agent from scratch - tool use, ReAct pattern, memory, deployment

41 minBeginner

Comments (0)

to leave a comment.

No comments yet. Be the first to share your thoughts!

Related Skills

superpowers

by obra

An agentic skills framework & software development methodology that works.

234,966

ChatClaw omni

Claude Context OS v4

New CLAUDE.md that solves the compaction/context loss problem and mimics persistent memory through effective session handoff in token efficient manner

What changed in v4

v3 got 150+ stars. Then I read 9 peer-reviewed papers on how LLMs actually process system prompts. Turns out v3 was working against itself.

The research said three things that broke my assumptions:

LLMs track 5-10 rules before performance degrades. v3 had 15+. Some rules were silently dropped every session. (arXiv:2409.10715)
Explanatory prose actively interferes with instruction compliance. The "Why It Matters" sections in v3 weren't neutral — they competed with the actual rules for Claude's attention. Topically related filler is worse than random noise. (Chroma Context Rot, 2025)
Claude 4.6 overtriggers on emphatic language. "CRITICAL", "NEVER", "YOU MUST" worked for Claude 3.x. The newer models are more responsive to system prompts and now overcorrect when they see aggressive language. (Anthropic Claude 4 Best Practices)

The result: v4 is 47 lines and 7 rules. Down from 327 lines and 15+ rules. Every design choice maps to a specific research finding.

	v3	v4
Lines	327	47
Rules	15+	7
Explanatory prose	Heavy	Zero
Language style	"CRITICAL / NEVER / MUST"	Plain imperatives
Session startup	Read all summaries	Read only what handoff references
Reference content	Embedded in CLAUDE.md	Moved to docs/context/ (loaded on demand)
Templates	Same	Same + task template + handoff index

The problem (unchanged from v3)

Claude's default summarization loses five specific things:

Precise numbers get rounded or dropped
Conditional logic (IF/BUT/EXCEPT) collapses to simple statements
Decision rationale — the WHY evaporates, only the WHAT survives
Cross-document relationships flatten to single statements
Open questions get silently resolved as settled

The fix isn't better summarization — it's structured templates with explicit fields that mechanically prevent these five failure modes.

What's in the box

CLAUDE.md                    (47 lines, ~1,200 tokens) — the operating system
templates/
  claude-templates.md        (on-demand only) — 6 templates, 3 subagent contracts,
                              3 workflow phases, task definition format
docs/
  context/                   (on-demand) — reference files loaded by work type
    processing-protocol.md   — how to handle multiple documents
    archive-rules.md         — summary lifecycle and file management
    project-structure.md     — full directory tree + scaffold commands
    subagent-rules.md        — when to use subagents vs. main agent
    client-briefs/           — per-client context (you populate these)
.claude/commands/
  handoff.md                 /handoff — end a session cleanly
  process-doc.md             /process-doc — summarize an input document
  status.md                  /status — check project state
examples/
  software-dev-project/      worked example with filled-in templates
CLAUDE-v3.md                 previous version (preserved for reference)

The 7 rules

Do not mix client contexts in one session
Write state to disk, not conversation — summaries to docs/summaries/ using structured templates
Before compaction or session end, write everything to disk — numbers, decisions, questions, file paths, next actions
When switching work types, write a handoff and suggest a new session
Do not silently resolve open questions — mark them OPEN or ASSUMED
Do not bulk-read documents — process one at a time, summarize, release before reading next
Sub-agent returns must be structured — use output contracts, not free-form prose

Why 7 rules and not 15

Research on transformer working memory (arXiv:2409.10715) shows LLMs can actively track 5-10 constraints before performance degrades to near-random. With 15 rules, Claude silently drops the ones it considers least relevant. With 7, compliance per rule goes up significantly.

The rules that were cut didn't disappear — they moved to docs/context/ where Claude loads them only when the task requires it. This matches Anthropic's own recommendation for "just-in-time context".

Why zero explanation in CLAUDE.md

v3 had sections like "Structured Summarization: Why It Matters" and "Cost of NOT Splitting" — prose that explained why each rule existed.

The Chroma Context Rot study (July 2025, 18 LLMs tested) found that coherent filler text that's topically related to the target task interferes MORE than random noise. Explanatory prose about context management rules actively competes with the context management rules themselves.

v4 has rules only. No justification, no persuasion, no theory. The rationale lives here in the README — for humans, not for Claude.

Why plain language instead of "CRITICAL" and "NEVER"

Anthropic's Claude 4 best practices note that Claude Opus 4.5/4.6 is "significantly more proactive" and "more responsive to the system prompt." Instructions that needed emphasis for Claude 3.x now cause overtriggering in Claude 4.6.

v4 uses calm imperatives: "Do not mix client contexts" instead of "NEVER mix client contexts. This is CRITICAL."

Why "Before Delivering Output" is last

The Lost in the Middle paper (Liu et al., TACL 2024) showed LLM performance follows a U-shaped curve — highest attention at the beginning and end of context, lowest in the middle. Anthropic's own testing shows queries placed at the end of prompts improve response quality by up to 30%.

The verification checklist is deliberately placed at the end of CLAUDE.md so it's the last instruction Claude processes before starting work.

New in v4

Work-type context mapping

CLAUDE.md now tells Claude which context files to load based on the type of work:

- Proposals/sales → client-briefs/[client].md + sales-frameworks.md
- Content/thought leadership → content-strategy.md
- Workshops → workshop-design.md + client-briefs/[client].md
- Agent development → use codebase directly

No more loading everything. Claude loads what the task needs.

Task definition template

Inspired by GSD's atomic task structure:

## Task: [name]
Context files: [which files to load]
Action: [what to produce]
Verify: [how to check it's correct]
Done: [what files exist when complete]

Handoff index layer

Inspired by claude-mem's progressive disclosure pattern. Every handoff now includes a "Files to Load Next Session" field — an explicit manifest that tells the next session exactly what to read, instead of loading all summaries.

Quick start

Copy CLAUDE.md to your project root
Copy the templates/ and docs/context/ directories
Copy .claude/commands/ for slash commands
Edit the Identity section in CLAUDE.md for your workflow
Populate docs/context/ files for your domain (sales frameworks, client briefs, etc.)
Start a Claude Code session

No dependencies. No install.

Who this is for

Anyone doing multi-session work with Claude — consultants, developers, researchers, sales professionals, no-code builders. If your projects span days or weeks, not minutes.

v4 was specifically designed to work for non-developers too — consultants, sales professionals, researchers, and no-code builders using Claude Code for business tasks like proposals, competitive analysis, content strategy, and workshop design.

The research behind v4

Design Decision	Research
7 rules max	Working Memory Limits — LLMs track 5-10 constraints
No explanatory prose	Context Rot — coherent filler interferes
Rules are 1-3 lines each	Hidden in the Haystack — too-short instructions harder to find
Verification at end	Lost in the Middle — U-shaped attention curve
Plain imperatives	Claude 4 Best Practices — overtriggering
"State your understanding" step	Anthropic + EMNLP 2025 — recite-before-reasoning
Just-in-time context loading	Anthropic Context Engineering
Session splitting as architecture	NoLiMa — 32K token performance cliff

Additional references:

Context Length Alone Hurts LLM Performance — EMNLP 2025
Claude Code Best Practices — Anthropic official

v3 → v4 migration

If you're using v3:

Your templates/claude-templates.md is mostly compatible — v4 adds Template 6 (Task Definition) and a "Files to Load Next Session" field to the handoff template
Copy the new docs/context/ directory — this is where v3's displaced content now lives
Replace your CLAUDE.md with v4
Your existing docs/summaries/