by keyuchen21
The definitive OpenAI, Claude, MCP, Harness, Evals, and Production Agent Systems learning roadmap.
# Add to your Claude Code skills
git clone https://github.com/keyuchen21/agentic-engineering-handbookGuides for using ai agents skills like agentic-engineering-handbook.
agentic-engineering-handbook is an open-source ai agents skill for AI coding assistants such as Claude Code, Codex CLI, and ChatGPT, built by keyuchen21. The definitive OpenAI, Claude, MCP, Harness, Evals, and Production Agent Systems learning roadmap. It has 114 GitHub stars.
agentic-engineering-handbook's catalog security scan is still queued. You can run an instant dependency and prompt-injection check now with the "Scan for vulnerabilities" button above.
Clone the repository with "git clone https://github.com/keyuchen21/agentic-engineering-handbook" and add it to your Claude Code skills directory (see the Installation section above).
agentic-engineering-handbook is primarily written in Python. It is open-source under keyuchen21 on GitHub, so you can review or fork the full source.
Yes. SkillsLLM lists many other AI Agents skills you can browse and compare side by side. Open the AI Agents category from the badge at the top of this page, or use the Related Skills and comparison links further down to weigh agentic-engineering-handbook against similar tools.
No comments yet. Be the first to share your thoughts!
Unlocks once the catalog security scan passes (runs nightly).
The deep catalog scan for this skill is still queued. Run an instant dependency check now instead.
The definitive OpenAI, Anthropic, Google, MCP, Harness, Evals, and Production Agent Systems learning roadmap.
If this repository helps you, consider giving it a ⭐
The AI industry has entered the Agentic Era. Building production-grade AI systems now requires mastering agents, tool use, MCP, memory, long-running workflows, coding agents, agent harnesses, evals, and safety — but the knowledge is scattered across OpenAI blogs, Anthropic engineering posts, SDK docs, cookbooks, and research papers.
This repository consolidates 161 curated resources into one structured learning roadmap.
The goal: Become a world-class Agentic Engineer.
Pick the path that matches your starting point:
Read First, Then Read, and Build Exercise as a checklist.Build Exercise prompts, then branch into Applied Practice Tracks for coding agents, security, code review, or SRE.P0 first, use P1 for implementation detail, and keep P2 as optional background.If you treat Claude Code as a coding CLI, many capabilities can feel like magic: it reads files, runs commands, edits code, delegates work, and stays oriented during complex tasks.
From an engineering perspective, the core is much simpler:
model + tools + one loop.
Understanding that loop makes the rest of the system easier to reason about:
These pages are based on the upstream English Markdown tutorials from shareAI-lab/mini-claude-code, with added Study Notes and inline source code for this handbook.
Supporting files are included in the same folder: requirements.txt, .env.example, v0_bash_agent_mini.py, and skills/.
Build shared vocabulary for workflow vs agent, tool loop, handoff, guardrails.
Should I build an agent? (4-question checklist from Barry Zhang's talk)
| Question | If No → Workflow | If Yes → Agent |
|---|---|---|
| Is the task complex enough? | Decision tree is fully mappable | Ambiguous problem space |
| Is the task valuable enough? | <$0.10 per run | >$1 per run, cost doesn't matter |
| Are all core capabilities doable? | Weak links break the chain | Model handles every step well |
| Is error cost low & detectable? | High cost + hard to detect → human-in-the-loop | Errors caught by tests/CI |
Think like the agent. Most failures come from designing with a human perspective. Put yourself inside the agent's context window: you only see ~10K–20K tokens (system prompt + tool descriptions + recent observations). Ask: does the agent have enough information to act correctly at each step?
→ Source: How We Build Effective Agents
| # | Title | Vendor |
|---|---|---|
| 1 | System Prompts | Anthropic |
| 2 | Prompt guidance | OpenAI |
| 3 | Function Calling | OpenAI |
| 4 | Tool use overview | Anthropic |
| 5 | Function calling - Gemini API | |
| 6 | Building effective agents | Anthropic |
| 7 | New tools for building agents | OpenAI |
| 8 | Agents SDK overview | OpenAI |
| Title | Vendor |
|---|---|
| How We Build Effective Agents: Barry Zhang, Anthropic | Anthropic |
| Phistory — Claude Code & Codex CLI System Prompt Diff History | Community |
| Coding Agents 101: The Art of Actually Getting Things Done | Cognition |
| OpenAI Agents SDK examples | OpenAI |
| Structured Outputs for Multi-Agent Systems | OpenAI |
Build a customer service/ticket triage agent: router → specialist → evaluator, with all outputs constrained by structured schemas.
Understand MCP server/client, remote vs local, tool loading, approval, connector boundaries.
| # | Title | Vendor |
|---|---|---|
| 1 | Introducing the Model Context Protocol | Anthropic |
| 2 | MCP and Connectors | OpenAI |
| 3 | Building MCP servers for ChatGPT Apps and API integrations | OpenAI |
| Title | Vendor |
|---|---|
| Code execution with MCP: Building more efficient agents | Anthropic |
| Writing effective tools for AI agents - with AI agents | Anthropic |
| Model Context Protocol - Codex | OpenAI |
| Build a Remote MCP server | Cloudflare |
| Introducing the MCP Registry | MCP |
| OpenAI Docs MCP | OpenAI |
| Build your ChatGPT UI | OpenAI |
Build a read-only repo/docs MCP server, then create an eval to verify the agent correctly cites documentation.
Learn to control context window, short/long-term memory, skills/plugins, CLAUDE.md/AGENTS.md.
| # | Title | Vendor |
|---|---|---|
| 1 | Agent Skills Specification | Agent Skills |
| 2 | Effective context engineering for AI agents | Anthropic |
| 3 | How Long Contexts Fail | Drew Breunig |
| 4 | Context Rot | Chroma |
| 5 | Progressive disclosure | Claude-Mem |
| 6 | Equipping agents for the real world with Agent Skills | Anthropic |
| 7 | Agent Skills | Anthropic |
| 8 | Skills | OpenAI |
| 9 | Building Reliable Agents with Memory and Compaction | OpenAI |
| Title | Vendor |
|---|---|
| Custom instructions with AGENTS.md - Codex | OpenAI |
| Best practices for Claude Code | Anthropic |
| Agent Skills - Codex | OpenAI |
| Skills in OpenAI API | OpenAI |
Implement the same task as a Skill/Plugin, then measure accuracy and token cost across three variants: no skill, long prompt, and skill-based.
Master agent runtime: event stream, thread, tool execution, state, sandbox, approval, recovery.
| # | Title | Vendor |
|---|---|---|
| 1 | Unrolling the Codex agent loop |