hanzi-browse - AI Agents | SkillsLLM

hanzi-browse

Name: hanzi-browse
Author: hanzili

Let any AI agent use the local browser.

158 stars

48 forks

JavaScript

Added 3/29/2026

Tags: AI Agents, ai-agent, ai-agents, browser-automation, chrome, chrome-extension

Installation

# Add to your Claude Code skills
git clone https://github.com/hanzili/hanzi-browse

README.md

Hanzi Browse

The context layer for browsing agents.

Your browsing agent keeps failing on real sites — X uses Draft.js, LinkedIn hides the connect button, Gmail needs keyboard shortcuts. Hanzi Browse ships 24 site playbooks — hints for the LLM, not brittle scripts — so it actually finishes the task.
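
To illustrate what "hints for the LLM, not brittle scripts" could mean in practice, here is a minimal sketch. It assumes a playbook is just a list of free-text hints keyed by domain and appended to the model's prompt. The `PLAYBOOKS` object, `findPlaybook`, and `buildSystemPrompt` are hypothetical names for illustration and are not taken from the Hanzi Browse codebase; the hint text only restates the examples above.

```javascript
// Hypothetical playbook shape: free-text hints keyed by domain.
// These names and entries are illustrative, not Hanzi Browse's real data.
const PLAYBOOKS = {
  'x.com': ['The post composer uses Draft.js.'],
  'linkedin.com': ['The Connect button can be hidden.'],
  'mail.google.com': ['Keyboard shortcuts are needed for many actions.'],
};

// Return the hints for a URL, matching the exact host or a parent domain.
function findPlaybook(url, playbooks = PLAYBOOKS) {
  const host = new URL(url).hostname.replace(/^www\./, '');
  const key = Object.keys(playbooks).find(
    (domain) => host === domain || host.endsWith('.' + domain)
  );
  return key ? playbooks[key] : [];
}

// Append hints to the prompt as plain text; the model decides how to act on them.
function buildSystemPrompt(url, basePrompt) {
  const hints = findPlaybook(url);
  if (hints.length === 0) return basePrompt;
  return basePrompt + '\n\nSite hints:\n' + hints.map((h) => '- ' + h).join('\n');
}
```

Because the hints are text rather than selectors or click sequences, a site redesign may make a hint stale, but it does not break a script.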

Works with

Two ways to use Hanzi Browse

Same 24 site playbooks underneath. Two install paths depending on who's driving.

For your agent — a browser sub-agent for your coding agent

One command: npx hanzi-browse setup. It detects every AI agent on your machine (Claude Code, Cursor, Codex, and 9 more) and wires Hanzi Browse in as an MCP tool. Your main agent delegates browser work to a sub-agent, which runs the loop (read page → plan next action → click/type/scroll → observe → repeat until done) and returns a clean answer. Site playbooks auto-load by URL, so the model already knows each site's quirks.
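
The sub-agent loop described above can be sketched as follows. This is a minimal illustration, not the project's implementation: `browser` and `planner` are hypothetical interfaces supplied by the caller, the action shape (`type`, `answer`) is assumed, and the `maxSteps` cap is an added safeguard that the source does not mention.

```javascript
// Sketch of the loop: read page -> plan next action -> act -> observe -> repeat.
// `browser` and `planner` are hypothetical interfaces injected by the caller.
async function runBrowserTask(task, { browser, planner, maxSteps = 20 }) {
  const history = [];
  for (let step = 0; step < maxSteps; step++) {
    // Read the current page state.
    const page = await browser.readPage();
    // Ask the planner (an LLM in practice) for the next action.
    const action = await planner.nextAction({ task, page, history });
    if (action.type === 'done') {
      return { status: 'complete', answer: action.answer, steps: step };
    }
    // Perform the click/type/scroll and record what happened.
    const observation = await browser.perform(action);
    history.push({ action, observation });
  }
  // Stop instead of looping forever if the task never finishes.
  return { status: 'max_steps_reached', answer: null, steps: maxSteps };
}
```

Passing the browser and planner in as arguments keeps the loop independent of any particular model or browser driver, which also makes it easy to exercise with fakes.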

npx hanzi-browse setup
│
├── 1. Detect browsers ──── Chrome, Brave, Edge, Arc, Chromium
│
├── 2. Install extension ── Opens Chrome Web Store, waits for install
│
├── 3. Detect AI agents ─── Claude Code, Cursor, Codex, Windsurf,
│                           VS Code, Gemini CLI, Amp, Cline, Roo Code
│
├── 4. Configure MCP ────── Merges hanzi-browse into each agent's config
│
├── 5. Install skills ───── Copies browser skills into each agent
│
└── 6. Choose AI mode ───── Managed ($0.05/task) or BYOM (free forever)
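
Step 4 merges an entry into each agent's config rather than overwriting the file. A non-destructive merge might look like the sketch below. It assumes the config is a JSON object with an `mcpServers` map, a convention used by several MCP clients; the `mergeMcpServer` helper and the `command`/`args` values are hypothetical and may differ from what the setup command actually writes.

```javascript
// Return a new config with one MCP server entry added or replaced.
// Other servers and unrelated settings are left untouched.
function mergeMcpServer(existingConfig, name, entry) {
  const config = existingConfig ?? {};
  return {
    ...config,
    mcpServers: { ...(config.mcpServers ?? {}), [name]: entry },
  };
}

// Example: a config that already has another server and an unrelated setting.
const before = {
  theme: 'dark',
  mcpServers: { other: { command: 'node', args: ['other.js'] } },
};

// Hypothetical entry; the real command and args are not given in the source.
const after = mergeMcpServer(before, 'hanzi-browse', {
  command: 'npx',
  args: ['hanzi-browse'],
});
```

Returning a new object instead of mutating the input makes it straightforward to compare the old and new config before writing anything to disk.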

"Go to Gmail and unsubscribe from all marketing emails from the last week"
"Apply for the senior engineer position on careers.acme.com"
"Log into my bank and download last month's statement"
"Find AI engineer jobs on LinkedIn in San Francisco"

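// Top-level await requires an ES module context.
import { HanziClient } from '@hanzi-browse/sdk';

// Read the API key from the environment rather than hard-coding it.
const client = new HanziClient({ apiKey: process.env.HANZI_API_KEY });

// Request a pairing token, then list the available browser sessions.
const { pairingToken } = await client.createPairingToken();
const sessions = await client.listSessions();

// Run a task, described in plain English, against the first session.
const result = await client.runTask({
  browserSessionId: sessions[0].id,
  task: 'Read the patient chart on the current page',
});
console.log(result.answer);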
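
The snippet above reads `sessions[0].id` directly, which throws a TypeError when the list is empty. A small guard, sketched here, gives a clearer failure. `firstSessionId` is a hypothetical helper and not part of the SDK; it assumes only that each session object has an `id`, as the example shows, and that an empty list means no browser has been paired yet.

```javascript
// Hypothetical helper (not part of the SDK): pick the first session's id,
// or fail with a readable message when there is nothing to run against.
function firstSessionId(sessions) {
  if (!Array.isArray(sessions) || sessions.length === 0) {
    throw new Error('No browser session available. Pair a browser, then retry.');
  }
  return sessions[0].id;
}
```

It could be used in place of the direct index, for example `browserSessionId: firstSessionId(sessions)`.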

git clone https://github.com/hanzili/hanzi-browse
cd hanzi-browse
make fresh

make dev
