by eggbrid2
Open Android AI agent runtime for phone control, app automation, VLM screen reading, skill routing, mini apps, and Mihomo VPN workflows.
# Add to your Claude Code skills
git clone https://github.com/eggbrid2/mobileClaw
Last scanned: 5/30/2026
{
"issues": [],
"status": "PASSED",
"scannedAt": "2026-05-30T16:21:08.497Z",
"npmAuditRan": true,
"pipAuditRan": true
}
mobileClaw is an open-source AI agent skill for AI coding assistants such as Claude Code, Codex CLI, and ChatGPT, built by eggbrid2. It has 402 GitHub stars.
mobileClaw passed SkillsLLM's automated security scan (a dependency vulnerability audit plus prompt-injection heuristics) with no high-severity issues. The full report is in the Security Report section on this page.
Clone the repository with "git clone https://github.com/eggbrid2/mobileClaw" and add it to your Claude Code skills directory (see the Installation section above).
mobileClaw is primarily written in Kotlin. It is open-source under eggbrid2 on GitHub, so you can review or fork the full source.
MobileClaw is an experimental Android app for running LLM agents on a real phone. It sits at the intersection of Android automation, mobile AI agents, accessibility-based phone control, on-device Python tools, multi-agent workflows, and VPN/proxy operations.
The idea is simple: a mobile agent should not just chat about your device. It should be able to observe the screen, choose the right tools, act through Android capabilities, create new workflows, and keep enough memory to improve across tasks.
MobileClaw is currently going through a UI refresh, so some screens may feel visually inconsistent or rough while the new interface is being rebuilt.
The roadmap also includes MCP support, including MCP connection and MCP creation flows, so agents can connect to standard MCP servers and expose compatible tools through the same runtime.
Captured from a Xiaomi device running the debug build. These are real agent runs, not mockups: MobileClaw created and opened a WebView MiniAPP, created a native AI Page, kept a multi-agent group chat with stickers, managed on-device models with vision packs, and exposed its skill/VPN/runtime surfaces.
Join the WeChat group to discuss MobileClaw usage, Android agent development, local models, skills, ROM compatibility, and real-device bugs.
This WeChat group QR code is valid until June 19, 2026. If it expires, open the latest README or ask for an updated invite.
Most mobile AI apps are chat surfaces. MobileClaw is closer to a small operating layer for agents.
A user request is turned into a scoped task. The task gets a role, a short plan, a filtered tool set, and an execution loop. That shape is the core of the project:
user goal -> task type -> role scheduler -> planner -> allowed skills -> observe -> act -> verify
This matters because phone automation fails quickly when every tool is always available. MobileClaw keeps phone control, web research, file work, app building, image generation, VPN control, skill management, and code execution in different task modes.
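The mode separation described above can be sketched as a lookup from task type to an allowed tool set. This is a minimal Python illustration, not MobileClaw's Kotlin implementation: the keyword rules, the `classify` and `visible_tools` helpers, and the exact tool groupings are assumptions made for the example. Only the task type and tool names come from this README.

```python
from enum import Enum


class TaskType(Enum):
    PHONE_CONTROL = "phone_control"
    WEB_RESEARCH = "web_research"
    VPN_CONTROL = "vpn_control"


# Hypothetical policy table: which tools each task mode may see.
TOOL_POLICY = {
    TaskType.PHONE_CONTROL: {"see_screen", "tap", "scroll", "input_text"},
    TaskType.WEB_RESEARCH: {"web_search", "fetch_url"},
    TaskType.VPN_CONTROL: {"vpn_control"},
}

# Hypothetical keyword rules standing in for the real classifier.
KEYWORDS = {
    TaskType.VPN_CONTROL: ("vpn", "proxy"),
    TaskType.WEB_RESEARCH: ("search", "look up", "research"),
}


def classify(request: str) -> TaskType:
    """Return the first task type whose keyword appears in the request."""
    text = request.lower()
    for task_type, words in KEYWORDS.items():
        if any(word in text for word in words):
            return task_type
    return TaskType.PHONE_CONTROL  # default mode for this sketch


def visible_tools(request: str, registry: set[str]) -> set[str]:
    """Filter the full tool registry down to the tools allowed for the task."""
    return registry & TOOL_POLICY[classify(request)]


ALL_TOOLS = set().union(*TOOL_POLICY.values())
print(sorted(visible_tools("Turn on the VPN", ALL_TOOLS)))  # ['vpn_control']
```

With this shape, a VPN request never sees `tap` or `web_search`, which is the failure mode the filtered tool set is meant to avoid.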
The project is still moving fast. Some pieces are stable enough to use daily; some are research-grade and need device-specific fixes. The code is open because this kind of Android agent needs real devices, real ROM quirks, and real users to become good.
- see_screen, which captures a screenshot, marks interactive targets, and returns coordinates for direct action.
- screenshot fallback when XML is empty or misleading, especially for Flutter, React Native, WebView, and game-like UIs.
- bg_launch, bg_read_screen, bg_screenshot, bg_stop.
- TaskClassifier maps requests into task types such as PHONE_CONTROL, WEB_RESEARCH, APP_BUILD, VPN_CONTROL, SKILL_MANAGEMENT, and CODE_EXECUTION.
- TaskPlanner makes a planning call before tool execution.
- TaskToolPolicy controls which tools are visible for each task.
- RoleScheduler chooses from built-in and user-created roles.
- AgentRuntime runs a ReAct-style loop with repeated-perception guards, screenshot context trimming, structured observations, and task events.

Built-in roles include:
Roles are not just personas. They can declare preferred task types, keywords, scheduler priority, forced skills, and model overrides. User-created roles participate in the same scheduler.
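One way to picture how declared task types, keywords, and priority could feed a scheduler is the Python sketch below. It is an assumption-laden illustration rather than the project's actual RoleScheduler: the `Role` fields, the scoring order, and all three role names are hypothetical, and forced skills and model overrides are left out.

```python
from dataclasses import dataclass, field


@dataclass
class Role:
    name: str
    preferred_task_types: set[str] = field(default_factory=set)
    keywords: tuple[str, ...] = ()
    priority: int = 0
    builtin: bool = False


def score(role: Role, task_type: str, request: str) -> tuple[int, int, int]:
    """Rank by task-type match, then keyword hits, then declared priority."""
    text = request.lower()
    type_match = 1 if task_type in role.preferred_task_types else 0
    keyword_hits = sum(1 for word in role.keywords if word in text)
    return (type_match, keyword_hits, role.priority)


def schedule(roles: list[Role], task_type: str, request: str) -> Role:
    """Pick the best-scoring role; built-in and custom roles compete equally."""
    return max(roles, key=lambda role: score(role, task_type, request))


# Hypothetical roles: two presets and one user-created role.
roles = [
    Role("general", priority=1, builtin=True),
    Role("researcher", {"WEB_RESEARCH"}, ("paper", "compare"), 5, builtin=True),
    Role("my-shopping-helper", {"WEB_RESEARCH"}, ("price", "compare"), 5),
]
chosen = schedule(roles, "WEB_RESEARCH", "Compare the price of two phones")
print(chosen.name)  # my-shopping-helper
```

The custom role wins here because it matches two keywords against the preset's one, which shows user-created roles taking part in the same ranking as built-in ones.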
The role UI is designed around quick task assignment rather than decorative persona editing. The Roles page highlights the current role first, then lists built-in and custom roles with readable capability labels such as code, research, phone control, apps, images, VPN, and skills. Built-in roles are protected as presets: editing them creates a custom copy, while custom roles can be edited directly. Advanced fields such as system prompt addenda, model override, and pinned skills are kept behind an advanced section so normal role creation stays approachable.
MobileClaw has a native skill registry with injection levels:
Built-in skill groups include:
- see_screen, screenshot, read_screen, tap, scroll, input_text, navigate, list_apps.
- web_search, fetch_url, hidden WebView browsing, page content extraction, JavaScript execution.
- vpn_control.

Dynamic skills can be Python or HTTP definitions saved under app storage. Native and shell skills are intentionally not generated by the agent through the normal meta-skill path.
MobileClaw has two app-building paths:
- Claw JavaScript bridge for HTTP, SQLite, Python, shell, memory, config, files, clipboard, device info, app launch, URL opening, sharing, and asking the agent.

Both are created from chat through skills. Mini apps are good for fast web-like tools. AI Pages are better when a workflow should feel native.
Follow-up edits keep artifact context. If the user asks to change "that page" after creating an AI Page, MobileClaw carries the recent page ID into the next task and routes the update back through ui_builder instead of falling back to one-off HTML.
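The follow-up behavior can be illustrated with a small Python sketch of context carry-over. Everything except the `ui_builder` skill name is assumed for the example: the `SessionContext` class, the hint phrases, the `route` helper, and the shape of the returned dictionary are hypothetical and do not describe MobileClaw's real routing code.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class SessionContext:
    """Hypothetical per-chat state; only the page ID matters for this sketch."""
    recent_page_id: Optional[str] = None


# Hypothetical phrases that signal a reference to an earlier artifact.
FOLLOW_UP_HINTS = ("that page", "the page", "this page")


def route(request: str, ctx: SessionContext) -> dict:
    """Route a follow-up edit to ui_builder when a recent page is in context."""
    text = request.lower()
    refers_to_page = any(hint in text for hint in FOLLOW_UP_HINTS)
    if refers_to_page and ctx.recent_page_id is not None:
        return {"skill": "ui_builder", "action": "update",
                "page_id": ctx.recent_page_id}
    # Without artifact context, the request is handled as a fresh task.
    return {"skill": None, "action": "new_task", "page_id": None}


ctx = SessionContext(recent_page_id="page-123")
print(route("Make that page dark themed", ctx))
```

The point of the sketch is that the page ID travels with the session, so the second request updates the existing artifact instead of producing a new, unrelated one.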
MobileClaw includes a VPN stack designed for Android agent use:
- MATCH,GLOBAL.
- VpnService creates the TUN interface.
- hev-socks5-tunnel bridges Android TUN traffic to mihomo.

This stack does not use Xray. mihomo handles the proxy protocols; hev is kept because Android still needs a