web-eval-agent
An MCP server that autonomously evaluates web applications.
1,234 stars
105 forks
Python
Added 12/27/2025
MCP Servers, debugging, debugging-tool, mcp, mcp-server, modelcontextprotocol, playwright, qa, vibe-coding, vibe-testing
Installation
# Add to your Claude Code skills
git clone https://github.com/refreshdotdev/web-eval-agent
⚠️ PROJECT HAS BEEN SUNSET ⚠️
This project has been discontinued. We're building something new at withrefresh.com.
🚀 operative.sh web-eval-agent MCP Server
Let the coding agent debug itself, you've got better things to do.

🔥 Supercharge Your Debugging
operative.sh's MCP Server launches a browser-use powered agent to autonomously execute and debug web apps directly in your code editor.
⚡ Features
- 🌐 Navigate your webapp using BrowserUse (2x faster with operative backend)
- 📊 Capture network traffic - requests are intelligently filtered and returned into the context window
- 🚨 Collect console errors - captures logs & errors
- 🤖 Autonomous debugging - the Cursor agent calls the web QA agent MCP server to test whether the code it wrote works as expected end-to-end.
🧰 MCP Tool Reference
| Tool | Purpose |
|------|---------|
| web_eval_agent | 🤖 Automated UX evaluator that drives the browser, captures screenshots, console & network logs, and returns a rich UX report. |
| setup_browser_state | 🔒 Opens an interactive (non-headless) browser so you can sign in once; the saved cookies/local-storage are reused by subsequent web_eval_agent runs. |
Key arguments
- web_eval_agent
  - url (required) – address of the running app (e.g. http://localhost:3000)
  - task (required) – natural-language description of what to test ("run through the signup flow and note any UX issues")
  - headless_browser (optional, default false) – set to true to hide the browser window
- setup_browser_state
  - url (optional) – page to open first (handy to land directly on a login screen)
You can trigger these tools straight from your IDE chat, for example:
Evaluate my app at http://localhost:3000 – run web_eval_agent with the task "Try the full signup flow and report UX issues".
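When the IDE agent acts on a prompt like this, it sends an MCP `tools/call` request to the server. The sketch below shows roughly what that request could look like for the arguments listed under "Key arguments". The `build_tool_call` helper and the `REQUIRED_ARGS` table are hypothetical names used only for illustration; they are not part of web-eval-agent, and the server's real validation may differ.

```python
import json

# Required argument names per tool, taken from the "Key arguments" list.
# This table is an illustrative assumption, not the server's own schema.
REQUIRED_ARGS = {
    "web_eval_agent": ("url", "task"),
    "setup_browser_state": (),
}


def build_tool_call(tool, arguments, request_id=1):
    """Return an MCP `tools/call` JSON-RPC request as a plain dict."""
    if tool not in REQUIRED_ARGS:
        raise ValueError(f"unknown tool: {tool}")
    missing = [name for name in REQUIRED_ARGS[tool] if name not in arguments]
    if missing:
        raise ValueError(f"{tool} is missing required arguments: {missing}")
    return {
        "jsonrpc": "2.0",
        "id": request_id,
        "method": "tools/call",
        "params": {"name": tool, "arguments": dict(arguments)},
    }


request = build_tool_call(
    "web_eval_agent",
    {
        "url": "http://localhost:3000",
        "task": "Try the full signup flow and report UX issues",
        "headless_browser": False,  # documented default: browser window stays visible
    },
)
print(json.dumps(request, indent=2))
```

In practice the MCP client inside your editor builds and sends this request for you; the sketch is only meant to make the mapping from chat prompt to tool arguments concrete.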
🏁 Quick Start
Easy Setup with One-Click Integration
- Get your API key (free) - when you create your API key, you'll see:
  - "Add to Cursor" button with a deeplink for instant Cursor installation
  - Prefilled Claude Code command with your API key automatically included
Manual Setup (macOS/Linux)
- Pre-requisites (typically not needed):
  - brew: /bin/bash -c "$(curl -fsSL https://raw.githubusercontent.com/Homebrew/install/HEAD/install.sh)"
  - npm: brew install npm
  - jq: brew install jq
- Run the installer after getting an API key (free)
  - Installs playwright
  - Installs uv
  - Inserts JSON into your code editor (Cursor/Cline/Windsurf) for you!

  curl -LSf https://operative.sh/install.sh -o install.sh && bash install.sh && rm install.sh
- Visit your favorite IDE and restart to apply the changes
- Send a prompt in chat mode to call the we...