by oxylabs
AI Browser Agent is an advanced Browser AI tool developed by Oxylabs AI Studio that automates real user browsing tasks using natural language instructions.
# Add to your Claude Code skills
git clone https://github.com/oxylabs/browser-agent-py
Browser Agent is an AI browser automation tool from Oxylabs AI Studio. It simulates real user browsing by executing multi-step actions like clicking links, filling forms, scrolling, capturing screenshots, and then extracting structured data – all controlled through natural language prompts.
Unlike traditional automation frameworks (e.g., Puppeteer or Selenium), Browser Agent requires no static scraping rules or manual scripting. Users can describe tasks in plain English or provide a sequence of steps, and the AI will carry them out just like a human would.
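As a rough illustration of the "sequence of steps" style, the sketch below joins an ordered list of plain-English actions into one numbered prompt string. The helper name `build_step_prompt` and the numbering format are assumptions made for this example and are not part of the Oxylabs SDK; since the agent takes free-form text, any clear phrasing should serve equally well.

```python
def build_step_prompt(steps: list[str]) -> str:
    """Join plain-English steps into one numbered prompt (hypothetical helper)."""
    # Drop empty entries and surrounding whitespace so numbering stays contiguous.
    cleaned = [step.strip() for step in steps if step.strip()]
    if not cleaned:
        raise ValueError("at least one non-empty step is required")
    # Number the steps starting at 1, one step per line.
    return "\n".join(f"{i}. {step}" for i, step in enumerate(cleaned, start=1))


if __name__ == "__main__":
    print(build_step_prompt([
        "Open the store's home page",
        "Search for the game by name",
        "Extract the price from the first result",
    ]))
```

The resulting string is ordinary text, so it can be used wherever a single-sentence task description would otherwise go.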
To run tasks with the browser AI agent, follow these steps:
click, type, navigate, wait, extract).
To begin, make sure you have an API key (or get a free trial with 1,000 credits) and Python 3.10 or later installed. You can install the oxylabs-ai-studio package using pip:
pip install oxylabs-ai-studio
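Before running anything, it can help to confirm the two prerequisites mentioned above. This is a minimal sketch using only the standard library: it checks for Python 3.10 or later and reads the API key from an environment variable instead of hard-coding it. The variable name `OXYLABS_AI_STUDIO_API_KEY` and both function names are assumptions chosen for this example, not names defined by the package.

```python
import os
import sys

MIN_PYTHON = (3, 10)
# Assumed variable name for illustration; use whatever name fits your setup.
API_KEY_ENV_VAR = "OXYLABS_AI_STUDIO_API_KEY"


def python_is_supported(version=None) -> bool:
    """Return True if the given (or running) interpreter is at least 3.10."""
    current = tuple(version if version is not None else sys.version_info)[:2]
    return current >= MIN_PYTHON


def load_api_key(env=None) -> str:
    """Read the API key from a mapping (defaults to os.environ) or fail loudly."""
    source = os.environ if env is None else env
    key = source.get(API_KEY_ENV_VAR, "").strip()
    if not key:
        raise RuntimeError(f"Set {API_KEY_ENV_VAR} before creating a BrowserAgent")
    return key


if __name__ == "__main__":
    print("Python version OK:", python_is_supported())
```

The string returned by `load_api_key()` could then be passed as the `api_key` argument in place of the `"<API_KEY>"` placeholder.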
The following examples show how to use the browser AI agent to perform browsing and data extraction.
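from oxylabs_ai_studio.apps.browser_agent import BrowserAgent
# Create the client with your AI Studio API key.
browser_agent = BrowserAgent(api_key="<API_KEY>")
# Generate a schema from a plain-English description of the fields to extract.
schema = browser_agent.generate_schema(
    prompt="game name, platform, review stars and price"
)
print("schema:", schema)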
prompt = "Find if there is game 'super mario odyssey'...