Browser Agent is an AI browser automation tool from Oxylabs AI Studio . It simulates real user browsing by executing multi-step actions like clicking links, filling forms, scrolling, capturing screenshots, and then extracting structured data – all controlled through natural language prompts.
Unlike traditional automation frameworks (e.g., Puppeteer or Selenium), Browser Agent requires no static scraping rules or manual scripting. Users can describe tasks in plain English or provide a sequence of steps, and the AI will carry them out just like a human would.
- Full control through browser AI – execute clicks, inputs, navigation, and scrolling.
- Multi-step task execution – define browsing flows in natural language.
- Multiple outputs – get results in JSON, Markdown, HTML, or PNG screenshots.
- Dynamic content support – interact with JavaScript-rendered pages.
- Schema-based extraction – request structured JSON after the browsing sequence completes.
To run tasks with browser AI agent, follow these steps:
- Enter the target URL.
- Describe the browsing process as:
- Natural language prompt (e.g. "Open the pricing page, accept cookies, and extract all product names with prices.)
- Structured step list – provide an array of AI browser actions (
click
,type
,navigate
,wait
,extract
).
- Select output format: JSON, Markdown, HTML, or PNG screenshot.
- (Optional) If JSON is selected, define or auto-generate a schema to structure the gathered data.
To begin, be sure you have access to an API key (or get a free trial with 1000 credits) and Python ver. 3.10
or above installed. You can install the oxylabs-ai-studio
package using pip:
pip install oxylabs-ai-studio
The following examples show how to use the browser AI agent to perform browsing and data extraction.
from oxylabs_ai_studio.apps.browser_agent import BrowserAgent browser_agent = BrowserAgent(api_key="<API_KEY>") schema = browser_agent.generate_schema( prompt="game name, platform, review stars and price" ) print("schema: ", schema) prompt = "Find if there is game 'super mario odyssey' in the store. If there is, find the price. Use search bar to find the game." url = "https://sandbox.oxylabs.io/" result = browser_agent.run( url=url, user_prompt=prompt, output_format="json", schema=schema, ) print(result.data)
Learn more about Browser Agent and Oxylabs AI Studio Python SDK in our PyPI repository.
You can also check out our AI Studio JavaScript SDK guide for JS users.
Parameter | Description | Default Value |
---|---|---|
url * |
Starting URL to browse | – |
user_prompt * |
Natural language prompt for extraction | – |
output_format |
Output format (json , markdown , html , screenshot ) |
markdown |
schema |
OpenAPI schema for structured extraction (mandatory for JSON) | – |
geo_location |
Proxy location in ISO2 format | – |
* – mandatory parameters
Browser Agent can return parsed results or screenshots that are easy to integrate into your applications. This is a direct output example of our request code:
type='json' content={'games': [{'game_name': 'Super Mario Odyssey', 'platform': None, 'review_stars': None, 'price': 89.99}]}
Browser Agent supports multiple output formats ("output": "YOUR_FORMAT"
):
json
– structured data using schema-based parsing.markdown
– easy-to-read data, perfect for AI and automation workflows.html
– raw HTML data of the webpage.screenshot
– PNG image of the browser content.
You can use AI Browser Agent in various ways, including:
- E-commerce checkout simulation – add items to cart, apply coupon, confirm checkout flow.
- Travel search automation – enter destinations, apply filters, and extract flight or hotel prices.
- Job search scraping – search for a role, click through postings, extract job details.
- Event & ticket discovery – navigate event sites, retrieve titles, dates, and prices.
- And many more...
Traditional tools rely on writing selectors and scripts for every action. AI browser agents replace that with natural language instructions and add organic browsing, making automation much faster, easier, and less fragile.
Yes, you can instruct the agent to enter text, submit forms, or click buttons. Keep in mind that sites with advanced bot detection may require advanced setup.
Browser Agent works on most public websites, including ones that rely on JavaScript or interactive flows. However, you should always make sure your use case complies with the target website’s Terms of Service and applicable laws.
Oxylabs AI Studio Browser Agent is free to try by signing up for a free trial that includes 1,000 credits. After the trial, the monthly plans start at just 12ドル/month with 3000 credits and 1 request/s, with higher plans offering more credits and higher request rates.
For a deeper dive into available parameters, advanced integrations, and additional examples, check out the AI Studio documentation.
If you have questions or need support, reach out to us at hello@oxylabs.io, through live chat, or join our Discord community.