AI Browser Agent
Learn about AI browser agent nodes and their capabilities
The AI Browser Agent node executes browser automation tasks through natural language instructions. It can navigate web pages, interact with DOM elements, fill forms, and extract data.
Navigation
Navigate to URLs and traverse page structures
Form Interaction
Fill input fields, select dropdowns, and submit forms
Data Extraction
Extract text, attributes, and structured data from elements
Vision Support
Process visual page elements when enabled
Configuration Options
Basic Settings
Name
Name
Set a descriptive name for your AI Browser Agent node. This helps identify the node’s purpose in your workflow.
Description
Description
Provide a clear description of what this specific browser agent instance will accomplish. This serves as documentation for your workflow. This will not be interpreted by the AI
Prompt Configuration
The prompt field contains the task instructions for the browser agent. Include:
- Task Objective: Specify the primary action (navigate, extract, fill form, click element)
- Target Elements: Describe specific elements, selectors, or page sections to interact with
- Transitions: Specify when and where should the node transition to
You can add variables to the AI Browser Node by entering text inbetween two curly braces and a dot before the variable name
{{.USER_INFO}}
Example:
Agent Tools
Base Browser Interaction
Default: Enabled - functions:
- URL navigation
- Element selection and clicking
- Form field population
- Text and attribute extraction
- Page screenshot capture
Vision Capabilities
Optional: Visual processing capabilities:
- Image-based element recognition
- Layout understanding
- Visual CAPTCHA handling
- Screenshot analysis
Vision capabilities increase resource usage and execution time. Enable only when visual analysis is required.
Additional Tools
Custom tools such as Send User Message
, Scratchpad
or Upload/Download Files
can be added to extend browser agent functionality. Learn more here:
Use Cases
Form Submission
Form Submission
Fill and submit web forms. Handles input fields, dropdowns, checkboxes, and radio buttons.
Data Extraction
Data Extraction
Extract structured data from web pages.
Navigation Testing
Navigation Testing