> ## Documentation Index
> Fetch the complete documentation index at: https://docs.asteroid.ai/llms.txt
> Use this file to discover all available pages before exploring further.

# AI Capabilities

> Available capabilities for your AI Task nodes

AI Task nodes can be equipped with a wide range of capabilities.
Each capability unlocks additional tools the agent can use during execution. We suggest starting with the default capabilities, and enabling additional ones when strictly needed.

<Tip>
  **API Tool Identifiers** — When configuring AI Task node `tools` via the API or MCP server, use the snake\_case identifiers listed below each tool. You can also call the `listAvailableTools` MCP tool or `GET /agents/available-tools` API endpoint to discover all available tools programmatically.
</Tip>

***

## Web Browsing Essentials

Core tools for basic navigation and DOM interaction.

<Card title="Go to URL" icon="globe" horizontal>
  Navigate the browser to any URL (`go_to_url`)
</Card>

<Card title="Refresh Page" icon="arrow-rotate-right" horizontal>
  Reload the current webpage (`refresh_page`)
</Card>

<Card title="Basic Web Interaction" icon="mouse-pointer" horizontal>
  Click, type, select elements, and interact via the DOM (`dom_browser_interaction`)
</Card>

<Warning>
  **Critical: Basic Web Interaction is Required**

  **Basic Web Interaction is enabled by default and should always remain enabled for AI Task nodes.** This capability provides essential tools for clicking, typing, selecting elements, and interacting with the DOM.

  * Disabling Basic Web Interaction will prevent AI Task nodes from performing basic browser interactions
  * You should not disable this capability unless you have a very specific reason
  * This capability is part of the default "Web Browsing Essentials" set and is required for AI Task nodes to function properly
</Warning>

***

## Advanced Web Browsing Toolkit

Tools for extraction, evaluation, and page manipulation

<Card title="Extract HTML" icon="code" horizontal>
  Download or inspect the full HTML of the page (`extract_html`)
</Card>

<Card title="Get Text" icon="text" horizontal>
  Extract visible text as clean markdown (`get_text`)
</Card>

<Card title="Evaluate JavaScript" icon="code" horizontal>
  Run custom JS in the page context (`evaluate_javascript`)
</Card>

<Card title="Zoom Out" icon="magnifying-glass-minus" horizontal>
  Reduce zoom level to view more content (`zoom_out`)
</Card>

<Card title="Zoom In" icon="magnifying-glass-plus" horizontal>
  Increase zoom level for readability (`zoom_in`)
</Card>

<Card title="Take Screenshot" icon="camera" horizontal>
  Take a screenshot of the current page (`take_screenshot`)
</Card>

<Card title="Solve Captcha" icon="puzzle-piece" horizontal>
  Trigger automatic captcha solving (`solve_captcha`)
</Card>

<Card title="Save PDF" icon="file-pdf" horizontal>
  Save the current page as a PDF file (`save_pdf`)
</Card>

***

## Computer Vision

Tools for image-based interaction when DOM access is insufficient.

<Card title="Computer Vision" icon="eye" horizontal>
  Click, locate, and interact using visual recognition (`computer_use`)
</Card>

***

## Communication

Send messages or work with email during execution.

<Card title="Send User Message" icon="message" horizontal>
  Ask the user questions or request clarification (`send_user_message`)
</Card>

<Card title="Send Mail" icon="envelope" horizontal>
  Send emails with custom subject and body (`send_mail`)
</Card>

<Card title="Get Mail" icon="inbox" horizontal>
  Retrieve inbound emails from the agent’s inbox (`get_mail`)
</Card>

<Card title="Send API Request" icon="server" horizontal>
  Make HTTP API requests to external services (`send_api_request`)
</Card>

<Info>
  `send_mail` and `get_mail` require an agent profile to be attached to the execution — the inbox address is derived from the profile. See [Agent Emails](/fundamentals/emails) for the full setup and a prompt pattern for receiving email verification codes.
</Info>

***

## File System

Work with files locally or inside the browser session.

<Card title="List Files" icon="folder-open" horizontal>
  View all files available in the execution context (`list_files`)
</Card>

<Card title="Read Files" icon="file-lines" horizontal>
  Read text, images, PDFs, or downloaded files (`read_files`)
</Card>

<Card title="Upload File" icon="upload" horizontal>
  Upload files to file input elements on a webpage (`upload_file`)
</Card>

***

## Memory & Storage

Store and retrieve execution-scoped data.

<Card title="Write Scratchpad" icon="pen-to-square" horizontal>
  Save notes or structured data to memory (`write_scratchpad`)
</Card>

<Card title="Read Scratchpad" icon="book-open" horizontal>
  Retrieve previously stored information (`read_scratchpad`)
</Card>

<Card title="Read Clipboard" icon="clipboard" horizontal>
  Access the current clipboard contents (`read_clipboard`)
</Card>

***

## Google Sheets

Read and write spreadsheet data.

<Card title="Sheets: Get Data" icon="table" horizontal>
  Retrieve values from cell ranges like A1:B10 (`google_sheets_get_data`)
</Card>

<Card title="Sheets: Set Data" icon="table" horizontal>
  Update values in specific cells (`google_sheets_set_data`)
</Card>

***

## Authentication

Generate tokens and handle one-time passwords.

<Card title="Generate TOTP Secret" icon="shield" horizontal>
  Produce 6-digit MFA codes using stored credentials (`generate_totp_secret`)
</Card>

***

## Context & Utilities

Access deeper execution context or system utilities.

<Card title="Query Context" icon="search" horizontal>
  Ask questions about past actions and stored information (`query_context`)
</Card>

<Card title="Get Datetime" icon="clock" horizontal>
  Fetch the current datetime in any timezone (`get_datetime`)
</Card>
