> ## Documentation Index > Fetch the complete documentation index at: https://docs.asteroid.ai/llms.txt > Use this file to discover all available pages before exploring further. # AI Capabilities > Available capabilities for your AI Task nodes AI Task nodes can be equipped with a wide range of capabilities. Each capability unlocks additional tools the agent can use during execution. We suggest starting with the default capabilities, and enabling additional ones when strictly needed. **API Tool Identifiers** — When configuring AI Task node `tools` via the API or MCP server, use the snake\_case identifiers listed below each tool. You can also call the `listAvailableTools` MCP tool or `GET /agents/available-tools` API endpoint to discover all available tools programmatically. *** ## Web Browsing Essentials Core tools for basic navigation and DOM interaction. Navigate the browser to any URL (`go_to_url`) Reload the current webpage (`refresh_page`) Click, type, select elements, and interact via the DOM (`dom_browser_interaction`) **Critical: Basic Web Interaction is Required** **Basic Web Interaction is enabled by default and should always remain enabled for AI Task nodes.** This capability provides essential tools for clicking, typing, selecting elements, and interacting with the DOM. * Disabling Basic Web Interaction will prevent AI Task nodes from performing basic browser interactions * You should not disable this capability unless you have a very specific reason * This capability is part of the default "Web Browsing Essentials" set and is required for AI Task nodes to function properly *** ## Advanced Web Browsing Toolkit Tools for extraction, evaluation, and page manipulation Download or inspect the full HTML of the page (`extract_html`) Extract visible text as clean markdown (`get_text`) Run custom JS in the page context (`evaluate_javascript`) Reduce zoom level to view more content (`zoom_out`) Increase zoom level for readability (`zoom_in`) Take a screenshot of the current page (`take_screenshot`) Trigger automatic captcha solving (`solve_captcha`) Save the current page as a PDF file (`save_pdf`) *** ## Computer Vision Tools for image-based interaction when DOM access is insufficient. Click, locate, and interact using visual recognition (`computer_use`) *** ## Communication Send messages or work with email during execution. Ask the user questions or request clarification (`send_user_message`) Send emails with custom subject and body (`send_mail`) Retrieve inbound emails from the agent’s inbox (`get_mail`) Make HTTP API requests to external services (`send_api_request`) `send_mail` and `get_mail` require an agent profile to be attached to the execution — the inbox address is derived from the profile. See [Agent Emails](/fundamentals/emails) for the full setup and a prompt pattern for receiving email verification codes. *** ## File System Work with files locally or inside the browser session. View all files available in the execution context (`list_files`) Read text, images, PDFs, or downloaded files (`read_files`) Upload files to file input elements on a webpage (`upload_file`) *** ## Memory & Storage Store and retrieve execution-scoped data. Save notes or structured data to memory (`write_scratchpad`) Retrieve previously stored information (`read_scratchpad`) Access the current clipboard contents (`read_clipboard`) *** ## Google Sheets Read and write spreadsheet data. Retrieve values from cell ranges like A1:B10 (`google_sheets_get_data`) Update values in specific cells (`google_sheets_set_data`) *** ## Authentication Generate tokens and handle one-time passwords. Produce 6-digit MFA codes using stored credentials (`generate_totp_secret`) *** ## Context & Utilities Access deeper execution context or system utilities. Ask questions about past actions and stored information (`query_context`) Fetch the current datetime in any timezone (`get_datetime`)