get_account_info
Get Account Info
Retrieve Scrapfly account information including current project details, usage statistics, remaining credits, concurrency limits, and subscription information.
get_crawl_status
Retrieve the current status and progress of a running or completed crawl job. Returns page counts, credit usage, duration, and completion state.
start_crawl
Start a recursive website crawl from a given URL. The crawler automatically discovers and follows links, respecting configurable limits for page count, depth, duration, and budget. Supports URL path filtering, external domain control, proxy rotation, anti-bot bypass, and multiple content formats.
get_crawl_results
Retrieve the discovered URLs and their extracted content from a completed or running crawl. Returns the list of crawled URLs with metadata and optionally the page contents in the specified format.
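The three crawl tools above suggest a start, poll, fetch lifecycle. The following is a minimal sketch of that flow, not the integration's documented interface: the `call_tool` callable, the argument names (`url`, `max_pages`, `crawl_id`), and the `done` status field are all hypothetical placeholders, and a fake caller stands in for a real connection so the example runs on its own.

```python
import time
from typing import Any, Callable, Dict

# Hypothetical: any function that sends a tool name plus arguments and returns a dict.
ToolCaller = Callable[[str, Dict[str, Any]], Dict[str, Any]]


def run_crawl(call_tool: ToolCaller, url: str, max_pages: int = 10,
              poll_seconds: float = 2.0, max_polls: int = 30) -> Dict[str, Any]:
    """Start a crawl, poll its status until it reports completion, then fetch results."""
    # "url", "max_pages", and "crawl_id" are assumed names, not confirmed parameters.
    started = call_tool("start_crawl", {"url": url, "max_pages": max_pages})
    crawl_id = started["crawl_id"]

    for _ in range(max_polls):
        status = call_tool("get_crawl_status", {"crawl_id": crawl_id})
        if status.get("done"):  # "done" is an assumed completion flag
            break
        time.sleep(poll_seconds)
    else:
        raise TimeoutError(f"crawl {crawl_id} did not finish after {max_polls} polls")

    return call_tool("get_crawl_results", {"crawl_id": crawl_id})


def make_fake_caller(polls_until_done: int = 2) -> ToolCaller:
    """Build an in-memory stand-in that finishes after a fixed number of status polls."""
    state = {"polls": 0}

    def call_tool(name: str, arguments: Dict[str, Any]) -> Dict[str, Any]:
        if name == "start_crawl":
            return {"crawl_id": "demo-1"}
        if name == "get_crawl_status":
            state["polls"] += 1
            return {"done": state["polls"] >= polls_until_done}
        if name == "get_crawl_results":
            return {"urls": ["https://example.com/"]}
        raise ValueError(f"unknown tool: {name}")

    return call_tool
```

Because get_crawl_results also accepts a running crawl, a caller that wants partial results could fetch them inside the polling loop instead of waiting for completion.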
extract_data
Extract structured data from document content using Scrapfly's standalone Extraction API. Supports three extraction methods: AI auto-extraction with predefined models (product, article, review, real estate), LLM prompt-based extraction with natural language instructions, and custom template-based extraction with CSS/XPath/JMESPath rules. Accepts HTML, XML, JSON, CSV, RSS, Markdown, and plain text input.
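To illustrate the three extraction methods, here is a small sketch of an argument builder that selects one of them per call. The argument names (`content`, `content_type`, `extraction_model`, `extraction_prompt`, `extraction_template`) are hypothetical, and treating the three methods as mutually exclusive is an assumption made for this example rather than something the description states.

```python
from typing import Any, Dict, Optional


def build_extract_arguments(content: str, content_type: str, *,
                            model: Optional[str] = None,
                            prompt: Optional[str] = None,
                            template: Optional[Dict[str, Any]] = None) -> Dict[str, Any]:
    """Build arguments for extract_data using exactly one extraction method."""
    # All key names below are assumed for illustration only.
    candidates = {
        "extraction_model": model,        # AI auto-extraction, e.g. "product"
        "extraction_prompt": prompt,      # LLM prompt-based extraction
        "extraction_template": template,  # custom CSS/XPath/JMESPath rules
    }
    chosen = {key: value for key, value in candidates.items() if value is not None}
    if len(chosen) != 1:
        raise ValueError("pass exactly one of model, prompt, or template")
    return {"content": content, "content_type": content_type, **chosen}
```

For instance, `build_extract_arguments(html, "text/html", model="product")` would select the predefined product model, while passing `prompt=` instead would switch to natural language instructions.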
scrape_webpage
Scrape any web page and retrieve its content. Supports JavaScript rendering for dynamic pages, anti-bot bypass (ASP), proxy rotation across 120+ countries, and multiple output formats (raw HTML, clean HTML, JSON, markdown, text). Can also perform inline data extraction using AI models, LLM prompts, or custom templates during the scrape.
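If these tools are exposed over the Model Context Protocol (an assumption; the listing does not say so explicitly), a call to scrape_webpage would travel as a JSON-RPC `tools/call` request. The sketch below only builds that envelope. The argument names in the usage note (`url`, `render_js`, `format`) are hypothetical and not taken from the tool's schema.

```python
import json
from typing import Any, Dict


def build_tool_call(request_id: int, tool_name: str, arguments: Dict[str, Any]) -> str:
    """Serialize a JSON-RPC 2.0 request that invokes a named tool with arguments."""
    payload = {
        "jsonrpc": "2.0",
        "id": request_id,
        "method": "tools/call",
        "params": {"name": tool_name, "arguments": arguments},
    }
    return json.dumps(payload)


# Hypothetical arguments: a markdown scrape of a JavaScript-rendered page.
request = build_tool_call(
    1,
    "scrape_webpage",
    {"url": "https://example.com/", "render_js": True, "format": "markdown"},
)
```

An agent framework would normally construct and send this message itself; writing it out by hand is mainly useful for debugging what the agent asked for.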
capture_screenshot
Capture a screenshot of any web page. Supports full-page captures, viewport-only captures, or targeting specific elements via CSS selectors. Includes options for ad/banner blocking, dark mode, custom viewport resolution, accessibility testing, and JavaScript execution before capture.
Scrape web pages with anti-bot bypass, proxy rotation across 120+ countries, and JavaScript rendering. Capture full-page or targeted screenshots of any website. Extract structured data from web content using AI models, LLM prompts, or custom template rules. Crawl entire websites recursively with configurable depth, limits, and URL filtering. Supports browser automation scenarios, multiple output formats (HTML, JSON, markdown, text), session persistence, caching, and asynchronous processing via webhooks. Manage scraping projects with separate API keys, budgets, and quotas.
Common questions about connecting Scrapfly to AI agents with Metorial.