Connect Scrape Do to AI agents

Connect Scrape Do to Claude, Codex, Cursor, or other AI agents for your entire team. Metorial adds security, governance, and observability, and gives your team a unified Magic MCP URL to connect.

Supported Tools

get_async_job

Get Async Job

Check the status of an async scraping job and optionally retrieve task results. Returns job status, task statuses, and scraped content for completed tasks. Can also list all recent jobs or cancel a running job.

take_screenshot

Take Screenshot

Capture a visual screenshot of any public web page. Supports viewport screenshots, full-page screenshots, and partial screenshots targeting a specific CSS selector. Returns the screenshot as base64-encoded image data. Configure the viewport size, device type, and geo-targeting to capture location- or device-specific views.
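Because the screenshot comes back as base64-encoded image data, an agent or script usually needs to decode it before saving or displaying it. The sketch below shows one way to do that in Python. How the base64 string is extracted from the tool response is not described here, so the helper simply takes the string as input; the function name `save_screenshot` is hypothetical.

```python
import base64
import binascii
from pathlib import Path


def save_screenshot(b64_data: str, path: str) -> int:
    """Decode base64 image data, write it to `path`, and return the byte count.

    Hypothetical helper: the caller is assumed to have already pulled the
    base64 string out of the take_screenshot tool response.
    """
    # Some services prefix the payload with a data URI header
    # ("data:image/png;base64,"). Strip it if present (an assumption, not
    # something the tool description promises).
    if b64_data.startswith("data:") and "," in b64_data:
        b64_data = b64_data.split(",", 1)[1]
    try:
        raw = base64.b64decode(b64_data, validate=True)
    except binascii.Error as exc:
        raise ValueError("screenshot payload is not valid base64") from exc
    Path(path).write_bytes(raw)
    return len(raw)
```

Validating the input strictly means a truncated or corrupted payload fails loudly instead of producing an unreadable image file.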

amazon_search

Amazon Search

Search Amazon and get structured JSON results with product listings and rankings. Returns product details including ASIN, title, price, rating, review count, Prime eligibility, and sponsored status. Supports 21 international Amazon marketplaces with pagination and ZIP code-based location targeting.

scrape_webpage

Scrape Webpage

Scrape any public web page and return its content. Supports JavaScript rendering via headless browser, geo-targeting through specific countries or continents, device emulation, custom headers/cookies, and browser interactions like clicking and scrolling. Use **render** to enable JavaScript execution for dynamic pages. Use **output: "markdown"** for cleaner text extraction. Use **super** for harder-to-scrape targets with residential proxies.
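As an illustration of how the options above combine, the following sketch builds an MCP `tools/call` request for `scrape_webpage`. The JSON-RPC envelope follows the MCP specification; the `render`, `output`, and `super` names come from the description above, while the `url` argument name, the boolean types, and the helper `build_scrape_call` are assumptions made for this example. Check the tool's published input schema before relying on them.

```python
from typing import Any


def build_scrape_call(
    url: str,
    *,
    render: bool = False,
    markdown: bool = False,
    use_super: bool = False,
    request_id: int = 1,
) -> dict[str, Any]:
    """Build a JSON-RPC 2.0 `tools/call` request for the scrape_webpage tool."""
    # "url" is an assumed argument name; the tool description does not state it.
    arguments: dict[str, Any] = {"url": url}
    if render:
        # Enable JavaScript execution for dynamic pages.
        arguments["render"] = True
    if markdown:
        # Ask for markdown output for cleaner text extraction.
        arguments["output"] = "markdown"
    if use_super:
        # Use residential proxies for harder-to-scrape targets.
        arguments["super"] = True
    return {
        "jsonrpc": "2.0",
        "id": request_id,
        "method": "tools/call",
        "params": {"name": "scrape_webpage", "arguments": arguments},
    }
```

Calling `build_scrape_call("https://example.com", render=True, markdown=True)` yields a request whose arguments contain only `url`, `render`, and `output`; options that are left off are omitted so the tool's own defaults apply.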

get_account_stats

Get Account Stats

Retrieve your Scrape.do account usage statistics including subscription status, concurrent request limits, and remaining monthly request quota.

amazon_product

Amazon Product Details

Retrieve structured product data from Amazon by ASIN. Returns JSON with product details including title, pricing, images, ratings, specifications, and more. Also supports fetching offer listings (all sellers) for a product. Covers 21 international Amazon marketplaces with ZIP code-based location targeting for local pricing.

google_search

Google Search

Search Google and get structured JSON results including organic results, ads, knowledge graphs, local packs, video results, and 15+ other result types. Supports 84 Google domains for regional targeting, localization via language and geo-location codes, desktop/mobile SERP layouts, SafeSearch, time-based filters, and pagination.

create_async_job

Create Async Scraping Job

Submit a batch of URLs for asynchronous scraping. Returns a job ID to track progress and retrieve results later. Jobs run in a separate thread pool independent from the main API concurrency. Supports all standard scraping parameters including geo-targeting, headless rendering, and webhook delivery.
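The submit-then-poll pattern that `create_async_job` and `get_async_job` imply can be sketched as below. This is a minimal illustration, not the provider's documented contract: the argument names (`urls`, `job_id`), the response fields (`job_id`, `status`), and the terminal status strings are all assumptions, and `call_tool` stands in for whatever MCP client function your agent uses to invoke a tool.

```python
import time
from typing import Any, Callable

# A callable that invokes an MCP tool by name with arguments and returns its
# parsed result. Supplied by the caller; hypothetical in this sketch.
ToolCaller = Callable[[str, dict[str, Any]], dict[str, Any]]

# Assumed terminal states; the real status values may differ.
TERMINAL_STATES = {"completed", "failed", "canceled"}


def run_async_job(
    call_tool: ToolCaller,
    urls: list[str],
    *,
    poll_seconds: float = 5.0,
    max_polls: int = 60,
) -> dict[str, Any]:
    """Submit a batch of URLs, then poll until the job reaches a terminal state."""
    job = call_tool("create_async_job", {"urls": urls})
    job_id = job["job_id"]  # assumed response field
    for _ in range(max_polls):
        status = call_tool("get_async_job", {"job_id": job_id})
        if status.get("status") in TERMINAL_STATES:
            return status
        time.sleep(poll_seconds)
    raise TimeoutError(f"job {job_id} did not finish after {max_polls} polls")
```

Injecting `call_tool` keeps the loop independent of any particular MCP client and makes it easy to exercise with a stub. If the job is configured for webhook delivery, polling may be unnecessary.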

More integrations teams use with Scrape Do

GitHub

Manage repositories, issues, and pull requests. Create and configure branches, star repositories, review code, and merge changes. Automate CI/CD workflows with GitHub Actions, manage workflow runs, secrets, and artifacts. Track issues with labels, milestones, and assignees. Search across code, repositories, issues, and users. Manage organizations, teams, and memberships. Create and manage projects, gists, packages, deployments, and environments. Access security alerts including code scanning, secret scanning, and Dependabot alerts. Read and write file contents in repositories. Manage webhooks, notifications, and codespaces.

SharePoint

Manage SharePoint sites, document libraries, lists, and files. Create, read, update, and delete lists and list items with custom columns. Upload, download, move, copy, and version files in document libraries. Search across sites, files, folders, lists, and list items using Microsoft Search. Manage permissions at site, list, and item levels with granular access control. Define and manage content types and site columns. Subscribe to webhooks for list and library change notifications. Retrieve site properties and search for sites across Microsoft 365.

Salesforce

Manage CRM data including Accounts, Contacts, Leads, Opportunities, Cases, and custom objects. Create, read, update, and delete records. Query data using SOQL and search across objects using SOSL. Perform bulk data operations for large-scale imports, exports, and migrations. Execute composite requests to batch multiple operations in a single API call. Access analytics, reports, and dashboards. Manage files and attachments associated with records. Interact with Chatter feeds, posts, and groups for social collaboration. Subscribe to real-time change events via Change Data Capture and Platform Events. Manage org metadata including custom objects, fields, layouts, and workflows. Query data using GraphQL for precise data retrieval across related objects.

Airtable

Create, read, update, and delete records in Airtable bases and tables. Manage base schemas including creating tables and fields. Filter records using formulas, sort by fields, and scope queries to specific views. Upsert records to find, create, or update in a single call. Upload attachments to records, read and write record comments, list accessible bases, and receive real-time base change events through webhooks.

Bitbucket

Manage Git repositories, pull requests, and CI/CD pipelines on Bitbucket Cloud. Create, fork, and configure repositories within workspaces and projects. Create, review, approve, merge, and decline pull requests with inline code comments. Browse source code, list commits, and manage branches and tags. Track issues with the built-in issue tracker. Trigger, monitor, and manage Bitbucket Pipelines. List workspace members, configure repository default reviewers and branch restrictions, create and manage repository webhooks, and search code across repositories.

Heroku

Deploy, manage, and scale applications on Heroku's cloud platform. Create and configure apps, scale dynos, provision add-ons (databases, caching, etc.), manage configuration variables, build and release code, add custom domains and SSL certificates, manage collaborators and team permissions, configure pipelines for continuous delivery, set up log drains, and sync data with Salesforce via Heroku Connect. Subscribe to webhooks for real-time notifications on app changes, builds, releases, dyno lifecycle events, and more.

Technical notes for Scrape Do

Scrape public web pages and return raw HTML, JSON, or markdown content. Bypass anti-bot protections and CAPTCHAs automatically with rotating proxies across 150 countries. Render JavaScript-heavy pages using a headless browser with configurable wait conditions, viewport settings, and simulated user interactions. Capture screenshots of web pages. Extract structured product data from Amazon (product details, offers, search results) across 21 marketplaces. Scrape Google Search results and return structured JSON with organic results, ads, knowledge graphs, local packs, and more across 84 Google domains. Run large-scale asynchronous scraping jobs with webhook delivery. Manage geo-targeting, session persistence, device emulation, and custom headers/cookies for all requests. Use as a standard HTTP proxy compatible with Scrapy, Selenium, Puppeteer, and Playwright.

Connect Scrape Do to production AI agents

See how Metorial gives Scrape Do access the governance, tracing, and security controls that teams need.

Frequently asked questions

Common questions about connecting Scrape Do to AI agents with Metorial.

  1. Can Metorial connect Scrape Do to AI agents?
    Yes. Metorial connects AI agents to Scrape Do through a governed integration layer, so teams can use the provider while keeping access controlled and observable.
  2. Metorial is MCP compatible and lets teams expose approved provider tools to MCP-capable agents and clients through a controlled access layer.
  3. Metorial applies policies across users, groups, providers, agents, and individual tools, then records the context around every agent interaction.
  4. Yes. Metorial records provider activity so teams can inspect tool calls, troubleshoot integrations, and give security teams the visibility they need.