Web Search integrations on Metorial

Tools that allow models to search and retrieve real-time information from the internet.

Browse all integrations

Showing 40-63 of 63

Parsehub

Manage web scraping projects, start and monitor scraping runs, and retrieve extracted data. List projects, launch scraping jobs with custom start URLs and parameters, cancel or delete runs, and download structured data in JSON format. Receive webhook notifications when run statuses change or extracted data becomes available.

Parsera

Extract structured data from web pages using AI-powered natural-language descriptions. Scrape any URL by specifying desired attributes with names, descriptions, and types. Parse raw HTML or text content directly. Convert web pages to markdown. Create reusable scrapers that can be run across multiple URLs at once. Generate deterministic Python scraping code for faster, cost-effective extraction. Supports proxy configuration for geo-specific access, precision mode for hidden HTML data, and custom cookies for authenticated pages.

Perplexity Ai

Perform AI-powered web searches and generate grounded answers with citations. Access multiple LLM providers (OpenAI, Anthropic, Google, xAI, Sonar) through a unified chat completions interface. Retrieve raw ranked web search results for RAG pipelines. Generate text embeddings for semantic search and retrieval. Supports streaming responses, structured JSON output, search recency and domain filtering, multi-step reasoning, deep research reports, model fallback, and configurable presets for common workflows. Manage API keys and groups programmatically.

Piloterr

Scrape and extract structured data from websites using 50+ pre-built endpoints. Crawl any webpage with anti-bot bypass, rotating proxies, and rendered HTML extraction. Extract LinkedIn profiles, companies, jobs, and posts. Search Google, Bing, and Brave and retrieve structured results. Scrape e-commerce product data from Amazon, Walmart, Best Buy, and other retailers. Enrich company data from a 60M+ company database. Detect website technology stacks. Look up domain WHOIS information, validate emails, and find email addresses. Retrieve Crunchbase funding and company data. Capture webpage screenshots. Extract data from real estate, automotive, and prediction market platforms. Monitor API usage and remaining credits.

Prerender

Manage pre-rendered HTML cache for JavaScript-heavy websites to improve SEO. Recache single or multiple URLs on demand, search cached URLs by query or exact match, clear cache using wildcard patterns, submit and manage XML sitemaps for automated crawling, control recache speed, manage connected domains, and render pages through a headless browser. Supports desktop and mobile adaptive rendering.

Scrape Do

Scrape public web pages and return raw HTML, JSON, or markdown content. Bypass anti-bot protections and CAPTCHAs automatically with rotating proxies across 150 countries. Render JavaScript-heavy pages using a headless browser with configurable wait conditions, viewport settings, and simulated user interactions. Capture screenshots of web pages. Extract structured product data from Amazon (product details, offers, search results) across 21 marketplaces. Scrape Google Search results and return structured JSON with organic results, ads, knowledge graphs, local packs, and more across 84 Google domains. Run large-scale asynchronous scraping jobs with webhook delivery. Manage geo-targeting, session persistence, device emulation, and custom headers/cookies for all requests. Use as a standard HTTP proxy compatible with Scrapy, Selenium, Puppeteer, and Playwright.

Seat Geek

Search and discover live events (sports, concerts, theater) across the United States and Canada. Look up detailed event information including dates, venues, performers, ticket price statistics, and popularity scores. Filter events by performer, venue, location, date range, taxonomy, and ticket pricing. Search for performers with details like images, genres, and external links. Look up venues by city, state, or geolocation. Retrieve event and performer recommendations based on similarity seeds. Browse hierarchical event category taxonomies. Note: does not support purchasing or booking tickets — users must be directed to SeatGeek URLs for transactions.

Scrapegraph Ai

Extract structured data from webpages using natural language prompts and AI. Scrape single pages or crawl entire websites with configurable depth and rules. Perform AI-powered web searches across multiple sources with structured results. Convert webpages to clean Markdown. Fetch raw HTML with JavaScript rendering. Automate browser interactions (clicking, typing, form filling, logging in) before extracting data. Discover website sitemaps. Supports custom output schemas, stealth mode, proxy routing, geo-targeting, and webhook notifications for crawl job completion. Check API credit balance and retrieve past request history.

Search API

Perform real-time web searches across multiple search engines (Google, Bing, Baidu, Yahoo, Yandex, DuckDuckGo, Naver) and retrieve structured JSON results. Search Google verticals including Images, Videos, News, Shopping, Maps, Scholar, Trends, Flights, Hotels, Jobs, Finance, Patents, and Autocomplete. Scrape results from YouTube, Amazon, Walmart, eBay, Airbnb, Tripadvisor, and app stores. Access ad transparency libraries for Meta, LinkedIn, Reddit, and TikTok. Track keyword rankings for SEO, look up geo-targeting locations, retrieve search history, and check account usage. Supports localization by language, country, and device targeting.

Semantic Scholar

Search and retrieve academic paper metadata from over 200 million scientific publications. Find papers by keyword queries with filters for year, venue, fields of study, and citation count. Look up detailed paper metadata including titles, abstracts, authors, citation counts, TLDR summaries, and SPECTER2 embeddings using various identifiers (DOI, ArXiv ID, PMID, etc.). Explore citation and reference graphs to trace connections between papers. Look up author profiles with publication history, affiliations, h-index, and citation metrics. Get paper autocomplete suggestions and paper recommendations based on positive and negative examples. Download full dataset releases or incremental diffs for large-scale offline research.

Semrush

Retrieve SEO, PPC, and competitive intelligence data for domains, keywords, and backlinks. Analyze domain rankings, organic and paid search performance, traffic estimates, and competitor landscapes across regional databases. Research keywords by volume, difficulty, CPC, and related terms. Pull backlink profiles including referring domains, anchor texts, and authority scores. Access website traffic analytics, audience demographics, geo distribution, and market intelligence. Manage Semrush projects, position tracking campaigns, and site audit configurations. Update and distribute business listing data in bulk. Retrieve local map rank tracking data including heatmaps and competitor rankings.

Serpapi

Search and extract structured data from 100+ search engines and platforms. Query Google, Bing, DuckDuckGo, Yahoo, Yandex, Baidu, and more for web, image, news, and video results. Search product listings on Amazon, Walmart, eBay, and Google Shopping. Retrieve local business data from Google Maps and Yelp. Search Google Flights, Hotels, and travel platforms for travel options. Access Google Scholar for academic papers and citations. Retrieve Google Trends data, finance/stock information, job listings, events, and app store results. Use Google Lens for reverse image search. Access AI-generated search answers from Google AI Mode and Bing Copilot. Get autocomplete suggestions, location-targeted results, and cached search archives. Returns clean structured JSON with organic results, knowledge graphs, featured snippets, ratings, reviews, pricing, and rich snippets.

Serpdog

Scrape search engine results and extract web data from multiple platforms. Search and extract Google results including web search, maps, news, shopping, images, videos, finance, jobs, scholar, and autocomplete. Scrape Bing search results, Amazon and Walmart product data, Yelp business listings, YouTube search results, and LinkedIn job postings. Perform general-purpose web scraping with JavaScript rendering, premium proxies, and CAPTCHA handling. Returns data in JSON or HTML format with geo-targeting and language configuration.

Serply

Search Google and Bing programmatically and retrieve structured SERP data. Perform web, news, image, video, product, job, and scholar searches with geo-targeting and device emulation. Track domain SERP rankings for SEO monitoring. Retrieve organic results, answer boxes, news articles, product listings, job postings, academic papers, and citation data as structured JSON. Receive webhook notifications for search completion, failures, and quota events.

Serphouse

Extract structured search engine results from Google, Bing, and Yahoo. Perform real-time SERP queries across web, image, news, video, jobs, and shopping verticals. Submit batch searches of up to 100 keywords for asynchronous processing. Retrieve parsed result types including organic results, knowledge graphs, People Also Ask, AI overviews, ads, map packs, and more. Query Google Trends data for keyword interest over time with geographic and category targeting. Look up supported locations, languages, and search engine domains for query targeting. Receive webhook notifications when batch search tasks complete. Check account credit usage and plan details.

Similarweb Digital Rank API

Retrieve website ranking data powered by SimilarWeb's SimilarRank algorithm. Look up a domain's global rank, country-specific rank, and category rank. List top-ranked websites globally (up to 5,000 results). Filter rankings by date range, country, and subdomain inclusion. Check remaining API credits and subscription status.

Supadata

Extract video transcripts, media metadata, and web page content as structured text and data. Transcribe videos from YouTube, TikTok, Instagram, Facebook, and X (Twitter) with language preferences and timestamped or plain text output. Fetch unified media metadata including engagement stats, author info, and platform-specific details. Use AI to extract structured data from video content via natural language prompts and JSON schemas. Scrape any web page into clean markdown, map website URLs, and crawl entire websites with configurable page limits. Search YouTube for videos, channels, and playlists, retrieve channel and playlist metadata, list channel or playlist video IDs, and translate YouTube transcripts into target languages.

Tavily

Search the web and extract content from URLs, optimized for AI and LLM workflows. Perform AI-powered web searches with configurable depth, topic filtering, time ranges, and domain controls. Extract clean, structured content from specific URLs in markdown or plain text. Crawl entire websites following links across pages with natural language instructions to guide traversal. Map website structures to discover all URLs without extracting content. Conduct autonomous multi-step research that produces comprehensive reports with citations. Track API credit usage across projects.

Browserless

Automate headless Chrome/Chromium browsers in the cloud for web scraping, content extraction, and browser automation tasks. Scrape structured data from web pages using CSS selectors, retrieve fully rendered HTML after JavaScript execution, generate PDFs from URLs or raw HTML, capture screenshots in JPEG or PNG, download files triggered by browser interactions, and execute custom Puppeteer code for multi-step workflows. Perform web searches with optional result scraping returning LLM-ready markdown or HTML. Unblock protected websites and bypass bot detection using stealth mode and CAPTCHA solving via BrowserQL. Run Lighthouse performance audits, record browser sessions as video, and manage persistent browser sessions with configurable TTL. Supports residential proxy routing for geo-targeted requests.

Jigsawstack

Scrape websites using natural language prompts to extract structured data. Perform AI-powered web search and deep research on topics. Analyze text sentiment, summarize content, translate text across 160+ languages, and check spelling. Convert natural language to SQL queries. Extract structured data from images using vision OCR (e.g., receipts, documents). Transcribe audio and video files to text with speaker diarization and language detection. Generate images from text prompts using multiple model backends. Convert HTML to PDF or images. Detect NSFW content, profanity, and spam. Classify text into custom categories. Upload, retrieve, and delete files in cloud storage. Search for addresses and places with geocoding. Generate text embeddings for semantic search. Receive webhook callbacks for long-running task results.

Moz

Analyze SEO metrics for URLs and domains, including Domain Authority, Page Authority, Spam Score, and link counts. Retrieve backlink data, anchor text analysis, and linking domain details. Perform keyword research with difficulty scores, search volume, intent analysis, and related suggestions. Discover link building opportunities through competitor link intersect analysis. Get top pages for any domain, global top-ranking pages and domains, and Brand Authority scores. Monitor index freshness and track API usage.

Scrapfly

Scrape web pages with anti-bot bypass, proxy rotation across 120+ countries, and JavaScript rendering. Capture full-page or targeted screenshots of any website. Extract structured data from web content using AI models, LLM prompts, or custom template rules. Crawl entire websites recursively with configurable depth, limits, and URL filtering. Supports browser automation scenarios, multiple output formats (HTML, JSON, markdown, text), session persistence, caching, and asynchronous processing via webhooks. Manage scraping projects with separate API keys, budgets, and quotas.

Scrapingant

Scrape web pages and extract data from websites using headless Chrome browsers with automatic proxy rotation, CAPTCHA bypass, and Cloudflare handling. Render JavaScript-heavy pages, execute custom JS snippets, and wait for specific page elements. Convert scraped HTML to Markdown for LLM/RAG use cases. Extract structured data using AI-powered extraction with free-form field descriptions. Configure proxy geo-targeting, block specific resource types, pass custom cookies and headers, and retrieve extended response data including cookies, headers, and XHR details. Check API credit usage and subscription status.

Zyte API

Fetch and extract web content from any website with automatic anti-bot bypassing. Perform raw HTTP requests or browser-rendered requests that execute JavaScript. Use AI-powered extraction to get structured data such as products, articles, job postings, and forum threads. Capture screenshots, execute browser actions (scroll, click, type, wait, run JavaScript), manage sessions and cookies, and configure geolocation and IP type. Monitor API usage via a stats endpoint. Supports proxy mode for integration with existing HTTP tools.

Connect agents to your company's tools

See how Metorial gives integrations the governance, tracing, and production controls teams need.