arXiv

Connect AI Agents to
arXiv

Automate workflows and connect AI agents to arXiv. Metorial is built for developers. Handling OAuth, compliance, observability, and more.

arXiv on Metorial

The arXiv integration lets you search for academic papers, retrieve abstracts and metadata, and access preprints directly from the repository to help with literature reviews and research discovery.

Deploy on Metorial

Combine arXiv with other tools

Metorial has 600+ integrations available. Here are some related ones you might find interesting.

Exa

Exa

The Exa integration lets you perform semantic searches across the web and retrieve high-quality content directly within your workflows, enabling AI agents to find and access relevant information from billions of web pages in real-time.

Hackernews

Hackernews

The Hackernews integration lets you fetch and analyze stories, comments, and user data from Hacker News directly within your workflow, enabling you to track trending topics, monitor discussions, and gather insights from the tech community.

Supabase

Supabase

The Supabase integration lets you query and manipulate your database tables, manage authentication, and interact with storage buckets directly from your AI assistant. Use it to build applications, analyze data, or automate database operations without leaving your workflow.

Google Calendar

Google Calendar

The Google Calendar integration lets you view, create, update, and manage calendar events directly from your workflow, enabling seamless scheduling and calendar management without switching applications.

Google Drive

Google Drive

The Google Drive integration lets you search, read, and manage files and folders in your Google Drive directly through Claude. Use it to access documents, create new files, organize content, and collaborate on shared resources without leaving your conversation.

Brave

Brave

The Brave integration lets you perform web searches using Brave Search directly from Claude, allowing you to retrieve up-to-date information, news, and web results without leaving your conversation.

Hugging Face

Hugging Face

The Hugging Face integration lets you search and explore models, datasets, and Spaces directly from your development environment, making it easy to discover the right pre-trained models and resources for your machine learning projects.

Tavily

Tavily

The Tavily integration lets you perform AI-optimized web searches and retrieve real-time information directly within your workflow, enabling your AI assistant to access current data and research capabilities for answering questions and gathering insights.

Perplexity

Perplexity

The Perplexity integration lets you search the web and access real-time information directly through AI-powered queries, bringing current answers and citations into your workflow.

Connect anything. Anywhere.

Supported tools and capabilities

Metorial helps you connect AI agents to arXiv with various tools and resources. Tools allow you to interact with perform specific actions, while resources provide read-only access to data and information.

Help & Documentation

Find guides and articles to help you get started with arXiv on Metorial.

More about arXiv

arXiv MCP Server

A powerful Model Context Protocol (MCP) server that provides seamless access to arXiv, the world's premier open-access repository for scholarly papers in physics, mathematics, computer science, and related fields. This server enables you to search for papers, retrieve detailed metadata, access abstracts, and obtain PDF links through a simple, intuitive interface that supports advanced search capabilities including Boolean operators and field-specific queries.

Overview

The arXiv MCP server acts as a bridge between your applications and arXiv's extensive database of over 2 million scholarly articles. Whether you're conducting literature reviews, tracking research by specific authors, exploring papers in particular categories, or building research tools, this server provides comprehensive functionality to interact with arXiv's content programmatically.

With support for complex search queries, pagination, customizable sorting, and direct access to paper metadata and PDFs, this server makes it effortless to integrate arXiv's wealth of academic knowledge into your workflows. The server handles the complexities of the arXiv API, presenting a clean, well-structured interface that returns consistently formatted results.

Features

Advanced Search Capabilities

Search for papers using natural language queries or construct sophisticated searches with Boolean operators (AND, OR, ANDNOT) and field-specific prefixes. Target your searches to specific fields like titles, authors, abstracts, or categories to find exactly what you need with precision.

Author-Specific Queries

Quickly retrieve all papers by a particular researcher or author. Perfect for tracking the work of influential researchers in your field, building citation networks, or conducting comprehensive author-based literature reviews.

Category Browsing

Explore papers within specific arXiv categories such as computer science subfields (cs.AI, cs.LG), mathematics areas (math.CO, math.NT), or physics domains. The server provides a complete list of available categories to help you navigate arXiv's organizational structure.

Flexible Result Management

Control your result sets with pagination support, configurable result limits, and multiple sorting options (by relevance, submission date, or last update date) in either ascending or descending order.

Direct Resource Access

Retrieve detailed paper information, abstracts, or PDF links directly using arXiv IDs through the server's resource templates, supporting both simple IDs and version-specific requests.

Tools

search_papers

Search for academic papers using a general query with full support for advanced search syntax.

Parameters:

  • query (required, string): Your search query. Can be simple text like "machine learning" or complex queries using Boolean operators and field prefixes such as "ti:transformer AND cat:cs.LG" or "au:Hinton"
  • max_results (optional, number, default: 10): Maximum number of results to return
  • start (optional, number, default: 0): Starting index for pagination
  • sort_by (optional, string, default: "relevance"): Sort order - choose from "relevance", "lastUpdatedDate", or "submittedDate"
  • sort_order (optional, string, default: "descending"): Sort direction - either "ascending" or "descending"

This tool is your primary interface for conducting searches across arXiv's entire corpus. Use field-specific prefixes to narrow your search: ti: for title, au: for author, abs: for abstract, and cat: for category. Combine multiple criteria with Boolean operators to construct precise queries.

search_papers_by_author

Search for papers by a specific author name.

Parameters:

  • author (required, string): Author name, such as "Geoffrey Hinton" or "Yann LeCun"
  • max_results (optional, number, default: 10): Maximum number of results to return
  • start (optional, number, default: 0): Starting index for pagination

This specialized tool simplifies author-based searches. Simply provide the author's name as it appears in their publications, and the tool returns all matching papers. Pagination support allows you to retrieve large result sets incrementally.

search_papers_by_category

Search for papers in a specific arXiv category.

Parameters:

  • category (required, string): arXiv category code such as "cs.AI" for Artificial Intelligence, "cs.LG" for Machine Learning, or "math.CO" for Combinatorics
  • max_results (optional, number, default: 10): Maximum number of results to return
  • start (optional, number, default: 0): Starting index for pagination

Browse papers within specific subject areas using arXiv's category taxonomy. This tool is ideal for staying current with recent publications in your field or exploring research trends within particular domains. Use the get_categories tool to discover available category codes.

get_categories

Get a list of available arXiv category codes and their descriptions.

Parameters: None required

This utility tool returns a comprehensive list of all arXiv categories with their codes and descriptions. Use it to discover the correct category codes for the search_papers_by_category tool or for constructing category-specific queries with the general search tool.

Resource Templates

paper

Retrieve detailed information about a specific paper by its arXiv ID.

URI Pattern: arxiv://paper/{arxiv_id}

Example: arxiv://paper/2301.07041 or arxiv://paper/1706.03762v7

Access complete metadata for any paper including title, authors, abstract, publication date, categories, comments, journal references, DOI, and PDF link. Supports version-specific requests by appending the version number (e.g., v7).

paper_abstract

Retrieve only the abstract of a specific paper by its arXiv ID.

URI Pattern: arxiv://paper/{arxiv_id}/abstract

Example: arxiv://paper/2301.07041/abstract

Quickly access just the abstract text of a paper without retrieving the full metadata. Useful when you only need a summary of the paper's content for analysis or display purposes.

paper_pdf

Get the PDF download URL for a specific paper by its arXiv ID.

URI Pattern: arxiv://paper/{arxiv_id}/pdf

Example: arxiv://paper/2301.07041/pdf

Obtain the direct PDF download link for any paper. Use this URL to download the full paper for reading, archival, or further processing.

Use Cases

The arXiv MCP server enables numerous research and development workflows:

Literature Review: Conduct comprehensive literature searches across multiple fields, filter by date ranges, and retrieve relevant papers systematically.

Research Monitoring: Track new publications by specific authors or in particular categories to stay current with the latest developments in your field.

Citation Analysis: Build datasets of papers for citation network analysis, bibliometric studies, or research trend identification.

Educational Tools: Integrate arXiv content into educational applications, reading list generators, or research training platforms.

Research Assistants: Power AI research assistants that can search for papers, summarize findings, and provide paper recommendations based on user queries.

Content Aggregation: Create curated collections of papers around specific topics, conferences, or research themes.

Search Query Examples

To help you get started with the powerful search capabilities:

  • Simple keyword search: "neural networks"
  • Title-specific search: "ti:attention mechanism"
  • Author search: "au:Yoshua Bengio"
  • Category search: "cat:cs.CV"
  • Combined search: "ti:transformer AND cat:cs.CL"
  • Exclude terms: "machine learning ANDNOT deep learning"
  • Multiple authors: "au:Hinton OR au:LeCun"
  • Complex query: "(ti:quantum OR ti:entanglement) AND cat:quant-ph"

The server handles these queries efficiently, returning well-structured results that include all relevant metadata for further processing or display.

Ready to build with Metorial?

Let's take your AI-powered applications to the next level, together.

About Metorial

Metorial provides developers with instant access to 600+ MCP servers for building AI agents that can interact with real-world tools and services. Built on MCP, Metorial simplifies agent tool integration by offering pre-configured connections to popular platforms like Google Drive, Slack, GitHub, Notion, and hundreds of other APIs. Our platform supports all major AI agent frameworks—including LangChain, AutoGen, CrewAI, and LangGraph—enabling developers to add tool calling capabilities to their agents in just a few lines of code. By eliminating the need for custom integration code, Metorial helps AI developers move from prototype to production faster while maintaining security and reliability. Whether you're building autonomous research agents, customer service bots, or workflow automation tools, Metorial's MCP server library provides the integrations you need to connect your agents to the real world.

Star us on GitHub