arXiv

Connect AI Agents to arXiv

Automate workflows and connect AI agents to arXiv. Metorial is built for developers and handles OAuth, compliance, observability, and more.


Understanding Story Data Structure and Fields

What is Story Data?

When you retrieve stories from Hacker News through this MCP server, each story comes with a structured set of fields that provide comprehensive information about the submission. Understanding these fields helps you make the most of the data and enables you to filter, analyze, and present Hacker News content effectively.

Core Story Fields

Identification and Content

Every Hacker News story has a unique ID that serves as its permanent identifier. This numeric value allows you to reference specific stories consistently, even as their rankings or visibility change over time.

The title field contains the headline as submitted by the user, while the url field provides the link to the external content being discussed. Not all stories have a URL: some, such as "Ask HN" posts and certain "Show HN" posts, exist purely as discussion threads on Hacker News itself.

For text-only posts, the text field contains the body content written by the submitter. This is particularly common for Ask HN questions, Show HN announcements, or job postings.
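As a minimal sketch of the distinction above, the snippet below classifies a story as a link post or a text post by checking which of the url and text fields is present. It assumes each story arrives as a plain dictionary keyed by the field names described here; the helper name post_kind and the sample stories are hypothetical and only for illustration.

```python
from typing import Any, Mapping


def post_kind(story: Mapping[str, Any]) -> str:
    """Classify a story dict as 'link', 'text', or 'empty' (hypothetical helper)."""
    if story.get("url"):
        return "link"   # external content is being discussed
    if story.get("text"):
        return "text"   # body written by the submitter, e.g. an Ask HN question
    return "empty"      # neither field is present


# Made-up sample data, assuming stories are plain dicts.
link_story = {"id": 1, "title": "An interesting article", "url": "https://example.com/article"}
ask_story = {"id": 2, "title": "Ask HN: How do you take notes?", "text": "Curious about workflows."}

print(post_kind(link_story))  # link
print(post_kind(ask_story))   # text
```

Checking url before text is a design choice for this sketch; a story that carries both fields is treated as a link post.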

Community Engagement Metrics

The score field holds the story's points, which are driven by upvotes and indicate community interest and approval. Higher scores generally go along with greater visibility and, often, more discussion.

The descendants field tells you how many comments exist in the entire discussion thread, including nested replies. This gives you a quick sense of how much conversation a story has generated.

Attribution and Timing

The by field identifies the username of the person who submitted the story, allowing you to track submissions from specific users or researchers whose content you value.

The time field contains a Unix timestamp indicating when the story was submitted. This temporal data is crucial for understanding trending topics, analyzing post timing strategies, or filtering stories by recency.
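The following sketch shows one way to turn the time field into a timezone-aware datetime and to test a story for recency. It assumes the timestamp is in seconds and that the story is a plain dictionary; the helper names submitted_at and is_recent, and the 24-hour default window, are hypothetical choices for illustration.

```python
from datetime import datetime, timedelta, timezone
from typing import Any, Mapping


def submitted_at(story: Mapping[str, Any]) -> datetime:
    """Convert the Unix timestamp in story['time'] (seconds) to a UTC datetime."""
    return datetime.fromtimestamp(story["time"], tz=timezone.utc)


def is_recent(story: Mapping[str, Any], now: datetime,
              max_age: timedelta = timedelta(hours=24)) -> bool:
    """Return True if the story was submitted within max_age before now."""
    return now - submitted_at(story) <= max_age


# Made-up sample story; only the time field matters here.
story = {"id": 1, "time": 1700000000}
print(submitted_at(story).isoformat())  # 2023-11-14T22:13:20+00:00

reference = datetime(2023, 11, 15, 12, 0, tzinfo=timezone.utc)
print(is_recent(story, reference))  # True
```

Passing the reference time in explicitly, rather than calling datetime.now inside the helper, keeps the recency check deterministic and easy to test.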

Organizational Data

The type field explicitly identifies the item as a "story" (as opposed to comments, polls, or other Hacker News content types). The kids field contains an array of comment IDs that are direct replies to the story, providing entry points into the discussion tree.
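Taken together, the fields described above can be modeled as a small typed record. The sketch below is one possible representation, not the server's own schema: it assumes the raw story is a dictionary keyed by these field names, treats url, text, score, descendants, and kids as optional, and uses a hypothetical Story class with a from_dict constructor.

```python
from dataclasses import dataclass, field
from typing import Any, List, Mapping, Optional


@dataclass
class Story:
    """Hypothetical typed view of the story fields described in this article."""
    id: int                      # permanent numeric identifier
    title: str                   # headline as submitted
    by: str                      # submitter's username
    time: int                    # Unix timestamp of submission
    type: str = "story"          # item type
    url: Optional[str] = None    # link to external content, if any
    text: Optional[str] = None   # body of a text-only post, if any
    score: int = 0               # story points
    descendants: int = 0         # total comments, including nested replies
    kids: List[int] = field(default_factory=list)  # IDs of direct replies

    @classmethod
    def from_dict(cls, raw: Mapping[str, Any]) -> "Story":
        """Build a Story from a raw dict, filling defaults for missing optional fields."""
        return cls(
            id=raw["id"],
            title=raw["title"],
            by=raw["by"],
            time=raw["time"],
            type=raw.get("type", "story"),
            url=raw.get("url"),
            text=raw.get("text"),
            score=raw.get("score", 0),
            descendants=raw.get("descendants", 0),
            kids=list(raw.get("kids", [])),
        )


# Made-up sample data for illustration.
raw = {"id": 42, "type": "story", "by": "alice", "time": 1700000000,
       "title": "Ask HN: Favorite tools?", "text": "What do you use?",
       "score": 17, "descendants": 5, "kids": [101, 102]}
story = Story.from_dict(raw)
print(story.url is None, len(story.kids), story.descendants)  # True 2 5
```

Note that kids lists only direct replies, so its length can be smaller than descendants, which counts the whole thread.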

Practical Applications

Understanding these fields allows you to build targeted queries and analyses. You might filter stories by a minimum score threshold to focus on highly regarded content, sort by time to track breaking news, or follow the submission history of particular users whose judgment you trust. Because the data is structured, it is straightforward to build custom views, generate reports, or integrate Hacker News content into your own workflows.
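The queries mentioned above can be sketched with a few small functions. This example assumes the stories have already been retrieved as a list of dictionaries; the function names and the sample data are hypothetical and stand in for whatever the server actually returns.

```python
from typing import Any, Dict, List

StoryDict = Dict[str, Any]


def filter_by_min_score(stories: List[StoryDict], min_score: int) -> List[StoryDict]:
    """Keep stories whose score is at least min_score (missing score counts as 0)."""
    return [s for s in stories if s.get("score", 0) >= min_score]


def newest_first(stories: List[StoryDict]) -> List[StoryDict]:
    """Sort stories by submission time, most recent first."""
    return sorted(stories, key=lambda s: s["time"], reverse=True)


def submitted_by(stories: List[StoryDict], username: str) -> List[StoryDict]:
    """Keep stories submitted by the given username."""
    return [s for s in stories if s.get("by") == username]


# Made-up sample data for illustration.
sample = [
    {"id": 1, "by": "alice", "score": 250, "time": 1700000000, "title": "First"},
    {"id": 2, "by": "bob", "score": 12, "time": 1700003600, "title": "Second"},
    {"id": 3, "by": "alice", "score": 90, "time": 1700007200, "title": "Third"},
]

popular = newest_first(filter_by_min_score(sample, 50))
print([s["id"] for s in popular])                        # [3, 1]
print([s["id"] for s in submitted_by(sample, "alice")])  # [1, 3]
```

Keeping each query as a separate function makes them easy to compose, as in the popular example, which filters by score and then sorts by recency.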

arXiv on Metorial

The arXiv integration lets you search for research papers, retrieve abstracts and metadata, and access the latest preprints directly from arXiv's repository to stay current with cutting-edge research in your field.


