MCPSERV.CLUB
icyclv

Arxiv Semantic Search MCP

MCP Server

AI‑powered search for arXiv papers via semantic and keyword queries

Stale(55)
1stars
2views
Updated Jul 16, 2025

About

A lightweight MCP server enabling AI assistants to query arXiv using vector embeddings for semantic search and structured keyword filters, providing paper details, categories, and server time.

Capabilities

Resources
Access data sources
Tools
Execute functions
Prompts
Pre-built templates
Sampling
AI model interactions

Arxiv Semantic Search MCP in Action

The Arxiv Semantic Search MCP is a lightweight Model Context Protocol server that bridges AI assistants with the vast repository of academic papers hosted on arXiv. By exposing a set of well‑defined functions, it allows assistants such as Claude to perform sophisticated literature searches without leaving the conversational interface. This solves a common pain point for researchers and developers: quickly locating relevant scholarly work without manually navigating the arXiv website or parsing raw API responses.

At its core, the server offers two complementary search modes. Semantic Search transforms a natural‑language query into high‑dimensional vector embeddings, then retrieves papers whose content aligns meaningfully with the intent behind the query. Powered by ArxivSearch, this mode is especially powerful for exploring nuanced topics within Computer Science categories (cs.*). Keyword Search, on the other hand, gives fine‑grained control over query structure—filtering by categories, date ranges, and field‑specific terms—to support more targeted queries across all arXiv disciplines. Together these approaches cover both exploratory and precision search needs.

Key capabilities include:

  • : Natural‑language queries mapped to semantic embeddings.
  • : Structured filtering with multiple parameters such as categories, dates, and sort options.
  • : Fetch comprehensive metadata for a paper by its arXiv ID.
  • : Retrieve the full taxonomy of arXiv subjects for informed filtering.
  • : Synchronize timestamps between the assistant and the server.

Developers can weave these functions into AI workflows by adding a simple MCP configuration. Once integrated, assistants can ask questions like “Show me recent advances in vision transformers for medical imaging” and receive a curated list of papers, complete with titles, authors, abstracts, and download links—all without leaving the chat. The server’s lightweight design (Python 3.12+, uv) ensures it can run locally or in cloud environments, making it ideal for both personal research bots and enterprise‑grade knowledge bases.

Unique advantages of this MCP include its dual search strategy, which balances semantic relevance with precise filtering; the ability to pull real‑time paper details on demand; and its straightforward integration path for any tool that speaks MCP. Whether you’re building a research assistant, an academic recommendation engine, or a citation‑generation pipeline, the Arxiv Semantic Search MCP provides a reliable, extensible bridge between AI conversational agents and scholarly literature.