pdf-agent-mcp



Summary

A local MCP server that reads the text layer of PDF files, letting AI agents inspect a document and extract its text, outline, and per-page content.


<p align="right"> <a href="#readme-zh">中文</a> | <a href="#readme-en">English</a> </p>

<a id="readme-zh"></a>中文

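pdf-agent-mcp is a local MCP server that gives AI agents the ability to read the text layer of PDF files.

Tools

inspect_pdf: check basic PDF information, page count, and whether a text layer is likely present
extract_pdf_text: extract text in raw / lines / blocks mode
extract_pdf_outline: extract the PDF outline (bookmarks)
extract_pdf_page: extract the text items and coordinates of a single page

Usage

Requirement: Node.js 22+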

npm install
npm run dev

npm run lint
npm test
npm run build

Starting the server directly with npx is recommended:

npx -y github:sanhua1/pdf-agent-mcp

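Example natural-language prompts for agents

In Claude/Codex you can simply say:

First, inspect this PDF for me: /path/to/doc.pdf
Extract pages 1-5 in lines mode
The layout of page 10 is messy, so extract it again in blocks mode
Read the outline first, then organize it into a Markdown summary by chapter

Claude Code configuration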

{
  "mcpServers": {
    "pdf-agent-mcp": {
      "command": "npx",
      "args": ["-y", "github:sanhua1/pdf-agent-mcp"]
    }
  }
}

Codex configuration

[mcp_servers.pdf-agent-mcp]
command = "npx"
args = ["-y", "github:sanhua1/pdf-agent-mcp"]
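The Claude Code and Codex snippets above encode the same launch command in two formats. As a rough sketch of that relationship, the TypeScript below renders both from one definition. The `ServerEntry` type and the two helper functions are hypothetical names introduced for illustration; they are not part of pdf-agent-mcp, and where each client expects its config file to live is not covered here.

```typescript
// Hypothetical helper types and functions, for illustration only.
interface ServerEntry {
  name: string;
  command: string;
  args: string[];
}

// Values copied from the configuration snippets above.
const server: ServerEntry = {
  name: "pdf-agent-mcp",
  command: "npx",
  args: ["-y", "github:sanhua1/pdf-agent-mcp"],
};

// JSON form, mirroring the Claude Code snippet.
function toClaudeConfig(entry: ServerEntry): string {
  const config = {
    mcpServers: {
      [entry.name]: { command: entry.command, args: entry.args },
    },
  };
  return JSON.stringify(config, null, 2);
}

// TOML form, mirroring the Codex snippet. JSON.stringify is used only to
// quote strings, which is adequate for plain ASCII values like these.
function toCodexConfig(entry: ServerEntry): string {
  const args = entry.args.map((a) => JSON.stringify(a)).join(", ");
  return [
    `[mcp_servers.${entry.name}]`,
    `command = ${JSON.stringify(entry.command)}`,
    `args = [${args}]`,
  ].join("\n");
}

console.log(toClaudeConfig(server));
console.log(toCodexConfig(server));
```

Keeping a single definition makes it harder for the two client configurations to drift apart if the package reference ever changes.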

<a id="readme-en"></a>English

pdf-agent-mcp is a local MCP server for extracting text-layer content from PDF files.

Tools

inspect_pdf: inspect metadata, page count, and text-layer hints
extract_pdf_text: extract text in raw / lines / blocks modes
extract_pdf_outline: read PDF bookmarks/outlines
extract_pdf_page: extract text items with coordinates from a single page
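An MCP client invokes these tools by sending JSON-RPC `tools/call` requests. The sketch below builds such requests for two of the tools listed above. The tool names come from this README, but the argument names (`path`, `mode`) are assumptions made for illustration; the actual input schemas are whatever the server reports through `tools/list`.

```typescript
// Tool names are taken from the README; everything else is illustrative.
type PdfToolName =
  | "inspect_pdf"
  | "extract_pdf_text"
  | "extract_pdf_outline"
  | "extract_pdf_page";

interface ToolCallRequest {
  jsonrpc: "2.0";
  id: number;
  method: "tools/call";
  params: { name: PdfToolName; arguments: Record<string, unknown> };
}

let nextId = 1;

// Build an MCP `tools/call` request. Arguments are passed through untouched
// because the real parameter names are defined by the server, not here.
function buildToolCall(
  name: PdfToolName,
  args: Record<string, unknown>
): ToolCallRequest {
  return {
    jsonrpc: "2.0",
    id: nextId++,
    method: "tools/call",
    params: { name, arguments: args },
  };
}

// HYPOTHETICAL argument names (`path`, `mode`): confirm them against the
// schemas returned by `tools/list` before relying on them.
const inspect = buildToolCall("inspect_pdf", { path: "/path/to/doc.pdf" });
const extract = buildToolCall("extract_pdf_text", {
  path: "/path/to/doc.pdf",
  mode: "lines",
});

console.log(JSON.stringify(inspect));
console.log(JSON.stringify(extract));
```

In practice an agent host such as Claude Code or Codex constructs these messages itself; the sketch only shows what travels over the stdio connection.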

Quick Start

Requirement: Node.js 22+

npm install
npm run dev

Run with npx:

npx -y github:sanhua1/pdf-agent-mcp
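Since the server requires Node.js 22+, a launcher script may want to check the runtime before starting it. The following is a minimal sketch of such a check; `parseMajor` and `meetsNodeRequirement` are hypothetical helpers, not functions provided by pdf-agent-mcp.

```typescript
// Extract the major version from strings such as "22.3.0" or "v22.3.0".
function parseMajor(version: string): number {
  const match = /^v?(\d+)\./.exec(version.trim());
  if (!match) {
    throw new Error(`Unrecognized Node.js version: ${version}`);
  }
  return Number(match[1]);
}

// The README asks for Node.js 22+, so 22 is the default minimum here.
function meetsNodeRequirement(version: string, minMajor = 22): boolean {
  return parseMajor(version) >= minMajor;
}

console.log(meetsNodeRequirement("v22.3.0")); // true
console.log(meetsNodeRequirement("20.11.1")); // false
```

In a real script the version string would typically come from `process.versions.node` or the output of `node --version`.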
