Agent skill
pdf-processing
提取PDF文件中的文本,并按标题/段落结构化输出。适用于文档总结、数据提取等场景。
Install this agent skill to your Project
npx add-skill https://github.com/ZBCccc/claude-skills/tree/main/skills/pdf-processing
SKILL.md
PDF文本提取技能
概述
本Skill用于从PDF文件中提取文本,并按照“标题→子标题→段落”的层级结构化输出。支持处理多页PDF,自动忽略页眉/页脚。
工作流程
- 上传PDF:用户将PDF文件上传至Claude Code或Claude.ai;
- 提取文本:调用
scripts/extract_text.py脚本,提取PDF中的所有文本; - 结构化处理:按换行符和标题层级(如“# 标题”“## 子标题”)拆分内容;
- 输出结果:根据
templates/output_template.md生成结构化Markdown文档。
使用示例
输入:上传一份“2024年年度报告.pdf”
输出:生成包含“1. 公司概况→1.1 业务范围→段落”的结构化总结。
依赖说明
- Python库:PyPDF2(用于PDF文本提取),需提前安装(
pip install pypdf2)。
Recommended Agent Skills
Expand your agent's capabilities with these related and highly-rated skills.
template-skill
Replace with description of the skill and when Claude should use it.
doc-coauthoring
Guide users through a structured workflow for co-authoring documentation. Use when user wants to write documentation, proposals, technical specs, decision docs, or similar structured content. This workflow helps users efficiently transfer context, refine content through iteration, and verify the doc works for readers. Trigger when user mentions writing docs, creating proposals, drafting specs, or similar documentation tasks.
internal-comms
A set of resources to help me write all kinds of internal communications, using the formats that my company likes to use. Claude should use this skill whenever asked to write some sort of internal communications (status reports, leadership updates, 3P updates, company newsletters, FAQs, incident reports, project updates, etc.).
mcp-builder
Guide for creating high-quality MCP (Model Context Protocol) servers that enable LLMs to interact with external services through well-designed tools. Use when building MCP servers to integrate external APIs or services, whether in Python (FastMCP) or Node/TypeScript (MCP SDK).
canvas-design
Create beautiful visual art in .png and .pdf documents using design philosophy. You should use this skill when the user asks to create a poster, piece of art, design, or other static piece. Create original visual designs, never copying existing artists' work to avoid copyright violations.
web-artifacts-builder
Suite of tools for creating elaborate, multi-component claude.ai HTML artifacts using modern frontend web technologies (React, Tailwind CSS, shadcn/ui). Use for complex artifacts requiring state management, routing, or shadcn/ui components - not for simple single-file HTML/JSX artifacts.
Didn't find tool you were looking for?