Agent skill

pdf-skill

Expert in generating, parsing, and manipulating PDF documents using tools like PDFKit, PDF.js, and Puppeteer. Use when creating PDFs, extracting content, merging documents, or filling forms. Triggers include "PDF", "generate PDF", "parse PDF", "extract PDF", "merge PDF", "PDF form", "PDFKit".

Stars 66
Forks 6

Install this agent skill to your Project

npx add-skill https://github.com/404kidwiz/claude-supercode-skills/tree/main/pdf-skill

SKILL.md

PDF Skill

Purpose

Provides expertise in programmatic PDF generation, parsing, and manipulation. Specializes in creating PDFs from scratch, extracting content, merging/splitting documents, and handling forms using PDFKit, PDF.js, Puppeteer, and similar tools.

When to Use

  • Generating PDFs programmatically
  • Extracting text or data from PDFs
  • Merging or splitting PDF documents
  • Filling PDF forms programmatically
  • Converting HTML to PDF
  • Adding watermarks or annotations
  • Parsing PDF structure and metadata
  • Building PDF report generators

Quick Start

Invoke this skill when:

  • Generating PDFs from code or data
  • Extracting content from PDF files
  • Merging, splitting, or manipulating PDFs
  • Filling or creating PDF forms
  • Converting HTML/web pages to PDF

Do NOT invoke when:

  • Word document creation → use /docx-skill
  • Excel/spreadsheet work → use /xlsx-skill
  • PowerPoint creation → use /pptx-skill
  • General file operations → use Bash or file tools

Decision Framework

PDF Operation?
├── Generate from scratch
│   ├── Simple → PDFKit (Node) / ReportLab (Python)
│   └── Complex layouts → Puppeteer/Playwright + HTML
├── Parse/Extract
│   ├── Text extraction → pdf-parse / PyPDF2
│   └── Table extraction → Camelot / Tabula
├── Manipulate
│   └── pdf-lib (merge, split, edit)
└── Forms
    └── pdf-lib (fill) / PDFtk (advanced)

Core Workflows

1. PDF Generation with PDFKit

  1. Install PDFKit (npm install pdfkit)
  2. Create new PDDocument
  3. Add content (text, images, graphics)
  4. Style with fonts and colors
  5. Add pages as needed
  6. Pipe to file or response

2. HTML to PDF Conversion

  1. Set up Puppeteer/Playwright
  2. Navigate to HTML content or URL
  3. Configure page size and margins
  4. Set print options (headers, footers)
  5. Generate PDF buffer
  6. Save or stream result

3. PDF Parsing and Extraction

  1. Choose parser (pdf-parse, PyPDF2, pdfplumber)
  2. Load PDF file
  3. Extract text or structured data
  4. Handle multi-page documents
  5. Clean and normalize extracted text
  6. Output in desired format

Best Practices

  • Use vector graphics over raster when possible
  • Embed fonts for consistent rendering
  • Test PDF output across different readers
  • Handle large PDFs with streaming
  • Use appropriate library for task complexity
  • Consider accessibility (tagged PDFs)

Anti-Patterns

Anti-Pattern Problem Correct Approach
Image-only PDFs Not searchable/accessible Use text with fonts
No font embedding Rendering issues Embed required fonts
Memory loading large PDFs Crashes Stream processing
Ignoring encryption Security/access issues Handle encrypted PDFs
Wrong tool for job Over-engineering Match tool to complexity

Expand your agent's capabilities with these related and highly-rated skills.

404kidwiz/claude-supercode-skills

documentation-engineer

Technical documentation and knowledge management expert. Use when creating comprehensive documentation systems, improving developer knowledge sharing, or building documentation-driven development workflows.

66 6
Explore
404kidwiz/claude-supercode-skills

backend-developer

Comprehensive backend development for building production-ready server-side applications with multiple frameworks, databases, and deployment strategies. Use when building APIs, services, databases, or server infrastructure.

66 6
Explore
404kidwiz/claude-supercode-skills

powershell-5.1-expert

Expert in legacy Windows PowerShell 5.1. Specializes in WMI, ADSI, COM automation, and maintaining backward compatibility with Windows Server environments. Use for Windows-specific automation on legacy systems. Triggers include "PowerShell 5.1", "Windows PowerShell", "WMI", "ADSI", "COM object", "legacy PowerShell".

66 6
Explore
404kidwiz/claude-supercode-skills

qa-expert

Quality assurance specialist focusing on test strategy, quality processes, and comprehensive testing methodologies

66 6
Explore
404kidwiz/claude-supercode-skills

multi-agent-coordinator

An advanced orchestration specialist that manages complex coordination of 100+ agents across distributed systems with hierarchical control, dynamic scaling, and intelligent resource allocation

66 6
Explore
404kidwiz/claude-supercode-skills

tooling-engineer

Expert in building developer tools, CLI utilities, IDE extensions, and optimizing local development environments.

66 6
Explore

Didn't find tool you were looking for?

Be as detailed as possible for better results