Agent skill
pypdf-common-issues
Sub-skill of pypdf: Common Issues.
Install this agent skill to your Project
npx add-skill https://github.com/vamseeachanta/workspace-hub/tree/main/.claude/skills/_archive/data/office/pypdf/common-issues
Common Issues
1. Encrypted PDF Error
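# Problem: Cannot read encrypted PDF
# Solution: Decrypt before accessing pages
from pypdf import PdfReader

reader = PdfReader("encrypted.pdf")
if reader.is_encrypted:
    # decrypt() returns a falsy value (PasswordType.NOT_DECRYPTED) if the password is wrong
    if not reader.decrypt("password"):
        raise ValueError("Incorrect password")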
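When the password is not known in advance, the return value of decrypt() can be used to try several candidates in turn. The following is a minimal sketch, not part of pypdf itself: the helper name try_passwords and the candidate list are hypothetical, and it assumes only that the reader exposes is_encrypted and a decrypt() method that returns a falsy value when the password is rejected, as pypdf's PdfReader does.

```python
def try_passwords(reader, candidates):
    """Return the first candidate that decrypts `reader`, or None.

    `reader` is expected to behave like pypdf.PdfReader: it exposes
    `is_encrypted` and `decrypt(password)`, where decrypt() returns a
    falsy value when the password is rejected.
    """
    if not reader.is_encrypted:
        return None  # nothing to decrypt
    for password in candidates:
        if reader.decrypt(password):
            return password
    raise ValueError("none of the candidate passwords worked")
```

With pypdf installed, this could be called as try_passwords(PdfReader("encrypted.pdf"), ["", "password"]). The empty string is worth including because some PDFs are encrypted with a blank user password.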
2. Text Extraction Returns Empty
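# Problem: extract_text() returns an empty string
# Cause: the PDF is probably image-based (scanned) and has no text layer
# Solution: run OCR on the rendered pages
# pip install pdf2image pytesseract
# (pdf2image also needs Poppler installed; pytesseract needs the Tesseract binary)
# Convert each page to an image with pdf2image, then OCR it with pytesseract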
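The OCR fallback described above can be sketched as follows. This is an illustrative outline, not a definitive implementation: the function names and the min_chars threshold of 20 characters are assumptions, and the OCR step needs pypdf, pdf2image (with Poppler) and pytesseract (with Tesseract) to be installed.

```python
def pages_needing_ocr(page_texts, min_chars=20):
    """Return 0-based indices of pages whose extracted text looks empty."""
    return [
        i for i, text in enumerate(page_texts)
        if len((text or "").strip()) < min_chars
    ]


def extract_with_ocr_fallback(pdf_path, min_chars=20, dpi=200):
    """Extract text per page, running OCR only on pages without usable text."""
    # Third-party imports are local so pages_needing_ocr() works without them.
    from pypdf import PdfReader
    from pdf2image import convert_from_path
    import pytesseract

    reader = PdfReader(pdf_path)
    texts = [page.extract_text() or "" for page in reader.pages]
    for i in pages_needing_ocr(texts, min_chars):
        # pdf2image page numbers are 1-based; render just this one page.
        images = convert_from_path(
            pdf_path, dpi=dpi, first_page=i + 1, last_page=i + 1
        )
        texts[i] = pytesseract.image_to_string(images[0])
    return texts
```

Rendering one page at a time keeps memory use low on long scans, at the cost of calling Poppler once per page.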
3. Memory Error with Large PDFs
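# Problem: Memory error with large files
# Solution: Split the file into smaller parts and process them one at a time
import os
from pypdf import PdfReader, PdfWriter

def split_large_pdf(input_path, output_dir, max_pages=100):
    os.makedirs(output_dir, exist_ok=True)  # make sure the output directory exists
    reader = PdfReader(input_path)
    total = len(reader.pages)
    for start in range(0, total, max_pages):
        # A fresh writer per chunk means each output holds at most max_pages pages
        writer = PdfWriter()
        end = min(start + max_pages, total)
        for i in range(start, end):
            writer.add_page(reader.pages[i])
        part = start // max_pages + 1
        writer.write(os.path.join(output_dir, f"part_{part}.pdf"))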
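The chunking arithmetic in split_large_pdf can be checked on its own, without opening a PDF. The generator below is a small sketch that reproduces the same start, end and part-number calculation; the name page_ranges is hypothetical.

```python
def page_ranges(total_pages, max_pages=100):
    """Yield (part_number, start, end) with 0-based start and exclusive end."""
    if max_pages < 1:
        raise ValueError("max_pages must be at least 1")
    for start in range(0, total_pages, max_pages):
        end = min(start + max_pages, total_pages)
        yield start // max_pages + 1, start, end
```

For a 250-page file with the default max_pages=100, this yields three parts: pages 0 to 99, 100 to 199, and 200 to 249. The last part is shorter because end is capped at the total page count.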