Agent skill
pdf-text-extractor-features
Sub-skill of pdf-text-extractor: Features.
Install this agent skill to your Project
npx add-skill https://github.com/vamseeachanta/workspace-hub/tree/main/.claude/skills/_archive/data/documents/pdf-text-extractor/features
SKILL.md
Features
Features
- Page-aware extraction - Track which page text comes from
- Intelligent chunking - Split long pages into manageable chunks
- Metadata extraction - Title, author, creation date
- Batch processing - Handle thousands of PDFs efficiently
- Error recovery - Skip corrupted files, continue processing
- Progress tracking - Resume interrupted extractions
Recommended Agent Skills
Expand your agent's capabilities with these related and highly-rated skills.
gsd-complete-milestone
Archive completed milestone and prepare for next version
gsd-reapply-patches
Reapply local modifications after a GSD update
gsd-verify-work
Validate built features through conversational UAT
gsd-thread
Manage persistent context threads for cross-session work
clinical-trial-protocol
Generate clinical trial protocols for medical devices or drugs through a modular, waypoint-based architecture with research-only and full protocol modes.
single-cell-rna-qc
Performs quality control on single-cell RNA-seq data (.h5ad or .h5 files) using scverse best practices with MAD-based filtering and comprehensive visualizations.
Didn't find tool you were looking for?