Agent skill
pdf-text-extractor-best-practices
Sub-skill of pdf-text-extractor: Best Practices.
Install this agent skill to your Project
npx add-skill https://github.com/vamseeachanta/workspace-hub/tree/main/.claude/skills/_archive/data/documents/pdf-text-extractor/best-practices
Best Practices
- Use timeout for SQLite - `timeout=30` prevents lock errors
- Batch commits - Commit every 100 files, not every file
- Handle errors gracefully - Log and continue on failures
- Track progress - Enable resumption of interrupted jobs
- Chunk appropriately - 1500-2500 chars optimal for search
- Preserve page numbers - Essential for citations
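The practices above can be combined in one small indexing loop. The following is a minimal sketch, not the skill's actual implementation: the table names `chunks` and `processed`, the helper names `chunk_page` and `index_files`, and the `extract_pages` callable (assumed to yield `(page_number, text)` pairs for a file) are all hypothetical. It uses Python's standard `sqlite3` module, a fixed 2000-character chunk size chosen from the 1500-2500 range, and a naive character split rather than sentence-aware chunking.

```python
import logging
import sqlite3

logger = logging.getLogger(__name__)


def chunk_page(text, page, size=2000):
    """Split one page's text into chunks of at most `size` characters.

    Each chunk is paired with its page number so citations stay possible.
    """
    return [(page, text[i:i + size]) for i in range(0, len(text), size)]


def index_files(db_path, files, extract_pages, batch_size=100):
    """Index files into SQLite, committing in batches and skipping finished files.

    `extract_pages(path)` is a hypothetical callable that returns an iterable
    of (page_number, text) pairs; plug in whatever extractor you use.
    """
    # timeout=30: wait up to 30 seconds for a lock instead of failing at once.
    conn = sqlite3.connect(db_path, timeout=30)
    conn.execute(
        "CREATE TABLE IF NOT EXISTS chunks (path TEXT, page INTEGER, body TEXT)"
    )
    conn.execute(
        "CREATE TABLE IF NOT EXISTS processed (path TEXT PRIMARY KEY, status TEXT)"
    )
    # Progress tracking: files already indexed successfully are skipped on resume.
    done = {
        row[0]
        for row in conn.execute("SELECT path FROM processed WHERE status = 'ok'")
    }

    pending = 0
    for path in files:
        if path in done:
            continue
        try:
            rows = [
                (path, page, body)
                for page_no, text in extract_pages(path)
                for page, body in chunk_page(text, page_no)
            ]
            status = "ok"
        except Exception:
            # Log and continue: one bad file should not stop the whole job.
            logger.exception("Failed to extract %s", path)
            rows, status = [], "error"

        conn.executemany("INSERT INTO chunks VALUES (?, ?, ?)", rows)
        conn.execute(
            "INSERT OR REPLACE INTO processed VALUES (?, ?)", (path, status)
        )
        pending += 1
        if pending >= batch_size:
            conn.commit()  # batch commit: once per `batch_size` files
            pending = 0

    conn.commit()  # flush the final partial batch
    conn.close()
```

In this sketch, files that failed are recorded with status `error` and are retried on the next run, while files marked `ok` are skipped. Work since the last commit is lost if the job is interrupted, so a smaller `batch_size` trades throughput for less repeated work.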