Agent skill
doc-extraction
Classify and extract structured content from engineering documents using a 3-layer taxonomy: generic content types, engineering patterns, and domain sub-skills. Use when ingesting standards, reports, or technical manuals into structured data for downstream analysis. type: reference
Install this agent skill to your Project
npx add-skill https://github.com/vamseeachanta/workspace-hub/tree/main/.claude/skills/engineering/doc-extraction
SKILL.md
Doc Extraction
When to Use
- Ingesting a new standard or code (DNV-RP, API RP, ISO, ASME)
- Extracting constants, equations, or tables from technical reports
- Building structured datasets from engineering manuals
- Populating knowledge bases from document collections
- Pre-processing documents before analysis workflow
Related Skills
- document-index-pipeline — 7-phase A→G pipeline
- doc-intelligence-promotion — Deep extraction post-processing
- cathodic-protection — CP system design
- viv-analysis — VIV assessment for risers
- fitness-for-service — FFS assessment
- structural-analysis — Structural checks
References
- DNV-RP-B401: Cathodic Protection Design
- DNV-RP-C205: Environmental Conditions and Environmental Loads
- API 579-1/ASME FFS-1: Fitness-for-Service
- API RP 16Q: Design, Selection, Operation, and Maintenance of Marine Drilling Riser Systems
Sub-Skills
- Yield Reality (WRK-1246 Corpus Assessment)
- Architecture
- 1.
constants(0% yield — not yet implemented) (+7) - Unit Detection and Normalization (+4)
- Extraction Workflow
- Output Schema
- Domain Sub-Skills
- Hybrid Classification Strategy (WRK-1188 Learning)
Recommended Agent Skills
Expand your agent's capabilities with these related and highly-rated skills.
gsd-complete-milestone
Archive completed milestone and prepare for next version
gsd-reapply-patches
Reapply local modifications after a GSD update
gsd-verify-work
Validate built features through conversational UAT
gsd-thread
Manage persistent context threads for cross-session work
clinical-trial-protocol
Generate clinical trial protocols for medical devices or drugs through a modular, waypoint-based architecture with research-only and full protocol modes.
single-cell-rna-qc
Performs quality control on single-cell RNA-seq data (.h5ad or .h5 files) using scverse best practices with MAD-based filtering and comprehensive visualizations.
Didn't find tool you were looking for?