Agent skill
docx-extract-text-with-pandoc
Sub-skill of docx: Extract Text with Pandoc, plus Python Text Extraction and Extract Tables.
Install this agent skill to your Project
npx add-skill https://github.com/vamseeachanta/workspace-hub/tree/main/.claude/skills/_archive/data/documents/docx/extract-text-with-pandoc
SKILL.md
Extract Text with Pandoc (+2)
Extract Text with Pandoc
pandoc document.docx -t plain -o output.txt
pandoc document.docx -t markdown -o output.md
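The two commands above can also be driven from Python. The following is a minimal sketch, assuming pandoc is installed and on PATH; `build_pandoc_command` and `convert_docx` are hypothetical helper names, and the default output suffixes are an assumption for illustration. Only the `-t` and `-o` flags shown above are used.

```python
import shutil
import subprocess
from pathlib import Path


def build_pandoc_command(source, target_format, output):
    """Return the argv list for `pandoc SOURCE -t FORMAT -o OUTPUT`."""
    return ["pandoc", str(source), "-t", target_format, "-o", str(output)]


def convert_docx(source, target_format="plain", output=None):
    """Run pandoc on a .docx file and return the output path (hypothetical helper)."""
    if shutil.which("pandoc") is None:
        raise RuntimeError("pandoc was not found on PATH")
    if output is None:
        # Assumed naming scheme: .txt for plain text, .md for anything else.
        suffix = ".txt" if target_format == "plain" else ".md"
        output = Path(source).with_suffix(suffix)
    # check=True raises CalledProcessError if pandoc exits with an error.
    subprocess.run(build_pandoc_command(source, target_format, output), check=True)
    return Path(output)
```

Calling `convert_docx("document.docx")` would be equivalent to the first command, except that the output is written next to the source as `document.txt`.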
Python Text Extraction
from docx import Document
doc = Document("document.docx")
for para in doc.paragraphs:
    print(para.text)
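When the text is needed as a single string rather than printed, the loop above can be wrapped in a small helper. This is a sketch under assumptions: `join_paragraph_text` and `extract_text` are hypothetical names, and skipping blank paragraphs is an illustrative choice, not something the skill prescribes.

```python
def join_paragraph_text(paragraphs, skip_empty=True):
    """Join the .text of each paragraph-like object with newlines."""
    texts = (p.text for p in paragraphs)
    if skip_empty:
        # Drop paragraphs that are empty or contain only whitespace.
        texts = (t for t in texts if t.strip())
    return "\n".join(texts)


def extract_text(path):
    """Return the body text of a .docx file as one string (requires python-docx)."""
    # Imported lazily so join_paragraph_text stays usable without python-docx.
    from docx import Document

    return join_paragraph_text(Document(path).paragraphs)
```

Note that `doc.paragraphs` covers body paragraphs only; text inside tables has to be read through `doc.tables`.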
Extract Tables
from docx import Document
doc = Document("document.docx")
for table in doc.tables:
    for row in table.rows:
        for cell in row.cells:
            print(cell.text, end="\t")
        print()
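For further processing it is often more convenient to collect the cells into rows than to print them. The sketch below assumes the same `table.rows` / `row.cells` / `cell.text` attributes used above; `table_to_rows` and `rows_to_csv` are hypothetical helper names, and CSV is just one possible output format.

```python
import csv
import io


def table_to_rows(table):
    """Convert a table-like object into a list of rows of stripped cell text."""
    return [[cell.text.strip() for cell in row.cells] for row in table.rows]


def rows_to_csv(rows):
    """Serialize rows to a CSV string; the csv module handles quoting."""
    buffer = io.StringIO()
    csv.writer(buffer, lineterminator="\n").writerows(rows)
    return buffer.getvalue()
```

With python-docx installed, `rows_to_csv(table_to_rows(doc.tables[0]))` would give the first table of the document as CSV text.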