Agent skills
Skills you can use with AI coding agents, indexed from public GitHub repositories.
-
literature-engineer
Multi-route literature expansion + metadata normalization for evidence-first surveys.
Produces a large candidate pool (`papers/papers_raw.jsonl`, target ≥1200) with stable IDs and provenance, ready for dedupe/rank + citation generation.
**Trigger**: evidence collector, literature engineer, 文献扩充, 多路召回, snowballing, cited by, references, 元信息增强, provenance.
**Use when**: 需要把候选文献扩充到 ≥1200 篇并补齐可追溯 meta(survey pipeline 的 Stage C1,写作前置 evidence)。
**Skip if**: 已经有高质量 `papers/papers_raw.jsonl`(≥1200 且每条都有稳定标识+来源记录)。
**Network**: 可离线(靠 imports);雪崩/在线检索需要网络。
**Guardrail**: 不允许编造论文;每条记录必须带稳定标识(arXiv id / DOI / 可信 URL)和 provenance;不写 output/ prose。
WILLOSCAR/research-units-pipeline-skills 377
-
dedupe-rank
Dedupe and rank a raw paper set (`papers/papers_raw.jsonl`) to produce `papers/papers_dedup.jsonl` and `papers/core_set.csv`.
**Trigger**: dedupe, rank, core set, 去重, 排序, 精选论文, 核心集合.
**Use when**: 检索后需要把广覆盖集合收敛成可管理的 core set(用于 taxonomy/outline/mapping)。
**Skip if**: 已经有人手工整理了稳定的 `papers/core_set.csv`(无需再次 churn)。
**Network**: none.
**Guardrail**: 偏 deterministic;输出应可重复(稳定 paper_id、字段规范)。
WILLOSCAR/research-units-pipeline-skills 377
-
thesis-frontmatter-sync
同步中文毕业论文的非正文部分:摘要、附录、封面、成果、致谢、名单等,确保这些部分与正文口径一致,而不是最后临时补写。
**Trigger**: 摘要同步, frontmatter sync, 附录同步, 封面检查, 非正文维护, 毕业论文摘要.
**Use when**: 正文结构已经比较稳定,需要并行维护非正文部分,避免中英文摘要、封面字段、附录和成果列表与正文脱节。
**Skip if**: 正文主线还不稳定,或者还没有任何可交付版本。
**Network**: none.
**Guardrail**: 不把非正文拖到最后一刻;不让摘要、附录与正文口径脱节。
WILLOSCAR/research-units-pipeline-skills 377
-
tutorial-module-writer
Write the tutorial content (`output/TUTORIAL.md`) from an approved module plan, including exercises and answer outlines.
**Trigger**: write tutorial, tutorial modules, 教程写作, TUTORIAL.md.
**Use when**: tutorial pipeline 的写作阶段(C3),且 `DECISIONS.md` 已记录 HUMAN 对 scope/running example 的批准(C2)。
**Skip if**: module plan 未完成/未批准(先跑 `module-planner`/`exercise-builder` 并通过 Approve C2)。
**Network**: none.
**Guardrail**: 只写已批准范围;保持 running example 一致;每模块包含练习与答案要点。
WILLOSCAR/research-units-pipeline-skills 377
-
thesis-visual-layout-planner
规划中文毕业论文的图表、流程图和版面节奏:先决定哪些图表支撑主线,再做草图、拼版预演与图文映射。
**Trigger**: 图表规划, figure layout, thesis visual plan, 版面预演, 图文映射, mermaid 草图.
**Use when**: 章节结构基本明确,但图表还没有系统安排,或者担心图文节奏、图表密度和版面结构失控。
**Skip if**: 论文不依赖图表支撑主线,或仍在纯结构重构早期。
**Network**: none.
**Guardrail**: 不把图表拖成最后补丁;不先排版再倒推结构;图表服务主线而不是抢主线。
WILLOSCAR/research-units-pipeline-skills 377
-
citation-verifier
Generate and verify BibTeX entries from paper notes, writing `citations/ref.bib` and `citations/verified.jsonl`.
**Trigger**: citation, BibTeX, ref.bib, verified.jsonl, references, 引用, 参考文献.
**Use when**: 已有 `papers/paper_notes.jsonl`,需要为 prose/LaTeX 准备可追溯的引用(每条都有 url/date/title 验证记录)。
**Skip if**: 还没有 paper notes(或本次产出不需要引用/参考文献)。
**Network**: 自动验证通常需要网络;无网络时可先 record,再标注 needs manual verification。
**Guardrail**: 每个 BibTeX entry 必须对应一条 `citations/verified.jsonl` 记录;prose 只能使用已存在于 `citations/ref.bib` 的 citation keys。
WILLOSCAR/research-units-pipeline-skills 377
-
survey-visuals
Draft non-prose visuals artifacts (timeline, figure specs) for a survey, grounded in evidence and using citation keys from `citations/ref.bib`.
**Trigger**: survey visuals, timeline, figures, visuals, 图表, 时间线, figure spec.
**Use when**: survey 的 C4(NO PROSE),已有 outline + claim/evidence + citations,需要先把时间线/图规格落盘。
**Skip if**: 你只关心正文(可跳过);或缺少 `citations/ref.bib`。
**Network**: none.
**Guardrail**: NO PROSE;产物必须具体且可核对(含 citations),禁止遗留 TODO/SCAFFOLD。
WILLOSCAR/research-units-pipeline-skills 377
-
survey-seed-harvest
Identify survey/review papers in a retrieved set and extract taxonomy seeds into `outline/taxonomy.yml` (topics/subtopics/terminology).
**Trigger**: survey seed harvest, taxonomy seeds, 从 survey 提 taxonomy, bootstrap taxonomy.
**Use when**: retrieval/dedup 后想快速从已有 survey/review 论文中提取术语与主题结构,用于加速 `taxonomy-builder`。
**Skip if**: 已经有高质量 taxonomy(或你不想被 survey 既有框架限制)。
**Network**: none.
**Guardrail**: 产物是 seed,必须经 `taxonomy-builder` 重写与对齐 scope;避免生成泛化占位节点。
WILLOSCAR/research-units-pipeline-skills 377
-
subsection-writer
Write survey prose into per-section files under `sections/` so each unit can be QA'd independently before merging.
**Trigger**: subsection writer, per-section writing, split sections, sections/, 分小节写, 按章节拆分写作.
**Use when**: `Approve C2` is recorded and writer packs exist (`outline/writer_context_packs.jsonl`); you want evidence-bounded drafting without a monolithic one-shot draft.
**Skip if**: `DECISIONS.md` approval is missing, or `outline/evidence_drafts.jsonl` / `citations/ref.bib` is missing.
**Network**: none.
**Guardrail**: do not invent facts/citations; no TODO/ellipsis leakage; keep citations subsection- or chapter-scoped; H3 body files and chapter leads must not contain headings.
WILLOSCAR/research-units-pipeline-skills 377
-
paragraph-curator
Structured paragraph curation for C5: **select -> evaluate -> subset -> fuse**, so drafts converge instead of only expanding.
**Trigger**: paragraph curator, curation, select evaluate fuse, paragraph selection, 选段, 评价, 融合, 收敛, 去冗余.
**Use when**: you are in C5, `sections/*.md` exist, and the writing loop drifts toward 'longer by accumulation' (repetition, redundant paragraphs, weak synthesis).
**Skip if**: evidence packs are thin / `evidence-selfloop` is BLOCKED; or you are pre-C2 (NO PROSE).
**Network**: none.
**Guardrail**: do not invent facts; do not add/remove citation keys; do not move citations across subsections; keep section-level claims consistent with `output/ARGUMENT_SKELETON.md# Consistency Contract`.
WILLOSCAR/research-units-pipeline-skills 377
-
redundancy-pruner
Remove repeated boilerplate across sections (methodology disclaimers, generic transitions, repeated summaries) while preserving citations and meaning.
**Trigger**: redundancy, repetition, boilerplate removal, 去重复, 去套话, 合并重复段落.
**Use when**: the draft feels rigid because the same paragraph shape and disclaimer repeats across many subsections.
**Skip if**: you are still drafting major missing sections (finish drafting first).
**Network**: none.
**Guardrail**: do not add/remove citation keys; do not move citations across subsections; do not delete subsection-specific content.
WILLOSCAR/research-units-pipeline-skills 377
-
section-mapper
Map papers from the core set to each outline subsection and write `outline/mapping.tsv` with coverage tracking.
**Trigger**: section mapper, mapping.tsv, coverage, paper-to-section mapping, 论文映射, 覆盖率.
**Use when**: structure 阶段(C2),已有 `papers/core_set.csv` + `outline/outline.yml`,需要确保每小节有足够支持论文再进入 evidence/writing。
**Skip if**: 还没有 outline(先跑 `outline-builder`)或 core set 还没收敛。
**Network**: none.
**Guardrail**: 覆盖率可审计(避免所有小节重复用同几篇);为弱覆盖小节留下明确补救方向(扩 query / 合并小节)。
WILLOSCAR/research-units-pipeline-skills 377
-
extraction-form
Extract study data into a structured table (`papers/extraction_table.csv`) using the protocol’s extraction schema.
**Trigger**: extraction form, extraction table, data extraction, 信息提取, 提取表.
**Use when**: systematic review 在 screening 后进入 extraction(C3),需要把纳入论文按字段落到 CSV 以支持后续 synthesis。
**Skip if**: 还没有 `papers/screening_log.csv` 或 protocol 未锁定。
**Network**: none.
**Guardrail**: 严格按 schema 填字段;不要在此阶段写 narrative synthesis(那是 `synthesis-writer`)。
WILLOSCAR/research-units-pipeline-skills 377
-
research-pipeline-runner
Run this repo’s Units+Checkpoints research pipelines end-to-end (survey/综述/review/调研/教程/系统综述/审稿), with workspaces + checkpoints.
**Trigger**: run pipeline, kickoff, 继续执行, 自动跑, 写一篇, survey/综述/review/调研/教程/系统综述/审稿.
**Use when**: 用户希望端到端跑流程(创建 `workspaces/<name>/`、生成/执行 `UNITS.csv`、遇到 HUMAN checkpoint 停下等待)。
**Skip if**: 用户明确要手工逐条执行(用 `unit-executor`),或你不应自动推进到 prose 阶段。
**Network**: depends on selected pipeline (arXiv/PDF/citation verification may need network; offline import supported where available).
**Guardrail**: 必须尊重 checkpoints(无 Approve 不写 prose);遇到 HUMAN 单元必须停下等待;禁止在 repo root 创建 workspace 工件。
WILLOSCAR/research-units-pipeline-skills 377
-
taxonomy-builder
Build a 2+ level taxonomy (`outline/taxonomy.yml`) from a core paper set and scope constraints, with short descriptions per node.
**Trigger**: taxonomy, taxonomy builder, 分类, 主题树, taxonomy.yml.
**Use when**: survey/snapshot 的结构阶段(NO PROSE),已有 `papers/core_set.csv`,需要生成可映射且读者友好的主题结构。
**Skip if**: 已经有批准过且可映射的 taxonomy(不要无意义重构)。
**Network**: none.
**Guardrail**: 避免泛化占位桶;保持 2+ 层且每节点有具体描述。
WILLOSCAR/research-units-pipeline-skills 377
-
thesis-citation-enhance-review
为中文毕业论文补强并核验引用:找出必须有引文支撑的句子,扩展候选文献,检查引用与论断是否匹配,并回写参考文献与正文引用。
**Trigger**: 引用补强, citation enhance, 文献补充, 引用核验, 毕业论文参考文献检查.
**Use when**: 正文已有一定稳定度,需要系统补足背景、定义、对照工作和关键结论的引用支撑,并检查是否存在误引或漏引。
**Skip if**: 还没有稳定正文,或当前仅在做早期结构重构。
**Network**: optional.
**Guardrail**: 不为了堆引用而堆引用;引用必须与论断匹配;先补对,再补多。
WILLOSCAR/research-units-pipeline-skills 377
-
exercise-builder
Add exercises to each tutorial module (inputs, expected outputs, verification steps) and update `outline/module_plan.yml`.
**Trigger**: exercises, practice, verification checklist, 教程练习, 可验证作业.
**Use when**: 已有模块计划(`outline/module_plan.yml`),需要为每个模块补齐至少 1 个可验证练习以形成 teaching loop。
**Skip if**: 还没有 module plan(先跑 `module-planner`)。
**Network**: none.
**Guardrail**: 每个练习必须包含 expected output + verification steps;避免只给“思考题”无验收。
WILLOSCAR/research-units-pipeline-skills 377
-
claim-matrix-rewriter
Rewrite `outline/claim_evidence_matrix.md` as a projection/index of evidence packs (NO PROSE), so claims/axes are driven by `outline/evidence_drafts.jsonl` rather than outline placeholders.
**Trigger**: claim matrix rewriter, rewrite claim-evidence matrix, evidence-first claim matrix, matrix index, 证据矩阵重写, 从证据包生成矩阵.
**Use when**: `outline/subsection_briefs.jsonl` + `outline/evidence_drafts.jsonl` are ready and you want a clean claim→evidence index for QA/writing.
**Skip if**: `outline/claim_evidence_matrix.md` is already refined and consistent with evidence packs.
**Network**: none.
**Guardrail**: NO PROSE; do not invent facts; only cite keys present in `citations/ref.bib`; if evidence is abstract/title-only, claims must be provisional.
WILLOSCAR/research-units-pipeline-skills 377
-
appendix-table-writer
Curate reader-facing survey tables for the Appendix (clean layout + high information density), using only in-scope evidence and existing citation keys.
**Trigger**: appendix tables, publishable tables, survey tables, reader tables, 附录表格, 可发表表格, 综述表格.
**Use when**: you have C4 artifacts (evidence packs + anchor sheet + citations) and want tables that look like a real survey (not internal logs).
**Skip if**: `outline/tables_appendix.md` already exists and is refined (>=2 tables; citation-backed; no placeholders; not index-y).
**Network**: none.
**Guardrail**: no invented facts; no pipeline jargon; no paragraph cells; use only keys present in `citations/ref.bib`.
WILLOSCAR/research-units-pipeline-skills 377
-
snapshot-writer
Write a 1-page literature snapshot (`output/SNAPSHOT.md`) from a small core set + a bullets-only outline.
**Trigger**: snapshot, literature snapshot, 速览, 48h snapshot, one-page snapshot, SNAPSHOT.md.
**Use when**: 你要在 24-48h 内交付一个“可读的研究速览”(bullet-first,含关键引用),而不是完整 survey。
**Skip if**: 你已经进入 evidence-first survey 写作(有 `outline/evidence_drafts.jsonl` / `citations/ref.bib` / `output/DRAFT.md`),应改用 `subsection-writer`/`prose-writer`。
**Network**: none.
**Guardrail**: 不发明论文/引用;引用只来自 `papers/core_set.csv`(或同 workspace 的候选池);不写长段落(避免“像综述生成器”)。
WILLOSCAR/research-units-pipeline-skills 377
-
synthesis-writer
Synthesize evidence into a structured narrative (`output/SYNTHESIS.md`) grounded in `papers/extraction_table.csv`, including limitations and bias considerations.
**Trigger**: synthesis, evidence synthesis, systematic review writing, 综合写作, SYNTHESIS.md.
**Use when**: systematic review 完成 screening+extraction(含 bias 评估)后进入写作阶段(C4)。
**Skip if**: 还没有 `papers/extraction_table.csv`(或 protocol/screening 尚未完成)。
**Network**: none.
**Guardrail**: 以 extraction table 为证据底座;明确局限性与偏倚;不要在无数据支撑时扩写结论。
WILLOSCAR/research-units-pipeline-skills 377
-
citation-injector
Apply a `citation-diversifier` budget report by injecting *in-scope* citations into an existing draft (NO NEW FACTS), so the run passes the global unique-citation gate without citation dumps.
**Trigger**: citation injector, apply citation budget, inject citations, add citations safely, 引用注入, 按预算加引用, 引用增密.
**Use when**: `output/CITATION_BUDGET_REPORT.md` exists and you need to raise *global* unique citations (or reduce over-reuse) before `draft-polisher` / `pipeline-auditor`.
**Skip if**: you need more papers/citations upstream (fix C1/C2 mapping first), or `citations/ref.bib` is missing.
**Network**: none.
**Guardrail**: NO NEW FACTS; do not invent citations; only inject keys present in `citations/ref.bib`; keep injected citations within each H3’s allowed scope (via the budget report); avoid citation-dump paragraphs (embed cites per work).
WILLOSCAR/research-units-pipeline-skills 377
-
keyword-expansion
Expand and refine search keywords (synonyms, acronyms, exclusions) and update `queries.md`.
**Trigger**: keyword expansion, synonyms, exclusions, queries.md, 关键词扩展, 同义词, 排除词.
**Use when**: 检索覆盖不足/噪声过大,或主题别名很多,需要系统化扩展与收敛检索词。
**Skip if**: `queries.md` 已经能稳定检出覆盖面(无需扩大范围导致后续成本爆炸)。
**Network**: none.
**Guardrail**: 保持可控的 query 数量;明确 exclusions;避免“无限扩展”。
WILLOSCAR/research-units-pipeline-skills 377
-
post-merge-voice-gate
Post-merge paper-voice gate: detect planner-talk / axis-label artifacts introduced during merge (especially from `outline/transitions.md`), then route fixes back to the earliest source.
**Trigger**: post-merge voice gate, merge voice gate, transition leakage, planner talk, 合并后口吻门, 过渡句污染.
**Use when**: `section-merger` has produced `output/DRAFT.md` and you want to ensure merge-injected text won't drag the draft into generator voice before polishing.
**Skip if**: you are still pre-merge (no `output/DRAFT.md`) or you plan to rework structure upstream first.
**Network**: none.
**Guardrail**: analysis-only; do not edit `output/DRAFT.md`; do not invent facts/citations; write only the report + routing.
WILLOSCAR/research-units-pipeline-skills 377