Agent skill

global-reviewer

Global consistency review for survey drafts: terminology, cross-section coherence, and scope/citation hygiene. Writes `output/GLOBAL_REVIEW.md` and (optionally) applies safe edits to `output/DRAFT.md`. **Trigger**: global review, consistency check, coherence audit, 术语一致性, 全局回看, 章节呼应, 拷打 writer. **Use when**: Draft exists and you want a final evidence-first coherence pass before LaTeX/PDF. **Skip if**: You are still changing the outline/mapping/notes (do those first), or prose writing is not approved. **Network**: none. **Guardrail**: Do not invent facts or citations; do not add new citation keys; treat missing evidence as a failure signal.

Stars 377
Forks 25

Install this agent skill to your Project

npx add-skill https://github.com/WILLOSCAR/research-units-pipeline-skills/tree/main/.codex/skills/global-reviewer

SKILL.md

Global Reviewer (survey draft)

Purpose: make the draft read like a coherent paper (not stitched subsections) and make problems auditable.

Role cards (use explicitly)

Consistency Reviewer (auditor)

Mission: find cross-section issues a real reviewer would flag, and route them to the right upstream fix.

Do:

  • Check scope/taxonomy consistency and terminology drift across chapters.
  • Flag underspecified claims (numbers without task/metric/constraint).
  • Treat missing evidence as a failure signal; route upstream.

Avoid:

  • Writing around gaps by adding new claims or citations.

Coherence Editor (bridge finder)

Mission: spot stitched-island structure and front-matter weaknesses that cause it.

Do:

  • Identify where transitions/leads are doing planner talk instead of argument bridges.
  • Flag repeated evidence-policy disclaimers and point to front matter as the single home.

Avoid:

  • Style-only nitpicks that do not change readability or verifiability.

Role prompt: Consistency Reviewer (AI paper reviewer mindset)

text
You are a meticulous reviewer for a survey manuscript.

Your job is to surface cross-section problems that would matter to a real reader/reviewer:
- missing or underspecified evidence for claims
- scope drift and taxonomy inconsistency
- weak front matter (boundary/methodology not stated, so H3s carry repeated disclaimers)
- stitched-island structure (no argument chain across sections)

Constraints:
- do not invent facts or citations
- do not add new citation keys
- treat missing evidence as a failure signal: route upstream instead of writing around it

Output style:
- bullets-first
- actionable, route-to-skill recommendations

This is not “polish for style”. It is a contract check:

  • do claims align to evidence/citations?
  • do sections connect via a consistent lens?
  • does the front matter set the boundary and methodology so H3s can stay content-focused?

Inputs

  • output/DRAFT.md
  • Context (read-only; used to avoid drift):
    • outline/outline.yml
    • outline/taxonomy.yml
    • outline/mapping.tsv
    • outline/claim_evidence_matrix.md
    • citations/ref.bib

Outputs

  • output/GLOBAL_REVIEW.md (bullets-first report; always written)
  • output/DRAFT.md (optional safe edits; only when edits are low-risk)

Non-negotiables

  • No invented facts.
  • No invented citations.
  • Do not add/remove citation keys.
  • Missing evidence is a failure signal: write TODOs and route upstream; do not “write around” gaps.

What this skill owns (and what it does not)

Owns:

  • Cross-section coherence (throughline, definitions, scope)
  • Paper voice integrity (remove planner/pipeline narration where safe)
  • Terminology consistency (canonical term + synonym policy)
  • Claim→evidence hygiene (underspecified numbers, weak citations)

Does not own:

  • Changing the outline structure (route to C2)
  • Adding new sources/citations (route to C1/C4)
  • Strengthening missing evaluation details when notes are thin (route to C3/C4)

Workflow (use the context files explicitly)

  1. Check structure against outline/outline.yml
  • Verify the draft’s major sections and subsection order matches the intended ToC.
  • Identify which H2 is Introduction/Related Work so you can evaluate front-matter duties.
  1. Check scope vocabulary against outline/taxonomy.yml
  • Verify node descriptions and boundaries are consistent with how the draft uses the terms.
  • Flag mixed axes without a rule (model family vs capability vs evaluation).
  1. Check coverage signals via outline/mapping.tsv
  • Spot chapters/subsections that are under-mapped (likely under-cited or hollow).
  • Flag over-reuse of the same papers across many sections (suggests brittle synthesis).
  1. Spot-check claims using outline/claim_evidence_matrix.md
  • Sample 5–10 claims and verify each has plausible evidence fields and citations in the draft.
  • If the matrix is thin or mismatched, route upstream (C3/C4) instead of polishing prose.
  1. Sanity-check citation keys against citations/ref.bib
  • Flag undefined keys or suspicious naming (e.g., “GPT-5”) unless the cited work uses that label.

Report format (required)

output/GLOBAL_REVIEW.md must be bullets-first and contain these headings verbatim (so gates can verify them):

  • ## A. Input integrity / placeholder leakage
  • ## B. Narrative and argument chain
  • ## C. Scope and taxonomy consistency
  • ## D. Citations and verifiability (claim -> evidence)
  • ## E. Tables and structural outputs

Include a top line:

  • - Status: PASS (or - Status: OK) only after all blocking issues are addressed.

What to check (high-value, paper-like)

A. Input integrity / placeholder leakage

Look for:

  • leaked scaffolds (, TODO, “enumerate 2-4 …”, “scope/design space/evaluation practice”)
  • planner talk in transitions or section openers
  • repeated evidence-policy boilerplate inside H3s

Action:

  • If placeholders exist: block and route upstream (do not patch them with “generic prose”).
  • If evidence-policy disclaimer repeats across H3s: move/keep it once in front matter and delete repeats.

B. Narrative and argument chain

Goal: every section does an argument move.

Check:

  • H2 throughline: Introduction defines the boundary and evaluation lens; chapters execute comparisons; Discussion synthesizes cross-cutting risks/gaps.
  • H3 “argument shape”: tension → contrast → evaluation anchor → synthesis → limitation.
  • “Generator voice”: narration templates (This subsection ...) and slide navigation (Next, we ...).

Action (safe edits allowed):

  • Replace navigation sentences with argument bridges (no new facts).

Bad:

  • Next, we move from planning to memory.

Better:

  • Planning specifies how decisions are made; memory determines what information those decisions can reliably condition on under a fixed protocol.

C. Scope and taxonomy consistency

Check:

  • Scope boundary is explicit and consistent (what counts as an “agent” here; what does not).
  • Taxonomy nodes match the paper’s claims (no mixed axes without a rule).
  • No silent drift (e.g., includes lots of multi-agent safety papers when scope is tool-use agents).

Action:

  • If scope drift is structural: route to C2 (tighten outline + mapping).
  • If scope drift is minor: tighten one scope sentence in the front matter (no new citations).

D. Citations and verifiability (claim -> evidence)

Write a small claim-evidence table (5–10 rows):

  • claim | section | citations | evidence_field | evidence_level

Flag:

  • cite dumps and paragraphs with weak/irrelevant citations
  • underspecified quantitative claims (numbers without task/metric/constraint context)
  • ambiguous model naming (e.g., “GPT-5”) unless the cited paper uses that label

Action:

  • If you can clarify context without new facts (e.g., “under a fixed budget/tool access”), do so.
  • Otherwise: mark as TODO and route to C3/C4 (paper notes / evidence packs).

E. Tables and structural outputs

Check:

  • Tables answer a concrete comparison question (schema), not copied outline bullets.
  • Rows contain citations.

Action:

  • If tables are intermediate-only in this pipeline run: ensure the draft does not contain thin “table placeholder” chapters.

Recommended fix order (routing)

When the report finds issues, recommend the smallest fix path:

  • Placeholder leakage / thin packs -> C3/C4 (paper-notesevidence-draftanchor-sheetwriter-context-pack)
  • Section voice/template problems -> C5 local rewrite (writer-selfloop / subsection-polisher / draft-polisher)
  • Citation scope drift -> C2/C4 (section-mapper / evidence-binder) then rewrite the affected sections
  • Global unique citations too low -> citation-diversifiercitation-injector (then draft-polisher)

Safe edits allowed (optional)

If and only if edits are low-risk and do not change citation keys:

  • unify terminology
  • remove slide-like narration and planner talk
  • add 1–2 short argument-bridging transitions between major sections
  • tighten scope statements and conclusion closure

Script

This skill includes a deterministic helper script that generates a gate-compliant output/GLOBAL_REVIEW.md from the current draft and context (no invented facts/citations).

Quick Start

  • python .codex/skills/global-reviewer/scripts/run.py --help
  • python .codex/skills/global-reviewer/scripts/run.py --workspace workspaces/<ws>

All Options

  • --workspace <dir>
  • --unit-id <U###> (optional; for logs)
  • --inputs <semicolon-separated> (rare override; prefer defaults)
  • --outputs <semicolon-separated> (rare override; default writes output/GLOBAL_REVIEW.md)
  • --checkpoint <C#> (optional)

Examples

  • Generate a global review after merging a draft:
    • python .codex/skills/global-reviewer/scripts/run.py --workspace workspaces/<ws>

Freeze policy:

  • If you hand-edit the review and want to freeze it, create output/GLOBAL_REVIEW.refined.ok to prevent overwrites.

Notes:

  • The script does not “write” new survey content; it summarizes integrity/citation/structure signals and re-runs draft quality checks.

Troubleshooting

Issue: review flags missing citations / undefined keys

Fix:

  • Run citation-verifier and ensure citations/ref.bib contains every cited key in output/DRAFT.md.

Issue: review suggests changes that would add new claims

Fix:

  • Convert those into “missing evidence” TODOs instead; this pass must not invent facts or citations.

Expand your agent's capabilities with these related and highly-rated skills.

WILLOSCAR/research-units-pipeline-skills

thesis-compile-review

对中文毕业论文进行编译、warning 分级、模板模式检查、数据与引用复查,并把问题回写成可继续迭代的 review checklist。 **Trigger**: 毕业论文编译检查, thesis compile review, warning 分级, 终稿复查, main.pdf 检查. **Use when**: 论文已经回写到 TeX 交付层,需要确认是否真正达到“可提交”的质量,而不是只做到能编译。 **Skip if**: 还处于中间层重构阶段,`chapters/*.tex` 尚未形成稳定交付稿。 **Network**: none. **Guardrail**: 不在这里重构章节主线;如果发现结构问题,明确回退到上游修复。

377 25
Explore
WILLOSCAR/research-units-pipeline-skills

front-matter-writer

Write the survey's front matter files (Abstract, Introduction, Related Work, Discussion, Conclusion) in paper voice, with high citation density and a single evidence-policy paragraph. **Trigger**: front matter writer, introduction writer, related work writer, abstract writer, discussion writer, conclusion writer, 引言, 相关工作, 摘要, 讨论, 结论. **Use when**: you are in C5 (prose allowed) and need the paper-like shell to stop the draft reading like stitched subsections. **Skip if**: `Approve C2` is missing in `DECISIONS.md`, or `citations/ref.bib` is missing. **Network**: none. **Guardrail**: no invented facts/citations; no pipeline jargon in final prose; no repeated evidence disclaimers; only use keys present in `citations/ref.bib`.

377 25
Explore
WILLOSCAR/research-units-pipeline-skills

thesis-question-list

维护中文毕业论文的 `codex_md/question_list.md`:把本轮问题、边界、优先级、协作方案和验收口径结构化,作为整条 thesis pipeline 的控制面。 **Trigger**: 毕业论文问题清单, thesis question list, 论文修改清单, 本轮目标, 结构问题梳理, review问题整理. **Use when**: 你已经有一批材料或上一轮 review 结果,需要明确这一轮到底修什么、不修什么,并给后续重构与编译复查提供统一入口。 **Skip if**: 当前只是在做一次性局部措辞修改,且没有形成新一轮结构/证据/编译问题。 **Network**: none. **Guardrail**: 不在这里写正文;不把问题单写成长篇散文;每条问题必须可执行、可验收。

377 25
Explore
WILLOSCAR/research-units-pipeline-skills

novelty-matrix

Create a novelty/prior-work matrix comparing the submission’s contributions against related work (overlaps vs deltas). **Trigger**: novelty matrix, prior-work matrix, overlap/delta, 相关工作对比, 新颖性矩阵. **Use when**: peer review 中评估 novelty/positioning,需要把贡献与相关工作逐项对齐并写出差异点证据。 **Skip if**: 缺少 claims(先跑 `claims-extractor`)或你不打算做新颖性定位分析。 **Network**: none (retrieval of additional related work is out-of-scope unless provided). **Guardrail**: 明确 overlap 与 delta;尽量给出可追溯证据来源(来自稿件/引用/作者陈述)。

377 25
Explore
WILLOSCAR/research-units-pipeline-skills

protocol-writer

Write a systematic review protocol into `output/PROTOCOL.md` (databases, queries, inclusion/exclusion, time window, extraction fields). **Trigger**: protocol, PRISMA, systematic review, inclusion/exclusion, 检索式, 纳入排除. **Use when**: systematic review pipeline 的起点(C1),需要先锁定 protocol 再开始 screening/extraction。 **Skip if**: 不是做 systematic review(或 protocol 已经锁定且不允许修改)。 **Network**: none. **Guardrail**: protocol 必须包含可执行的检索与筛选规则;需要 HUMAN 签字后才能进入 screening。

377 25
Explore
WILLOSCAR/research-units-pipeline-skills

rubric-writer

Write a rubric-based peer review report (`output/REVIEW.md`) using extracted claims and evidence gaps (novelty/soundness/clarity/impact). **Trigger**: rubric review, referee report, peer review write-up, 审稿报告, REVIEW.md. **Use when**: peer-review pipeline 的最后阶段(C3),已有 `output/CLAIMS.md` + `output/MISSING_EVIDENCE.md`(以及可选 novelty matrix)。 **Skip if**: 上游产物未就绪(claims/evidence gaps 缺失)或你不打算输出完整审稿报告。 **Network**: none. **Guardrail**: 给可执行建议(actionable feedback),并覆盖 novelty/soundness/clarity/impact;避免泛泛而谈。

377 25
Explore

Didn't find tool you were looking for?

Be as detailed as possible for better results