
global-reviewer

Global consistency review for survey drafts: terminology, cross-section coherence, and scope/citation hygiene. Writes `output/GLOBAL_REVIEW.md` and (optionally) applies safe edits to `output/DRAFT.md`.

**Trigger**: global review, consistency check, coherence audit, 术语一致性 (terminology consistency), 全局回看 (global look-back), 章节呼应 (cross-chapter coherence), 拷打 writer (grill the writer).

**Use when**: A draft exists and you want a final evidence-first coherence pass before LaTeX/PDF.

**Skip if**: You are still changing the outline/mapping/notes (do those first), or prose writing is not approved.

**Network**: none.

**Guardrail**: Do not invent facts or citations; do not add new citation keys; treat missing evidence as a failure signal.


Install this agent skill to your Project

npx add-skill https://github.com/WILLOSCAR/research-units-pipeline-skills/tree/main/.codex/skills/global-reviewer


Global Reviewer (survey draft)

Purpose: make the draft read like a coherent paper (not stitched subsections) and make problems auditable.

Role cards (use explicitly)

Consistency Reviewer (auditor)

Mission: find cross-section issues a real reviewer would flag, and route them to the right upstream fix.

Do:

Check scope/taxonomy consistency and terminology drift across chapters.
Flag underspecified claims (numbers without task/metric/constraint).
Treat missing evidence as a failure signal; route upstream.

Avoid:

Writing around gaps by adding new claims or citations.

Coherence Editor (bridge finder)

Mission: spot stitched-island structure and front-matter weaknesses that cause it.

Do:

Identify transitions and section leads that narrate the plan (planner talk) instead of bridging one argument to the next.
Flag repeated evidence-policy disclaimers and point to front matter as the single home.

Avoid:

Style-only nitpicks that do not change readability or verifiability.

Role prompt: Consistency Reviewer (AI paper reviewer mindset)


You are a meticulous reviewer for a survey manuscript.

Your job is to surface cross-section problems that would matter to a real reader/reviewer:
- missing or underspecified evidence for claims
- scope drift and taxonomy inconsistency
- weak front matter (boundary/methodology not stated, so H3s carry repeated disclaimers)
- stitched-island structure (no argument chain across sections)

Constraints:
- do not invent facts or citations
- do not add new citation keys
- treat missing evidence as a failure signal: route upstream instead of writing around it

Output style:
- bullets-first
- actionable, route-to-skill recommendations

This is not “polish for style”. It is a contract check:

do claims align to evidence/citations?
do sections connect via a consistent lens?
does the front matter set the boundary and methodology so H3s can stay content-focused?

Inputs

output/DRAFT.md
Context (read-only; used to avoid drift):
- outline/outline.yml
- outline/taxonomy.yml
- outline/mapping.tsv
- outline/claim_evidence_matrix.md
- citations/ref.bib

Outputs

output/GLOBAL_REVIEW.md (bullets-first report; always written)
output/DRAFT.md (optional safe edits; only when edits are low-risk)

Non-negotiables

No invented facts.
No invented citations.
Do not add/remove citation keys.
Missing evidence is a failure signal: write TODOs and route upstream; do not “write around” gaps.

What this skill owns (and what it does not)

Owns:

Cross-section coherence (throughline, definitions, scope)
Paper voice integrity (remove planner/pipeline narration where safe)
Terminology consistency (canonical term + synonym policy)
Claim→evidence hygiene (underspecified numbers, weak citations)

Does not own:

Changing the outline structure (route to C2)
Adding new sources/citations (route to C1/C4)
Strengthening missing evaluation details when notes are thin (route to C3/C4)

Workflow (use the context files explicitly)

Check structure against outline/outline.yml

Verify that the draft’s major sections and subsection order match the intended ToC.
Identify which H2 is Introduction/Related Work so you can evaluate front-matter duties.

Check scope vocabulary against outline/taxonomy.yml

Verify node descriptions and boundaries are consistent with how the draft uses the terms.
Flag mixed axes without a rule (model family vs capability vs evaluation).

Check coverage signals via outline/mapping.tsv

Spot chapters/subsections that are under-mapped (likely under-cited or hollow).
Flag over-reuse of the same papers across many sections (suggests brittle synthesis).
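The two coverage signals above can be approximated with a short counting pass. This is a minimal sketch, not the skill's own logic: it assumes `outline/mapping.tsv` has a header row with `section_id` and `paper_id` columns, and both the column names and the thresholds are hypothetical.

```python
import csv
import io
from collections import defaultdict


def mapping_signals(tsv_text, min_papers=3, max_sections=4):
    """Return (under_mapped_sections, over_reused_papers).

    Assumes `section_id` and `paper_id` columns; these names and both
    thresholds are hypothetical, not taken from the skill.
    """
    papers_by_section = defaultdict(set)
    sections_by_paper = defaultdict(set)
    for row in csv.DictReader(io.StringIO(tsv_text), delimiter="\t"):
        papers_by_section[row["section_id"]].add(row["paper_id"])
        sections_by_paper[row["paper_id"]].add(row["section_id"])
    # Sections with too few distinct papers are likely hollow or under-cited.
    under = sorted(s for s, p in papers_by_section.items() if len(p) < min_papers)
    # Papers mapped to many sections suggest brittle synthesis.
    over = sorted(p for p, s in sections_by_paper.items() if len(s) > max_sections)
    return under, over
```

Sections that never appear in the file are invisible to this pass, so the outline should still be compared against the set of mapped section ids.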

Spot-check claims using outline/claim_evidence_matrix.md

Sample 5–10 claims and verify each has plausible evidence fields and citations in the draft.
If the matrix is thin or mismatched, route upstream (C3/C4) instead of polishing prose.

Sanity-check citation keys against citations/ref.bib

Flag undefined keys. Also flag suspicious model naming (e.g., “GPT-5”) unless the cited work itself uses that label.
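An undefined-key check reduces to a set difference between the keys cited in the draft and the keys defined in `citations/ref.bib`. The sketch below assumes Pandoc-style citations such as `[@key]` or `[@a; @b]`; the source does not state the draft's citation syntax, so treat that pattern as an assumption.

```python
import re

# Assumption: Pandoc-style citation groups like [@key] or [@a; @b].
CITE_GROUP_RE = re.compile(r"\[@([^\]]+)\]")
# BibTeX entry header, e.g. "@article{Key2023,".
BIB_ENTRY_RE = re.compile(r"@\w+\s*\{\s*([^,\s]+)\s*,")


def cited_keys(draft_text):
    """Collect every citation key used in the draft."""
    keys = set()
    for group in CITE_GROUP_RE.findall(draft_text):
        for part in group.split(";"):
            key = part.strip().lstrip("@")
            if key:
                keys.add(key)
    return keys


def bib_keys(bib_text):
    """Collect every entry key defined in a BibTeX file."""
    return set(BIB_ENTRY_RE.findall(bib_text))


def undefined_keys(draft_text, bib_text):
    """Keys cited in the draft but missing from the bibliography."""
    return sorted(cited_keys(draft_text) - bib_keys(bib_text))
```

Any key this returns is a routing signal, not something to patch by adding an entry during the review pass.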

Report format (required)

output/GLOBAL_REVIEW.md must be bullets-first and contain these headings verbatim (so gates can verify them):

## A. Input integrity / placeholder leakage
## B. Narrative and argument chain
## C. Scope and taxonomy consistency
## D. Citations and verifiability (claim -> evidence)
## E. Tables and structural outputs

Include a top line:

- Status: PASS (or - Status: OK), set only after all blocking issues are addressed.
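Because the headings must appear verbatim, a gate can compare whole lines. The following is a minimal sketch of such a check; the function name and return shape are hypothetical, and the pipeline's real gate may apply further rules.

```python
REQUIRED_HEADINGS = [
    "## A. Input integrity / placeholder leakage",
    "## B. Narrative and argument chain",
    "## C. Scope and taxonomy consistency",
    "## D. Citations and verifiability (claim -> evidence)",
    "## E. Tables and structural outputs",
]
PASSING_STATUS_LINES = {"- Status: PASS", "- Status: OK"}


def check_report(report_text):
    """Return (missing_headings, has_passing_status) for a review report."""
    lines = [line.rstrip() for line in report_text.splitlines()]
    # Whole-line comparison: a reworded heading counts as missing.
    missing = [h for h in REQUIRED_HEADINGS if h not in lines]
    has_passing_status = any(line in PASSING_STATUS_LINES for line in lines)
    return missing, has_passing_status
```

A report without a passing status line is still well-formed here; it only means blocking issues remain open.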

What to check (high-value, paper-like)

A. Input integrity / placeholder leakage

Look for:

leaked scaffolds (…, TODO, “enumerate 2-4 …”, “scope/design space/evaluation practice”)
planner talk in transitions or section openers
repeated evidence-policy boilerplate inside H3s

Action:

If placeholders exist: block and route upstream (do not patch them with “generic prose”).
If evidence-policy disclaimer repeats across H3s: move/keep it once in front matter and delete repeats.
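A line-by-line scan can surface the scaffold strings listed above before any prose work begins. This is a hedged sketch: the pattern names are hypothetical, and the regexes cover only the examples named in this section.

```python
import re

# Patterns mirror the scaffold examples named above; the labels are hypothetical.
PLACEHOLDER_PATTERNS = {
    "ellipsis": re.compile(r"…|\.\.\."),
    "todo": re.compile(r"\bTODO\b"),
    "enumerate-scaffold": re.compile(r"enumerate\s+\d+\s*[-–]\s*\d+", re.IGNORECASE),
    "slash-scaffold": re.compile(r"scope/design space/evaluation practice", re.IGNORECASE),
}


def find_placeholders(draft_text):
    """Return (line_number, pattern_name) pairs for every leaked scaffold."""
    hits = []
    for lineno, line in enumerate(draft_text.splitlines(), start=1):
        for name, pattern in PLACEHOLDER_PATTERNS.items():
            if pattern.search(line):
                hits.append((lineno, name))
    return hits
```

An ellipsis inside a legitimate quotation will also match, so hits need a human or reviewer-role look before the draft is blocked.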

B. Narrative and argument chain

Goal: every section does an argument move.

Check:

H2 throughline: Introduction defines the boundary and evaluation lens; chapters execute comparisons; Discussion synthesizes cross-cutting risks/gaps.
H3 “argument shape”: tension → contrast → evaluation anchor → synthesis → limitation.
“Generator voice”: narration templates (This subsection ...) and slide navigation (Next, we ...).

Action (safe edits allowed):

Replace navigation sentences with argument bridges (no new facts).

Bad:

Next, we move from planning to memory.

Better:

Planning specifies how decisions are made; memory determines what information those decisions can reliably condition on under a fixed protocol.
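Finding candidates for this rewrite can be partly automated. The sketch below flags sentences that open with the two templates named above ("This subsection ..." and "Next, we ..."); the sentence splitter is deliberately naive, and the rewrite itself stays a manual, no-new-facts edit.

```python
import re

# Openers taken from the "generator voice" examples above.
NAV_OPENER_RE = re.compile(r"^(This (sub)?section\b|Next, we\b)", re.IGNORECASE)


def flag_generator_voice(draft_text):
    """Return sentences that start with a narration or navigation template."""
    # Naive split on sentence-ending punctuation followed by whitespace.
    sentences = re.split(r"(?<=[.!?])\s+", draft_text.strip())
    return [s for s in sentences if NAV_OPENER_RE.match(s)]
```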

C. Scope and taxonomy consistency

Check:

Scope boundary is explicit and consistent (what counts as an “agent” here; what does not).
Taxonomy nodes match the paper’s claims (no mixed axes without a rule).
No silent drift (e.g., the draft covers many multi-agent safety papers when the stated scope is tool-use agents).

Action:

If scope drift is structural: route to C2 (tighten outline + mapping).
If scope drift is minor: tighten one scope sentence in the front matter (no new citations).

D. Citations and verifiability (claim -> evidence)

Write a small claim-evidence table (5–10 rows):

claim | section | citations | evidence_field | evidence_level
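One way to keep the table's columns fixed is to render it from a small record type. This is an illustrative sketch only: the `ClaimRow` class and the pipe-table layout are hypothetical, and any cell values passed in must come from the draft and the matrix, never from the reviewer.

```python
from dataclasses import astuple, dataclass

COLUMNS = ("claim", "section", "citations", "evidence_field", "evidence_level")


@dataclass
class ClaimRow:
    """One audited claim; field order matches COLUMNS."""
    claim: str
    section: str
    citations: str
    evidence_field: str
    evidence_level: str


def render_claim_table(rows):
    """Render rows as a pipe-delimited table with a header and separator."""
    lines = [" | ".join(COLUMNS), " | ".join("---" for _ in COLUMNS)]
    for row in rows:
        # Escape literal pipes so a cell cannot break the column layout.
        cells = [str(cell).replace("|", "\\|") for cell in astuple(row)]
        lines.append(" | ".join(cells))
    return "\n".join(lines)
```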

Flag:

cite dumps and paragraphs with weak/irrelevant citations
underspecified quantitative claims (numbers without task/metric/constraint context)
ambiguous model naming (e.g., “GPT-5”) unless the cited paper uses that label

Action:

If you can clarify context without new facts (e.g., “under a fixed budget/tool access”), do so.
Otherwise: mark as TODO and route to C3/C4 (paper notes / evidence packs).

E. Tables and structural outputs

Check:

Each table answers a concrete comparison question (with an explicit schema) rather than restating outline bullets.
Rows contain citations.

Action:

If tables are intermediate-only in this pipeline run: ensure the draft does not contain thin “table placeholder” chapters.

Recommended fix order (routing)

When the report finds issues, recommend the smallest fix path:

Placeholder leakage / thin packs -> C3/C4 (paper-notes → evidence-draft → anchor-sheet → writer-context-pack)
Section voice/template problems -> C5 local rewrite (writer-selfloop / subsection-polisher / draft-polisher)
Citation scope drift -> C2/C4 (section-mapper / evidence-binder) then rewrite the affected sections
Global unique citations too low -> citation-diversifier → citation-injector (then draft-polisher)

Safe edits allowed (optional)

If and only if edits are low-risk and do not change citation keys:

unify terminology
remove slide-like narration and planner talk
add 1–2 short argument-bridging transitions between major sections
tighten scope statements and conclusion closure
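The "do not change citation keys" condition can be verified mechanically by comparing the key sets before and after an edit. This sketch assumes Pandoc-style `[@key]` citations, which the source does not confirm; adjust the pattern to the draft's actual syntax.

```python
import re
from collections import Counter

# Assumption: citations appear in bracketed groups such as [@a; @b].
GROUP_RE = re.compile(r"\[([^\]]*@[^\]]*)\]")
KEY_RE = re.compile(r"@([\w:-]+)")


def citation_key_counts(text):
    """Count how often each citation key appears in the text."""
    counts = Counter()
    for group in GROUP_RE.findall(text):
        counts.update(KEY_RE.findall(group))
    return counts


def edit_preserves_keys(before, after):
    """True when an edit neither adds nor removes any citation key."""
    return set(citation_key_counts(before)) == set(citation_key_counts(after))
```

Comparing sets allows a sentence to be merged or split; comparing the full counters instead would be the stricter choice if per-key frequency must also stay fixed.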

Script

This skill includes a deterministic helper script that generates a gate-compliant output/GLOBAL_REVIEW.md from the current draft and context (no invented facts/citations).

Quick Start

python .codex/skills/global-reviewer/scripts/run.py --help
python .codex/skills/global-reviewer/scripts/run.py --workspace workspaces/<ws>

All Options

--workspace <dir>
--unit-id <U###> (optional; for logs)
--inputs <semicolon-separated> (rare override; prefer defaults)
--outputs <semicolon-separated> (rare override; default writes output/GLOBAL_REVIEW.md)
--checkpoint <C#> (optional)

Examples

Generate a global review after merging a draft:
- python .codex/skills/global-reviewer/scripts/run.py --workspace workspaces/<ws>

Freeze policy:

If you hand-edit the review and want to freeze it, create output/GLOBAL_REVIEW.refined.ok to prevent overwrites.
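The freeze policy amounts to checking for the marker file before writing. The following is a minimal sketch of that behaviour under the paths named above; `write_review` is a hypothetical helper, not the actual code in `run.py`.

```python
from pathlib import Path


def write_review(workspace, text):
    """Write output/GLOBAL_REVIEW.md unless the freeze marker exists.

    Returns True if the file was written, False if the review is frozen.
    """
    out_dir = Path(workspace) / "output"
    if (out_dir / "GLOBAL_REVIEW.refined.ok").exists():
        return False  # A hand-refined review is frozen; leave it untouched.
    out_dir.mkdir(parents=True, exist_ok=True)
    (out_dir / "GLOBAL_REVIEW.md").write_text(text, encoding="utf-8")
    return True
```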

Notes:

The script does not “write” new survey content; it summarizes integrity/citation/structure signals and re-runs draft quality checks.

Troubleshooting

Issue: review flags missing citations / undefined keys

Fix:

Run citation-verifier and ensure citations/ref.bib contains every cited key in output/DRAFT.md.

Issue: review suggests changes that would add new claims

Fix:

Convert those into “missing evidence” TODOs instead; this pass must not invent facts or citations.

Maintainer

WILLOSCAR Core maintainer

Source details

Full Name: WILLOSCAR/research-units-pipeline-skills
Branch: main
Path in repo: .codex/skills/global-reviewer
Topics: claude-code claude skills codex gpt pipeline research research-paper research-project research-tool tools units vibe vibe-coding vibecoding
