Agent skill
grad-paragraph
Write one survey-quality paragraph from evidence packs (tension → contrast → evaluation anchor → limitation). **Trigger**: grad paragraph, paragraph micro-structure, argument paragraph, 研究生段落, 论证段落, 对比段, 段落写作. **Use when**: you are drafting `sections/S*.md` (H3 body) and want subsection-specific, evidence-bounded prose instead of templates. **Skip if**: evidence packs are missing/incomplete (fix `subsection-briefs`/`evidence-draft`/`evidence-binder` first), or `Approve C2` is not recorded in `DECISIONS.md`. **Network**: none. **Guardrail**: do not invent facts or citations; no placeholders/ellipsis; keep claims conservative when evidence is abstract-level (avoid repeating evidence-mode boilerplate in every paragraph).
Install this agent skill to your Project
npx add-skill https://github.com/WILLOSCAR/research-units-pipeline-skills/tree/main/.codex/skills/grad-paragraph
SKILL.md
Grad Paragraph (survey paragraph micro-skill)
Purpose: produce a single paragraph that reads like real survey prose, not “outline expansion”.
This is a writing micro-skill you can apply repeatedly inside subsection-writer (per H3 file under sections/).
Role cards (use explicitly)
Argument Planner
Mission: decide the paragraph’s tension/contrast/eval/limitation before writing.
Do:
- Write a 4-line plan (kept out of final prose).
- Ensure planned sentences can be grounded in evidence/citations.
Avoid:
- Writing from headings or generic axis labels.
Paragraph Author
Mission: turn the plan into one content-bearing paragraph with embedded citations.
Do:
- Use explicit contrast markers (whereas/in contrast).
- Include at least one evaluation anchor token (task/metric/constraint).
- End with a limitation that changes interpretation.
Avoid:
- Narration and repeated template stems.
Role prompt: Paragraph Author (argument move)
You are writing one paragraph of a technical survey.
Your job is to perform one argument move under evidence:
- tension/question (why this matters here)
- explicit contrast (A vs B; not a list)
- evaluation anchor (task/metric/constraint)
- limitation (what breaks transfer or comparability)
Style:
- natural prose, content-bearing
- no narration (“This paragraph surveys…”)
- no repeated discourse stems across paragraphs
Constraints:
- do not invent facts or citations
- embed citations inside the sentence that needs them
- stay within the subsection’s citation scope
What this paragraph must contain
In one paragraph (typically 4–6 sentences), cover:
- Tension / question: what this paragraph is trying to resolve (subsection-specific).
- Contrast: compare at least two approaches/routes/clusters (A vs B) with explicit contrast words.
- Evaluation anchor: name how comparisons are made (benchmark/dataset/metric/protocol), even if only abstract-level.
- Limitation / verification: state what is uncertain (missing protocol details, incomparable benchmarks, unclear constraints) without turning into boilerplate.
- If you include a number, also include: task type + metric definition + constraint (budget/cost/tool access), and cite it.
Inputs (practical)
outline/subsection_briefs.jsonl(forrq,axes,clusters,paragraph_plan)outline/evidence_drafts.jsonl(for evidence snippets + candidate comparisons)outline/evidence_bindings.jsonl(allowed citations for this H3)citations/ref.bib
Outputs
- One paragraph (4–6 sentences) to paste into the target
sections/S<sub_id>.mdfile. - Optional (when debugging): a 4-line plan (tension/contrast/eval/limitation) kept out of the final prose.
Roles (two-pass is more reliable)
Role A: Argument Planner
Write a 4-line plan before prose:
- Tension sentence (1 line)
- Contrast sentence (1 line; A vs B)
- Evaluation anchor sentence (1 line)
- Limitation sentence (1 line)
Rules:
- Each line should be anchored by at least one citation key you intend to use.
- If evidence is abstract-only, avoid “dominant / clearly / state-of-the-art” style conclusions.
Role B: Writer
Turn the plan into one natural paragraph.
Rules:
- Keep the paragraph subsection-specific (it should not be copy-pastable into other subsections).
- Place citations inside the sentence they support (not only at paragraph end).
- Do not mention pipeline internals (“working claim”, “axes we track”, “verification targets”).
Paper voice (avoid template cadence)
- Keep tone calm and academic; avoid hype words (e.g., “clearly”, “obviously”, “breakthrough”).
- Vary sentence openings; don’t start every paragraph with the same connector (“However/Moreover/Taken together”).
- Avoid explicit labels like
Key takeaway:; let the sentence carry the point. - Prefer concrete nouns + mid-sentence ties (
...; however, ...) over “PPT narration” signposting.
Examples (what to write / what to avoid)
Bad (template narration + vague claims + cite dump):
This subsection surveys how agents use memory. Taken together, these approaches improve performance across tasks [@example2023; @example2024; @example2025].
Why it is bad:
- Starts with narration ("This subsection ...").
- No explicit A-vs-B contrast (reads like a topic list).
- No evaluation anchor (benchmark/metric/protocol is missing).
- Citations are only used as a trailing tag list.
Good (tension -> contrast -> eval anchor -> limitation; citations embedded):
Plan (kept out of final prose):
- Tension: Memory increases capability but makes evaluation and reproducibility harder.
- Contrast: Retrieval-style memory [@example2023] differs from write-heavy episodic memory [@example2024] in what gets stored and when it can be trusted.
- Eval anchor: Results are typically reported on agent benchmarks with success-rate style metrics under tool/budget constraints (state the specific benchmark/metric when available).
- Limitation: Comparisons remain fragile when protocols differ or when memory writes are not logged, so some gains may not transfer.
Paragraph (final prose):
A recurring tension in agent memory is that richer state can expand what the system can do, yet it also complicates evaluation and reproducibility. Retrieval-style designs emphasize selecting and grounding a small working set of relevant context [@example2023], whereas write-heavy episodic approaches accumulate longer-term traces that can change the agent behavior across episodes [@example2024]. These choices often surface in benchmarked evaluations as different failure patterns under fixed tool and budget constraints (e.g., higher success at the cost of more brittle behavior when memory writes are noisy). At the same time, cross-paper comparisons remain limited when protocols are not aligned or when memory writes are not transparently logged, making it unclear which gains reflect memory design versus evaluation artifacts.
Checklist (quick self-audit)
- No
.../…/TODO/ scaffold phrases. - Contains at least one explicit contrast marker:
whereas,however,in contrast,相比,不同于,相较. - Contains at least one evaluation anchor token:
benchmark,dataset,metric,protocol,evaluation,评测,基准,数据集,指标. - Contains at least one limitation/provisional token:
limited,unclear,sensitive,may,缺乏,受限,尚不明确,需要核验. - Citations are real (
[@BibKey]) and subsection-scoped (inoutline/evidence_bindings.jsonl).
Troubleshooting
Issue: the paragraph still reads like a template
Symptom:
- Repeated framing (“Taken together…”, “A useful way to compare…”) across many paragraphs.
Causes:
- You are writing from outline bullets instead of evidence snippets.
Solutions:
- Rebuild the plan from
Concrete comparisons/Failure/limitationsin the evidence pack. - Force one concrete noun per sentence (task/setting/constraint/evaluation artifact), even if you can’t use numbers.
Issue: you can’t write a contrast without guessing
Symptom:
- You only have titles, so you drift into vague statements (“depends on metrics”).
Causes:
- Evidence granularity is too low.
Solutions:
- Push upstream: strengthen
papers/paper_notes.jsonl(abstract/fulltext) and rerunevidence-draft. - If you must proceed, write the paragraph as a question + verification targets, not as a conclusion.
Recommended Agent Skills
Expand your agent's capabilities with these related and highly-rated skills.
thesis-compile-review
对中文毕业论文进行编译、warning 分级、模板模式检查、数据与引用复查,并把问题回写成可继续迭代的 review checklist。 **Trigger**: 毕业论文编译检查, thesis compile review, warning 分级, 终稿复查, main.pdf 检查. **Use when**: 论文已经回写到 TeX 交付层,需要确认是否真正达到“可提交”的质量,而不是只做到能编译。 **Skip if**: 还处于中间层重构阶段,`chapters/*.tex` 尚未形成稳定交付稿。 **Network**: none. **Guardrail**: 不在这里重构章节主线;如果发现结构问题,明确回退到上游修复。
front-matter-writer
Write the survey's front matter files (Abstract, Introduction, Related Work, Discussion, Conclusion) in paper voice, with high citation density and a single evidence-policy paragraph. **Trigger**: front matter writer, introduction writer, related work writer, abstract writer, discussion writer, conclusion writer, 引言, 相关工作, 摘要, 讨论, 结论. **Use when**: you are in C5 (prose allowed) and need the paper-like shell to stop the draft reading like stitched subsections. **Skip if**: `Approve C2` is missing in `DECISIONS.md`, or `citations/ref.bib` is missing. **Network**: none. **Guardrail**: no invented facts/citations; no pipeline jargon in final prose; no repeated evidence disclaimers; only use keys present in `citations/ref.bib`.
thesis-question-list
维护中文毕业论文的 `codex_md/question_list.md`:把本轮问题、边界、优先级、协作方案和验收口径结构化,作为整条 thesis pipeline 的控制面。 **Trigger**: 毕业论文问题清单, thesis question list, 论文修改清单, 本轮目标, 结构问题梳理, review问题整理. **Use when**: 你已经有一批材料或上一轮 review 结果,需要明确这一轮到底修什么、不修什么,并给后续重构与编译复查提供统一入口。 **Skip if**: 当前只是在做一次性局部措辞修改,且没有形成新一轮结构/证据/编译问题。 **Network**: none. **Guardrail**: 不在这里写正文;不把问题单写成长篇散文;每条问题必须可执行、可验收。
novelty-matrix
Create a novelty/prior-work matrix comparing the submission’s contributions against related work (overlaps vs deltas). **Trigger**: novelty matrix, prior-work matrix, overlap/delta, 相关工作对比, 新颖性矩阵. **Use when**: peer review 中评估 novelty/positioning,需要把贡献与相关工作逐项对齐并写出差异点证据。 **Skip if**: 缺少 claims(先跑 `claims-extractor`)或你不打算做新颖性定位分析。 **Network**: none (retrieval of additional related work is out-of-scope unless provided). **Guardrail**: 明确 overlap 与 delta;尽量给出可追溯证据来源(来自稿件/引用/作者陈述)。
protocol-writer
Write a systematic review protocol into `output/PROTOCOL.md` (databases, queries, inclusion/exclusion, time window, extraction fields). **Trigger**: protocol, PRISMA, systematic review, inclusion/exclusion, 检索式, 纳入排除. **Use when**: systematic review pipeline 的起点(C1),需要先锁定 protocol 再开始 screening/extraction。 **Skip if**: 不是做 systematic review(或 protocol 已经锁定且不允许修改)。 **Network**: none. **Guardrail**: protocol 必须包含可执行的检索与筛选规则;需要 HUMAN 签字后才能进入 screening。
rubric-writer
Write a rubric-based peer review report (`output/REVIEW.md`) using extracted claims and evidence gaps (novelty/soundness/clarity/impact). **Trigger**: rubric review, referee report, peer review write-up, 审稿报告, REVIEW.md. **Use when**: peer-review pipeline 的最后阶段(C3),已有 `output/CLAIMS.md` + `output/MISSING_EVIDENCE.md`(以及可选 novelty matrix)。 **Skip if**: 上游产物未就绪(claims/evidence gaps 缺失)或你不打算输出完整审稿报告。 **Network**: none. **Guardrail**: 给可执行建议(actionable feedback),并覆盖 novelty/soundness/clarity/impact;避免泛泛而谈。
Didn't find tool you were looking for?