Agent skill

induct-research

Induct research sources into a research repository. Point at an issue, a single file, a directory of papers, or a URI and the skill reads, annotates, and files structured induction tasks — one per source. Similar to address-issues but for research corpora instead of code backlogs.

View SKILL.md on GitHub Repository

Stars 107

Forks 15

Install this agent skill to your Project

npx add-skill https://github.com/jmagly/aiwg/tree/main/agentic/code/frameworks/research-complete/skills/induct-research

SKILL.md

Induct Research

Process one or more research sources — an issue, a file, a directory of papers, or a URI — and file structured induction tasks into a research repository so nothing gets lost. The analogue of address-issues for research corpora.

Triggers

"induct this paper" → single file induction
"induct the research queue" → batch directory induction
"add these references to the research repo" → URI or file-path induction
"process the research from issue-planner" → induct .aiwg/research/queue/
"induct research into gitea" → named MCP service target
/induct-research <target> → direct invocation

Parameters

`<target>` (required)

What to induct. Three formats accepted:

Format	Example	Behavior
File path	`.aiwg/research/queue/`	Read all `.md` files in the directory
Single file	`.aiwg/research/queue/ref-dapper.md`	Induct one source
URI	`https://arxiv.org/abs/2307.09288`	Fetch and induct the paper at that URL
Directory glob	`papers/*/.pdf`	Induct all matched files recursively
Issue reference	`gitea:roctinam/research#42`	Read the issue body as a research stub

`--repo <dest>` (optional)

Where to file induction tasks. Accepts the same three formats as --induct-research in issue-planner:

Format	Example	Behavior
File path	`--repo .aiwg/research/inducted/`	Write task `.md` files locally
URI	`--repo https://git.integrolabs.net/roctinam/research`	File issues to that Gitea/GitHub/Jira instance
Named MCP	`--repo gitea`	Use `mcp__gitea__issue_write` directly
Named MCP	`--repo codehound`	Register in Hound search index

Falls back to AIWG_RESEARCH_REPO env var if --repo is omitted.

`--dry-run` (optional)

List what would be inducted and where, without writing or filing anything.

`--priority high|medium|low` (optional)

Override the suggested priority for all inducted items. Default: assessed per source.

`--tag <topic>` (optional)

Apply a topic tag to all inducted items. Repeatable: --tag llm --tag evaluation.

`--recursive` (optional)

When target is a directory, recurse into subdirectories. Default: top-level only.

Execution Flow

Phase 1: Source Discovery

Parse <target> — determine input type (file, directory, URI, issue ref)
Collect sources:
- File/directory: glob for .md, .pdf, .txt, .yaml files
- URI: fetch the resource; detect type (paper, doc page, repo, issue)
- Issue reference: fetch issue body and all comments via MCP or CLI
Deduplicate — skip sources already present in the destination repo (if queryable)
Report discovery:

Found 9 sources to induct:
  3 Markdown stubs (.aiwg/research/queue/)
  4 PDF papers (papers/2024/)
  2 URI references
  Skipping 1 (already inducted: REF-042)

Phase 2: Per-Source Analysis

For each source, run a focused analysis agent:

For Markdown stubs (from issue-planner queue files):

Read the stub content and relevance summary
Assess induction priority from context
Assign topic tags from content keywords

For PDFs / full papers:

Extract title, authors, year, abstract
Identify key claims and methodologies
Assess relevance to existing corpus (check .aiwg/research/ for related REF-XXX files)
Assign GRADE quality level (A–D) based on source type and peer-review status

For URIs:

Fetch content (WebFetch)
Classify: paper, blog post, official docs, repo README, specification, news
Extract key points and assess credibility
Determine if full acquisition is needed (call /research-acquire if paper)

For issue references:

Read full issue body and comments
Extract referenced URLs, files, or topics
Treat as a research brief stub

Phase 3: Induction Task Filing

For each analyzed source, file one induction task using the standard template.

Induction task body:

markdown

## Reference Induction

**Source**: <URL, file path, or issue reference>
**Type**: <paper | blog | docs | repo | spec | stub | issue>
**GRADE**: <A | B | C | D | unassessed>
**Priority**: <high | medium | low>
**Tags**: <topic1>, <topic2>

## Summary
<2–3 sentences: what this source covers and why it's relevant>

## Key Claims / Findings
- <Specific claim or finding>
- <Specific claim or finding>
- <Specific claim or finding>

## Relevance to Corpus
<How this relates to existing research — cross-references to REF-XXX if applicable>

## Induction Checklist
- [ ] Read full source
- [ ] Extract key insights as Zettelkasten notes
- [ ] Cross-reference with existing corpus
- [ ] Assign REF-XXX identifier
- [ ] Tag with topic taxonomy
- [ ] Assess with /research-quality
- [ ] Archive with /research-archive (if paper/PDF)
- [ ] Add to citation graph with /research-cite

## Origin
- Surfaced by: <issue-planner | manual | other>
- Surfaced for: <objective or context>
- Induction date: <YYYY-MM-DD>

Filing based on --repo target:

File path: write induct-<slug>.md to destination directory
Gitea URI/MCP: mcp__gitea__issue_write with label research-induction
GitHub URI: gh issue create --label research-induction
Jira URI: REST POST /rest/api/2/issue with issue type Task
Codehound MCP: register URI in search index, create stub document

Phase 4: Summary Report

## Induction Summary

| # | Source | Type | Priority | Filed At |
|---|--------|------|----------|----------|
| 1 | RFC 9110 HTTP Semantics | spec | high | gitea#301 |
| 2 | "Dapper" Google Tracing Paper | paper | high | gitea#302 |
| 3 | opentelemetry.io/docs | docs | medium | gitea#303 |
| 4 | github.com/jaegertracing/jaeger | repo | medium | gitea#304 |
| 5 | arxiv.org/abs/2012.15161 | paper | low | gitea#305 |
...

Inducted: 9
Skipped: 1 (already present)
Destination: gitea:roctinam/research

Next steps:
- /research-acquire <URL> for any paper that needs PDF download
- /research-document to annotate inducted sources
- /research-quality to score GRADE for each inducted item

Target Resolution Logic

resolve_target(target):
  if target starts with "http://" or "https://":
    host = extract_host(target)
    if host matches known_gitea_instances: use mcp__gitea__issue_write
    if host == "github.com": use gh CLI
    if host matches jira pattern: use Jira REST API
    else: fetch as web resource, induct as URI reference

  elif target matches "gitea:<owner>/<repo>#<n>":
    fetch issue via mcp__gitea__issue_read

  elif target is a named MCP service ("gitea", "codehound", "github"):
    use that service's write/register tool directly

  elif target is a file path:
    if path is directory: glob for .md/.pdf/.txt files
    if path is a file: induct single source

Batch Mode — Directory of Papers

When target is a directory, process all supported files:

/induct-research papers/2024/ --repo gitea --tag llm --recursive

⏳ Scanning papers/2024/ (recursive)...
  Found 23 PDF files
  Found 7 Markdown stubs
  Found 2 YAML records
  Deduplicating against gitea:roctinam/research...
    Skipping 4 (already inducted)

⏳ Analyzing 28 sources (parallel agents)...
  ✓ Batch A (7 sources): complete
  ✓ Batch B (7 sources): complete
  ✓ Batch C (7 sources): complete
  ✓ Batch D (7 sources): complete

⏳ Filing 28 induction tasks to gitea:roctinam/research...
✓ Inducted: 28 | Skipped: 4 | Total: 32

Integration with issue-planner

issue-planner --induct-research <target> calls this skill's Phase 3 (filing) logic directly after Phase 2 research synthesis. The references are the URLs and sources discovered during the parallel research pass.

/induct-research can also be invoked standalone to process:

Pre-existing queues: /induct-research .aiwg/research/queue/
Ad-hoc papers: /induct-research https://arxiv.org/abs/2307.09288
Full directories: /induct-research ~/Downloads/papers/ --repo gitea

Composition

induct-research <target>
    │
    ├── Phase 1: Source discovery
    │   ├── File/directory: glob + read
    │   ├── URI: WebFetch + classify
    │   └── Issue ref: mcp__gitea__issue_read or gh CLI
    ├── Phase 2: Per-source analysis (parallel agents)
    │   ├── PDF/paper agent → extract + GRADE
    │   ├── URI agent → classify + credibility
    │   └── Stub agent → parse relevance summary
    ├── Phase 3: Induction task filing
    │   ├── File path → write .md task files
    │   ├── Gitea URI/MCP → mcp__gitea__issue_write
    │   ├── GitHub URI → gh issue create
    │   └── Codehound MCP → register in search index
    └── Phase 4: Summary report

References

@$AIWG_ROOT/agentic/code/frameworks/sdlc-complete/skills/issue-planner/SKILL.md — Calls induct-research during Phase 2b
@$AIWG_ROOT/agentic/code/frameworks/research-complete/skills/research-acquire/SKILL.md — Full PDF acquisition (called for paper URIs)
@$AIWG_ROOT/agentic/code/frameworks/research-complete/skills/research-document/SKILL.md — Annotate inducted sources
@$AIWG_ROOT/agentic/code/frameworks/research-complete/skills/research-quality/SKILL.md — GRADE scoring for inducted items
@$AIWG_ROOT/agentic/code/frameworks/sdlc-complete/skills/address-issues/SKILL.md — Analogous pattern for code issues
@$AIWG_ROOT/agentic/code/addons/aiwg-utils/rules/subagent-scoping.md — Parallel batch analysis constraints

Maintainer

jmagly Core maintainer

Source details

Full Name: jmagly/aiwg
Branch: main
Path in repo: agentic/code/frameworks/research-complete/skills/induct-research
License: MIT License
Topics: claude-code anthropic workflow-automation developer-tools agentic-coding prompt-engineering autonomous-agents multi-agent orchestration sdlc

Featured Tools

Join Our Newsletter

Search for research papers across academic databases

107 15

Explore

Didn't find tool you were looking for?

Search AI Tools

Install this agent skill to your Project

SKILL.md

Induct Research

Triggers

Parameters

<target> (required)

--repo <dest> (optional)

--dry-run (optional)

--priority high|medium|low (optional)

--tag <topic> (optional)

--recursive (optional)

Execution Flow

Phase 1: Source Discovery

Phase 2: Per-Source Analysis

Phase 3: Induction Task Filing

Phase 4: Summary Report

Target Resolution Logic

Batch Mode — Directory of Papers

Integration with issue-planner

Composition

References

Recommended Agent Skills

research-document

research-archive

research-cite

research-provenance

research-quality