Agent skill
examples-auto-run
Run examples:start-all in auto mode with parallel execution, per-script logs, and start/stop helpers.
Install this agent skill to your Project
npx add-skill https://github.com/openai/openai-agents-js/tree/main/.agents/skills/examples-auto-run
SKILL.md
examples-auto-run
What it does
- Runs
pnpm build && pnpm -r build-checkfirst - Runs
pnpm examples:start-allin auto-input mode (interactive prompts are auto-answered, HITL/MCP/apply-patch are auto-approved). - Executes starts in parallel (default concurrency 4) and pipes each start’s stdout/stderr into its own log file under
.tmp/examples-start-logs/. - Provides start/stop/status/logs/tail helpers via
run.sh. - If the Codex session ends (no disown/nohup), the child processes receive SIGHUP and exit;
stopis also available to clean up manually.
Usage
# Start (auto mode, concurrency=4 by default)
.agents/skills/examples-auto-run/scripts/run.sh start [extra args to examples:start-all]
# If you invoke the skill name alone ($examples-auto-run):
# - when `.tmp/examples-rerun.txt` exists and is non-empty, it will run `rerun` automatically
# - otherwise it runs the default `start` command.
# Examples:
.agents/skills/examples-auto-run/scripts/run.sh start --filter basic
.agents/skills/examples-auto-run/scripts/run.sh start --include-server --include-audio
# Check status
.agents/skills/examples-auto-run/scripts/run.sh status
# Stop running job (kills pid from .tmp/examples-auto-run.pid)
.agents/skills/examples-auto-run/scripts/run.sh stop
# List logs (per start script)
.agents/skills/examples-auto-run/scripts/run.sh logs
# Tail latest log
.agents/skills/examples-auto-run/scripts/run.sh tail
.agents/skills/examples-auto-run/scripts/run.sh tail basic__start_hello-world.log
# After a run, build a rerun list from the latest main log (auto-skip list is imported from `scripts/run-example-starts.mjs` and server/audio/external skips are honored)
.agents/skills/examples-auto-run/scripts/run.sh collect
# Rerun only the entries in .tmp/examples-rerun.txt
.agents/skills/examples-auto-run/scripts/run.sh rerun
# Show the current auto-skip list (env or defaults)
.agents/skills/examples-auto-run/scripts/run.sh start --print-auto-skip --dry-run
Defaults (overridable via env)
EXAMPLES_INTERACTIVE_MODE=autoAUTO_APPROVE_MCP=1,APPLY_PATCH_AUTO_APPROVE=1,AUTO_APPROVE_HITL=1(set in runner)EXAMPLES_CONCURRENCY=4EXAMPLES_EXECA_TIMEOUT_MS=300000(5m)
financial-research-agentandcomputer-useuse 10m inside the script.- Includes interactive; excludes server/audio/external by default:
EXAMPLES_INCLUDE_INTERACTIVE=1EXAMPLES_INCLUDE_SERVER=0EXAMPLES_INCLUDE_AUDIO=0
EXAMPLES_INCLUDE_EXTERNAL=0- This means
realtime-*/nextjs(tagged as server/audio) are skipped unless you opt in with--include-server/--include-audioor the corresponding env flags.
- This means
- Auto-skip list:
EXAMPLES_AUTO_SKIP(comma/space separated) overrides the built-in defaults used by bothrun.shandrun-example-starts.mjs. Defaults includeagent-patterns:start:llm-as-a-judge,agent-patterns:start:routing,customer-service:start,connectors:start,mcp:start:hosted-mcp-on-approval,mcp:start:hosted-mcp-human-in-the-loop.
Cancellation / cleanup
- Jobs are backgrounded but not disowned; if Codex suspends/ends the shell, the process group gets SIGHUP and stops.
- Manual cleanup:
run.sh stop(removes stale pid if already exited).
Log locations
.tmp/examples-start-logs/<package>__<script>.log(per start)- Main runner log path is printed when
startis invoked. - Rerun list (generated by
collect):.tmp/examples-rerun.txt(onepackage:scriptper line).
Notes
- Auto-skip is centralized (same defaults as above) and can be overridden via
EXAMPLES_AUTO_SKIP. Auto-skip entries are excluded from rerun collection and will be removed from rerun execution automatically. - Auto-input map covers common interactive prompts; HITL/MCP/apply-patch auto-approve via env is enabled by the runner.
- Shell tool approvals are auto-approved in auto mode (
SHELL_AUTO_APPROVE=1). rerunruns entries sequentially, continues after failures, and rewrites.tmp/examples-rerun.txtwith only the remaining failures. Auto-skip entries are not re-added.- Behavioral validation is not done in the runner, so Codex must immediately perform it after every
startorreruninvocation without waiting for the user to ask. Required steps:- Read the example source to infer intended flow from code/comments (tools invoked, expected outputs, guards, approvals).
- Read the matching log under
.tmp/examples-start-logs/. - Compare intent vs. log: confirm key actions/results happened; flag omissions or divergences.
- Do this for all exit-0 entries, not just samples.
- Summarize findings right after the run completes; when “OK”, note what was checked (e.g., “tools called + final message emitted”).
- When reporting, do not omit or ellipsize outputs that justify the validation; include the full relevant lines (keep it concise but untruncated).
- The runner prints a full table after the summary: one row per start script with
status,package:script,info(reason/exit/skipped), and the log path. If the run stops before the table appears, point the analyzer at the latestmain_*.logto reconstruct a table and validations.
Recommended Agent Skills
Expand your agent's capabilities with these related and highly-rated skills.
openai-knowledge
Use when working with the OpenAI API (Responses API) or OpenAI platform features (tools, streaming, Realtime API, auth, models, rate limits, MCP) and you need authoritative, up-to-date documentation (schemas, examples, limits, edge cases). Prefer the OpenAI Developer Documentation MCP server tools when available; otherwise guide the user to enable `openaiDeveloperDocs`.
final-release-review
Perform a release-readiness review by locating the previous release tag from remote tags and auditing the diff (e.g., v1.2.3...<commit>) for breaking changes, regressions, improvement opportunities, and risks before releasing openai-agents-js.
implementation-strategy
Decide how to implement runtime and API changes in openai-agents-js before editing code. Use when a task changes exported APIs, runtime behavior, schemas, tests, or docs and you need to choose the compatibility boundary, whether shims or migrations are warranted, and when unreleased interfaces can be rewritten directly.
docs-sync
Analyze main branch implementation and configuration to find missing, incorrect, or outdated documentation in docs/. Use when asked to audit doc coverage, sync docs with code, or propose doc updates/structure changes. Only update English docs (docs/src/content/docs/**) and never touch translated docs under docs/src/content/docs/ja, ko, or zh. Provide a report and ask for approval before editing docs.
pnpm-upgrade
Keep pnpm current: run pnpm self-update/corepack prepare, align packageManager in package.json, and bump pnpm/action-setup + pinned pnpm versions in .github/workflows to the latest release. Use this when refreshing the pnpm toolchain manually or in automation.
runtime-behavior-probe
Plan and execute runtime-behavior investigations with temporary TypeScript probe scripts, validation matrices, state controls, and findings-first reports. Use only when the user explicitly invokes this skill to verify actual runtime behavior beyond normal code-level checks, especially to uncover edge cases, undocumented behavior, or common failure modes in local or live integrations. A baseline smoke check is fine as an entry point, but do not stop at happy-path confirmation.
Didn't find tool you were looking for?