Agent skill
iii-observability
Integrates OpenTelemetry tracing, metrics, and logging into iii workers. Use when setting up distributed tracing, Prometheus metrics, custom spans, or connecting to observability backends.
Install this agent skill to your Project
npx add-skill https://github.com/iii-hq/iii/tree/main/skills/iii-observability
SKILL.md
Observability
Comparable to: Datadog, Grafana, Honeycomb, Jaeger
Key Concepts
Use the concepts below when they fit the task. Not every worker needs custom spans or metrics.
- Built-in OpenTelemetry support across all SDKs — every function invocation is automatically traced
- The engine exports traces, metrics, and logs via OTLP to any compatible collector
- Workers propagate W3C trace context automatically across function invocations
- Prometheus metrics are exposed on port 9464
registerWorker()withotelconfig enables telemetry per worker- Custom spans via
withSpan(name, opts, fn)wrap async work with trace context - Custom metrics via
getMeter()create counters and histograms
Architecture
The worker SDK generates spans, metrics, and logs during function execution. These flow to the engine, which exports them via OTLP to a collector (Jaeger, Grafana, Datadog). The engine also exposes a Prometheus endpoint on port 9464 for scraping.
iii Primitives Used
| Primitive | Purpose |
|---|---|
registerWorker(url, { otel }) |
Connect worker with telemetry config |
withSpan(name, opts, fn) |
Create a custom trace span |
getTracer() |
Access OpenTelemetry Tracer directly |
getMeter() |
Access OpenTelemetry Meter for custom metrics |
currentTraceId() |
Get active trace ID for correlation |
injectTraceparent() |
Inject W3C trace context into outbound calls |
onLog(callback, { level }) |
Subscribe to log events |
shutdown_otel() |
Graceful shutdown of telemetry pipeline |
Reference Implementation
See ../references/observability.js for the full working example — a worker with custom spans,
Also available in Python: ../references/observability.py
Also available in Rust: ../references/observability.rs metrics counters, trace propagation, and log subscriptions connected to an OTel collector.
Common Patterns
Code using this pattern commonly includes, when relevant:
registerWorker('ws://localhost:49134', { otel: { enabled: true, serviceName: 'my-svc' } })— enable telemetrywithSpan('validate-order', {}, async (span) => { span.setAttribute('order.id', id); ... })— custom spangetMeter().createCounter('orders.processed')— custom counter metricgetMeter().createHistogram('request.duration')— custom histogram metriconLog((log) => { ... }, { level: 'warn' })— subscribe to warnings and abovecurrentTraceId()— get active trace ID for correlation with external systemsinjectTraceparent()— propagate trace context to outbound HTTP calls- Disable telemetry:
registerWorker(url, { otel: { enabled: false } })orOTEL_ENABLED=false
Adapting This Pattern
Use the adaptations below when they apply to the task.
- Enable
otelinregisterWorker()config to start collecting traces automatically - Add custom spans around expensive operations (DB queries, LLM calls, external APIs)
- Create domain-specific metrics (orders processed, payment failures, queue depth)
- Use
currentTraceId()to correlate iii traces with external system logs - Configure
iii-observabilityin iii-config.yaml for engine-side exporter, sampling ratio, and alerts - Point the OTLP endpoint at your collector (Jaeger, Grafana Tempo, Datadog Agent)
Engine Configuration
iii-observability must be enabled in iii-config.yaml for engine-side traces, metrics, and logs. See ../references/iii-config.yaml for the full annotated config reference.
Pattern Boundaries
- For engine-side iii-observability YAML configuration, prefer
iii-engine-config. - For SDK init options and function registration, prefer
iii-functions-and-triggers. - Stay with
iii-observabilitywhen the primary problem is SDK-level telemetry: spans, metrics, logs, and trace propagation.
When to Use
- Use this skill when the task is primarily about
iii-observabilityin the iii engine. - Triggers when the request directly asks for this pattern or an equivalent implementation.
Boundaries
- Never use this skill as a generic fallback for unrelated tasks.
- You must not apply this skill when a more specific iii skill is a better fit.
- Always verify environment and safety constraints before applying examples from this skill.
Recommended Agent Skills
Expand your agent's capabilities with these related and highly-rated skills.
iii-dead-letter-queues
Inspects and redrives jobs that exhausted all retries. Use when handling failed queue jobs, debugging processing errors, or implementing retry strategies.
iii-cron-scheduling
Registers cron triggers with 7-field expressions to run functions on recurring schedules. Use when scheduling periodic jobs, timed automation, crontab replacements, cleanup routines, report generation, health checks, batch processing, or any task that should run every N seconds, minutes, hours, or on a weekly/monthly calendar.
iii-http-invoked-functions
Registers external HTTP endpoints as iii functions using registerFunction(id, HttpInvocationConfig). Use when adapting legacy APIs, third-party webhooks, or immutable services into triggerable iii functions, especially when prompts ask for endpoint maps like { path, id } iterated into registerFunction calls.
iii-channels
Binary streaming between workers via channels. Use when building data pipelines, file transfers, streaming responses, or any pattern requiring binary data transfer between functions.
iii-event-driven-cqrs
Implements CQRS with event sourcing on the iii engine. Use when building command/query separation, event-sourced systems, or fan-out architectures where commands publish domain events and multiple read model projections subscribe independently.
iii-agentic-backend
Creates and orchestrates multi-agent pipelines on the iii engine. Use when building AI agent collaboration, agent orchestration, research/review/synthesis chains, or any system where specialized agents hand off work through queues and shared state.
Didn't find tool you were looking for?