Agent skill

ai-task-hub

AI task hub for image analysis, background removal, speech-to-text, text-to-speech, markdown conversion, and async execute/poll/presentation orchestration. Use when users need hosted AI outcomes while host runtime manages identity, credits, payment, and risk control.

Stars 1,878
Forks 294

Install this agent skill to your Project

npx add-skill https://github.com/LeoYeAI/openclaw-master-skills/tree/main/skills/ai-task-hub

Metadata

Additional technical details for this skill

openclaw
{
    "emoji": "\ud83e\udde9",
    "homepage": "https://gateway.binaryworks.app",
    "requires": {
        "env": [
            "AGENT_TASK_TOKEN"
        ],
        "bins": [
            "node"
        ]
    },
    "skillKey": "ai-task-hub",
    "primaryEnv": "AGENT_TASK_TOKEN"
}

SKILL.md

AI Task Hub

Formerly skill-hub-gateway.

Public package boundary:

  • Only orchestrates portal.skill.execute, portal.skill.poll, and portal.skill.presentation.
  • Does not exchange api_key or userToken inside this package.
  • Does not handle recharge or payment flows inside this package.
  • Assumes host runtime injects short-lived task tokens and attachment URLs.

Chinese documentation: SKILL.zh-CN.md

When to Use This Skill

Use this skill when the user asks to:

  • detect people, faces, hands, keypoints, or tags from images
  • remove backgrounds or generate cutout/matting results for products or portraits
  • transcribe uploaded audio into text (speech to text, audio transcription)
  • generate speech from text input (text to speech, voice generation)
  • convert uploaded files into markdown (document to markdown)
  • start async jobs and check status later (poll, check job status)
  • fetch rendered visual outputs such as overlay, mask, and cutout
  • run embedding or reranking tasks for retrieval workflows

Common Requests

Example requests that should trigger this skill:

  • "Detect faces in this image and return bounding boxes."
  • "Tag this image and summarize the main objects."
  • "Remove the background from this product photo."
  • "Create a clean cutout from this portrait image."
  • "Transcribe this meeting audio into text."
  • "Generate speech from this paragraph."
  • "Convert this PDF file into markdown."
  • "Start this job now and let me poll the run status later."
  • "Fetch overlay and mask files for run_456."
  • "Generate embeddings for this text list and rerank the candidates."

Search-Friendly Capability Aliases

  • vision aliases: face detection, human detection, person detection, image tagging
  • background aliases: remove background, background removal, cutout, matting, product-cutout
  • asr aliases: speech to text, audio transcription, transcribe audio
  • tts aliases: text to speech, voice generation, speech synthesis
  • markdown_convert aliases: document to markdown, file to markdown, markdown conversion
  • poll aliases: check job status, poll long-running task, async run status
  • presentation aliases: rendered output, overlay, mask, cutout files
  • embeddings/reranker aliases: vectorization, semantic vectors, relevance reranking

Runtime Contract

Default API base URL: https://gateway-api.binaryworks.app

Action to endpoint mapping:

  • portal.skill.execute -> POST /agent/skill/execute
  • portal.skill.poll -> GET /agent/skill/runs/:run_id
  • portal.skill.presentation -> GET /agent/skill/runs/:run_id/presentation

Auth Contract (Host-Managed)

Every request must include:

  • X-Agent-Task-Token: <jwt_or_paseto>

Recommended token claims:

  • sub (user_id)
  • agent_uid
  • conversation_id
  • scope (execute|poll|presentation)
  • exp
  • jti

CLI argument order for scripts/skill.mjs:

  • [agent_task_token] <action> <payload_json> [base_url]
  • If token arg is omitted, script reads AGENT_TASK_TOKEN from environment.
  • Host runtime should refresh and inject short-lived AGENT_TASK_TOKEN automatically to avoid user-facing auth friction.

Payload Contract

  • portal.skill.execute: payload requires capability and input.
  • payload.request_id is optional and passed through.
  • portal.skill.poll and portal.skill.presentation: payload requires run_id.
  • portal.skill.presentation supports include_files (defaults to true).

Attachment normalization:

  • Prefer explicit image_url / audio_url / file_url.
  • attachment.url is mapped to target media field by capability.
  • Local file_path is disabled in the published package.
  • Host must upload chat attachments first, then pass URL fields.
  • Example host upload endpoint: /api/blob/upload-file.

Error Contract

  • Preserve gateway envelope: request_id, data, error.
  • Preserve POINTS_INSUFFICIENT and pass through error.details.recharge_url.

Bundled Files

  • scripts/skill.mjs
  • scripts/agent-task-auth.mjs
  • scripts/base-url.mjs
  • scripts/attachment-normalize.mjs
  • scripts/telemetry.mjs (compatibility shim)
  • references/capabilities.json
  • references/openapi.json
  • SKILL.zh-CN.md

Expand your agent's capabilities with these related and highly-rated skills.

LeoYeAI/openclaw-master-skills

audit-website

Audit websites for SEO, performance, security, technical, content, and 15 other issue cateories with 230+ rules using the squirrelscan CLI. Returns LLM-optimized reports with health scores, broken links, meta tag analysis, and actionable recommendations. Use to discover and asses website or webapp issues and health.

1,878 294
Explore
LeoYeAI/openclaw-master-skills

firecrawl

Web search and scraping via Firecrawl API. Use when you need to search the web, scrape websites (including JS-heavy pages), crawl entire sites, or extract structured data from web pages. Requires FIRECRAWL_API_KEY environment variable.

1,878 294
Explore
LeoYeAI/openclaw-master-skills

computer-use

Full desktop computer use for headless Linux servers. Xvfb + XFCE virtual desktop with xdotool automation. 17 actions (click, type, scroll, screenshot, drag, etc). Unlike OpenClaw's browser tool, operates at the X11 level so websites cannot detect automation. Includes VNC for live viewing.

1,878 294
Explore
LeoYeAI/openclaw-master-skills

social-media-analyzer

Social media campaign analysis and performance tracking. Calculates engagement rates, ROI, and benchmarks across platforms. Use for analyzing social media performance, calculating engagement rate, measuring campaign ROI, comparing platform metrics, or benchmarking against industry standards.

1,878 294
Explore
LeoYeAI/openclaw-master-skills

business-growth-skills

4 production-ready business and growth skills: customer success manager with health scoring and churn prediction, sales engineer with RFP analysis, revenue operations with pipeline and GTM metrics, and contract & proposal writer. Python tools included (all stdlib-only). Works with Claude Code, Codex CLI, and OpenClaw.

1,878 294
Explore
LeoYeAI/openclaw-master-skills

contract-and-proposal-writer

Contract & Proposal Writer

1,878 294
Explore

Didn't find tool you were looking for?

Be as detailed as possible for better results