Agent skill

capacity

Discovers available Azure OpenAI model capacity across regions and projects. Analyzes quota limits, compares availability, and recommends optimal deployment locations based on capacity requirements. USE FOR: find capacity, check quota, where can I deploy, capacity discovery, best region for capacity, multi-project capacity search, quota analysis, model availability, region comparison, check TPM availability. DO NOT USE FOR: actual deployment (hand off to preset or customize after discovery), quota increase requests (direct user to Azure Portal), listing existing deployments.

View SKILL.md on GitHub Repository

Stars 2,020

Forks 226

Install this agent skill to your Project

npx add-skill https://github.com/microsoft/skills/tree/main/.github/plugins/azure-skills/skills/microsoft-foundry/models/deploy-model/capacity

Metadata

Additional technical details for this skill

author: Microsoft
version: 1.0.0

SKILL.md

Capacity Discovery

Finds available Azure OpenAI model capacity across all accessible regions and projects. Recommends the best deployment location based on capacity requirements.

Quick Reference

Property	Description
Purpose	Find where you can deploy a model with sufficient capacity
Scope	All regions and projects the user has access to
Output	Ranked table of regions/projects with available capacity
Action	Read-only analysis — does NOT deploy. Hands off to preset or customize
Authentication	Azure CLI (`az login`)

When to Use This Skill

✅ User asks "where can I deploy gpt-4o?"
✅ User specifies a capacity target: "find a region with 10K TPM for gpt-4o"
✅ User wants to compare availability: "which regions have gpt-4o available?"
✅ User got a quota error and needs to find an alternative location
✅ User asks "best region and project for deploying model X"

After discovery → hand off to preset or customize for actual deployment.

Scripts

Pre-built scripts handle the complex REST API calls and data processing. Use these instead of constructing commands manually.

Script	Purpose	Usage
`scripts/discover_and_rank.ps1`	Full discovery: capacity + projects + ranking	Primary script for capacity discovery
`scripts/discover_and_rank.sh`	Same as above (bash)	Primary script for capacity discovery
`scripts/query_capacity.ps1`	Raw capacity query (no project matching)	Quick capacity check or version listing
`scripts/query_capacity.sh`	Same as above (bash)	Quick capacity check or version listing

Workflow

Phase 1: Validate Prerequisites

bash

az account show --query "{Subscription:name, SubscriptionId:id}" --output table

Phase 2: Identify Model and Version

Extract model name from user prompt. If version is unknown, query available versions:

powershell

.\scripts\query_capacity.ps1 -ModelName <model-name>

bash

./scripts/query_capacity.sh <model-name>

This lists available versions. Use the latest version unless user specifies otherwise.

Phase 3: Run Discovery

Run the full discovery script with model name, version, and minimum capacity target:

powershell

.\scripts\discover_and_rank.ps1 -ModelName <model-name> -ModelVersion <version> -MinCapacity <target>

bash

./scripts/discover_and_rank.sh <model-name> <version> <min-capacity>

💡 The script automatically queries capacity across ALL regions, cross-references with the user's existing projects, and outputs a ranked table sorted by: meets target → project count → available capacity.

Phase 3.5: Validate Subscription Quota

After discovery identifies candidate regions, validate that the user's subscription actually has available quota in each region. Model capacity (from Phase 3) shows what the platform can support, but subscription quota limits what this specific user can deploy.

powershell

# For each candidate region from discovery results:
$usageData = az cognitiveservices usage list --location <region> --subscription $SUBSCRIPTION_ID -o json 2>$null | ConvertFrom-Json

# Check quota for each SKU the model supports
# Quota names follow pattern: OpenAI.<SKU>.<model-name>
$usageEntry = $usageData | Where-Object { $_.name.value -eq "OpenAI.<SKU>.<model-name>" }

if ($usageEntry) {
  $quotaAvailable = $usageEntry.limit - $usageEntry.currentValue
} else {
  $quotaAvailable = 0  # No quota allocated
}

bash

# For each candidate region from discovery results:
usage_json=$(az cognitiveservices usage list --location <region> --subscription "$SUBSCRIPTION_ID" -o json 2>/dev/null)

# Extract quota for specific SKU+model
quota_available=$(echo "$usage_json" | jq -r --arg name "OpenAI.<SKU>.<model-name>" \
  '.[] | select(.name.value == $name) | .limit - .currentValue')

Annotate discovery results:

Add a "Quota Available" column to the ranked output from Phase 3:

Region	Available Capacity	Meets Target	Projects	Quota Available
eastus2	120K TPM	✅	3	✅ 80K
westus3	90K TPM	✅	1	❌ 0 (at limit)
swedencentral	100K TPM	✅	0	✅ 100K

Regions/SKUs where quotaAvailable = 0 should be marked with ❌ in the results. If no region has available quota, hand off to the quota skill for increase requests and troubleshooting.

Phase 4: Present Results and Hand Off

After the script outputs the ranked table (now annotated with quota info), present it to the user and ask:

🚀 Quick deploy to top recommendation with defaults → route to preset
⚙️ Custom deploy with version/SKU/capacity/RAI selection → route to customize
📊 Check another model or capacity target → re-run Phase 2
❌ Cancel

Phase 5: Confirm Project Before Deploying

Before handing off to preset or customize, always confirm the target project with the user. See the Project Selection rules in the parent router.

If the discovery table shows a sample project for the chosen region, suggest it as the default. Otherwise, query projects in that region and let the user pick.

Error Handling

Error	Cause	Resolution
"No capacity found"	Model not available or all at quota	Hand off to quota skill for increase requests and troubleshooting
Script auth error	`az login` expired	Re-run `az login`
Empty version list	Model not in region catalog	Try a different region: `./scripts/query_capacity.sh <model> "" eastus`
"No projects found"	No AI Services resources	Guide to `project/create` skill or Azure Portal

Related Skills

preset — Quick deployment after capacity discovery
customize — Custom deployment after capacity discovery
quota — For quota viewing, increase requests, and troubleshooting quota errors, defer to this skill instead of duplicating guidance

Maintainer

microsoft Core maintainer

Source details

Full Name: microsoft/skills
Branch: main
Path in repo: .github/plugins/azure-skills/skills/microsoft-foundry/models/deploy-model/capacity
License: MIT License
Topics: agent-skills mcp skills agents azure foundry sdk

Featured Tools

Join Our Newsletter

Stay updated with the latest AI tools, news, and offers by subscribing to our weekly newsletter.

Recommended Agent Skills

Expand your agent's capabilities with these related and highly-rated skills.

microsoft/skills

podcast-generation

Generate AI-powered podcast-style audio narratives using Azure OpenAI's GPT Realtime Mini model via WebSocket. Use when building text-to-speech features, audio narrative generation, podcast creation from content, or integrating with Azure OpenAI Realtime API for real audio output. Covers full-stack implementation from React frontend to Python FastAPI backend with WebSocket streaming.

2,020 226

Explore

microsoft/skills

mcp-builder

Guide for creating high-quality MCP (Model Context Protocol) servers that enable LLMs to interact with external services through well-designed tools. Use when building MCP servers to integrate external APIs or services, whether in Python (FastMCP), Node/TypeScript (MCP SDK), or C#/.NET (Microsoft MCP SDK).

2,020 226

Explore

microsoft/skills

frontend-design-review

Review and create distinctive, production-grade frontend interfaces with high design quality and design system compliance. Evaluates using three pillars: frictionless insight-to-action, quality craft, and trustworthy building. USE FOR: PR reviews, design reviews, accessibility audits, design system compliance checks, creative frontend design, UI code review, component reviews, responsive design checks, theme testing, and creating memorable UI. DO NOT USE FOR: Backend API reviews, database schema reviews, infrastructure or DevOps work, pure business logic without UI, or non-frontend code.

2,020 226

Explore

microsoft/skills

entra-agent-id

Microsoft Entra Agent ID (preview) for creating OAuth2-capable AI agent identities via Microsoft Graph beta API. Covers Agent Identity Blueprints, BlueprintPrincipals, Agent Identities, required permissions, sponsors, and Workload Identity Federation. Includes Microsoft Entra SDK for AgentID (containerized sidecar) for polyglot agent authentication (Docker/Kubernetes), 3P agent integration, autonomous and interactive agent patterns. Triggers: "agent identity", "agent id", "Agent Identity Blueprint", "BlueprintPrincipal", "entra agent", "agent identity provisioning", "Graph agent identity", "entra sidecar", "agent id sidecar", "auth sidecar", "3P agent", "third-party agent identity", "polyglot agent auth".

2,020 226

Explore

microsoft/skills

github-issue-creator

Convert raw notes, error logs, voice dictation, or screenshots into crisp GitHub-flavored markdown issue reports. Use when the user pastes bug info, error messages, or informal descriptions and wants a structured GitHub issue. Supports images/GIFs for visual evidence.

2,020 226

Explore

microsoft/skills

copilot-sdk

Build applications powered by GitHub Copilot using the Copilot SDK. Use when creating programmatic integrations with Copilot across Node.js/TypeScript, Python, Go, or .NET. Covers session management, custom tools, streaming, hooks, MCP servers, BYOK providers, session persistence, custom agents, skills, and deployment patterns. Requires GitHub Copilot CLI installed and a GitHub Copilot subscription (unless using BYOK).

2,020 226

Explore

Didn't find tool you were looking for?

Search AI Tools

Install this agent skill to your Project

Metadata

SKILL.md

Capacity Discovery

Quick Reference

When to Use This Skill

Scripts

Workflow

Phase 1: Validate Prerequisites

Phase 2: Identify Model and Version

Phase 3: Run Discovery

Phase 3.5: Validate Subscription Quota

Phase 4: Present Results and Hand Off

Phase 5: Confirm Project Before Deploying

Error Handling

Related Skills

Recommended Agent Skills

podcast-generation

mcp-builder

frontend-design-review

entra-agent-id

github-issue-creator

copilot-sdk