Agent skill

adf-ml-analytics

This skill should be used when the user asks about "Azure ML batch endpoints in ADF", "Azure OpenAI Batch API pipeline", "ADF ML scoring", "SQL to Storage archival", or needs guidance on AI Services integration, Databricks ML training, or Data Flow feature engineering.

View SKILL.md on GitHub Repository

Stars 22

Forks 3

Install this agent skill to your Project

npx add-skill https://github.com/JosiahSiegel/claude-plugin-marketplace/tree/main/plugins/adf-master/skills/adf-ml-analytics

SKILL.md

Azure Data Factory Machine Learning & Analytics Patterns

Overview

Azure Data Factory orchestrates ML workflows by integrating with Azure Machine Learning, Azure AI Services, Databricks ML, and Azure SQL Database. This skill covers patterns for extracting data from ephemeral sources (like Azure SQL Database), archiving to Azure Storage for long-term analysis, and leveraging ML services for scoring and insights.

Deprecation Notices & Platform Changes (Current March 2026)

Azure AI Foundry -> Microsoft Foundry (November 2025)

At Ignite November 2025, Microsoft renamed Azure AI Foundry to Microsoft Foundry.
Microsoft Foundry is the unified AI platform: agents, workflows, models, and tools under one resource provider.
ADF is positioned as the data orchestration layer within Microsoft Foundry -- handling ingestion, transformation, feature preparation, and downstream consumption by models and agents.
New AI features are primarily landing in Fabric Data Factory (Copilot, natural language pipeline generation). ADF classic remains fully supported but receives fewer new features.

Azure ML SDK v1 - SUPPORT ENDING JUNE 2026

Deprecated: March 31, 2025. Support ends: June 30, 2026 (3 months away).
Impact: AzureMLExecutePipeline activity uses SDK v1 published pipelines. These will stop working after June 2026.
Related SDKs also retiring: azureml-train-core, azureml-pipeline, azureml-pipeline-core, azureml-pipeline-steps.
Migration required: Use Azure ML SDK v2 batch endpoints via WebActivity (see references/azure-ml-patterns.md).
All new projects must use batch endpoints, not published pipelines.

Azure AI Inference SDK - RETIRING MAY 30, 2026

The azure-ai-inference SDK (Python/JS/.NET) is deprecated.
Migrate to the OpenAI SDK using the OpenAI/v1 API, which works with both Azure OpenAI and Microsoft Foundry Models.
This affects any code calling Azure AI model endpoints via the inference SDK.

Azure SQL Edge - RETIRED September 30, 2025

Azure SQL Edge (which included ONNX PREDICT on edge devices) is no longer available.
Migration: Use Azure SQL Managed Instance enabled by Azure Arc for edge SQL scenarios.

Cognitive Services for Power BI Dataflows - RETIRED

Retired: September 15, 2025. AI Insights in Power BI dataflows no longer works.
Alternative: Use ADF WebActivity to call Azure AI Services endpoints directly.

Azure Cognitive Services - REBRANDED

"Azure Cognitive Services" -> "Azure AI Services" -> now part of Microsoft Foundry.
API endpoints remain the same; branding has changed.

Apache Airflow in ADF - DEPRECATED

Deprecated in early 2025 for new customers. Existing deployments continue to function.
Migration: Use Fabric Data Factory, native ADF pipelines, or standalone Airflow deployments.

Integration Patterns Quick Reference

Pattern	Activity Type	Summary	Details
Azure ML (Legacy SDK v1)	AzureMLExecutePipeline	Execute published ML pipelines via SDK v1 linked service. Support ends June 2026 -- migrate to batch endpoints.	See `references/azure-ml-patterns.md`
Azure ML Batch Endpoints (SDK v2)	WebActivity	Recommended approach for batch inference. Submit jobs to batch endpoints via REST, poll for completion with Until loop.	See `references/azure-ml-patterns.md`
Azure ML Online Endpoints	WebActivity	Real-time scoring of individual records or small batches via managed online endpoints with MSI auth.	See `references/azure-ml-patterns.md`
T-SQL PREDICT	SqlServerStoredProcedure	In-database ONNX model scoring. Available on SQL Server 2017+, SQL MI, and Synapse -- not Azure SQL Database.	See `references/sql-archival-patterns.md`
sp_execute_external_script	SqlServerStoredProcedure	Run Python/R scripts inside SQL Managed Instance with ML Services enabled. Good for small-medium datasets.	See `references/sql-archival-patterns.md`
SQL to Storage Archival	Copy (ForEach)	Archive ephemeral SQL data to Parquet in Blob/ADLS Gen2. Includes full-snapshot and incremental watermark patterns.	See `references/sql-archival-patterns.md`
Azure AI Services	WebActivity	Call pre-built AI (sentiment, anomaly detection, vision) via REST. Use Key Vault for API keys. Batch scoring with ForEach.	See `references/ai-services-and-openai-patterns.md`
Azure OpenAI Batch API	WebActivity	LLM scoring at 50% less cost. Upload JSONL, create batch job, poll for completion. Ideal for text classification and enrichment.	See `references/ai-services-and-openai-patterns.md`
Databricks ML	DatabricksJob	Orchestrate ML training and batch scoring via Databricks Jobs with MLflow tracking. Extract from SQL, score, write back.	See `references/databricks-ml-and-e2e-patterns.md`
Data Flow Features	ExecuteDataFlow	Spark-based feature engineering with window functions, derived columns, pivots, and filters before ML scoring.	See `references/databricks-ml-and-e2e-patterns.md`
End-to-End ML Pipeline	ExecutePipeline + Switch	Modular pipeline: archive -> feature engineering -> train or score (Switch activity) using Databricks sub-pipelines.	See `references/databricks-ml-and-e2e-patterns.md`

Best Practices

Data Architecture

Archive first, analyze later - Copy ephemeral SQL data to Storage as Parquet before running ML
Use Parquet format - Columnar format is optimal for ML workloads (compression, column pruning)
Date-partition storage - Use snapshot_date=YYYY-MM-DD partitioning for versioned archives
Separate containers - Use distinct containers for raw archives, features, models, and scores

ML Orchestration

Databricks Job activity for complex ML (training, MLflow, distributed compute)
WebActivity + Azure ML batch endpoints for managed ML inference (SDK v2)
WebActivity + Azure OpenAI Batch API for LLM scoring at 50% cost (text analysis, enrichment)
WebActivity + Azure AI Services for pre-built AI capabilities (NLP, vision, anomaly detection)
Data Flows for feature engineering when Spark-based transformations are needed
Execute Pipeline pattern to modularize archive -> feature -> train -> score steps
T-SQL PREDICT for in-database scoring (SQL Server/Managed Instance/Synapse only -- not Azure SQL Database)

Security

Managed Identity for all Azure service connections (ML workspace, Storage, SQL)
Key Vault for API keys (Azure AI Services, external endpoints)
Never hardcode secrets, connection strings, or API keys in pipeline JSON
Least privilege - Grant only required roles (Blob Data Contributor for storage, ML workspace roles for ML)

Cost Optimization

Use General Purpose compute for Data Flows unless memory-intensive
Databricks serverless compute for variable ML workloads
Set appropriate timeouts on ML activities (training can be long-running)
Batch scoring over real-time when latency allows (cheaper, more efficient)
Incremental extraction from SQL to avoid re-copying unchanged data

Resources

Additional Reference Files

Detailed JSON examples and implementation patterns are in the references/ directory:

references/azure-ml-patterns.md - Azure ML ExecutePipeline (legacy SDK v1), batch endpoints (SDK v2), and online endpoints with complete activity JSON
references/sql-archival-patterns.md - T-SQL PREDICT, sp_execute_external_script, full/incremental SQL archival pipelines, ADLS Gen2 configuration, and storage organization
references/ai-services-and-openai-patterns.md - Azure AI Services (sentiment, anomaly detection), Azure OpenAI Batch API (JSONL upload, job creation, polling), and batch scoring patterns
references/databricks-ml-and-e2e-patterns.md - Databricks ML training/scoring pipelines, Data Flow feature engineering, and end-to-end ML pipeline with Switch activity

Maintainer

JosiahSiegel Core maintainer

Source details

Full Name: JosiahSiegel/claude-plugin-marketplace
Branch: main
Path in repo: plugins/adf-master/skills/adf-ml-analytics
License: MIT License

Featured Tools

Join Our Newsletter

Stay updated with the latest AI tools, news, and offers by subscribing to our weekly newsletter.

Recommended Agent Skills

Expand your agent's capabilities with these related and highly-rated skills.

JosiahSiegel/claude-plugin-marketplace

opentofu-guide

Comprehensive OpenTofu expertise including migration from Terraform, state encryption, OpenTofu 1.10/1.11 features (OCI registry, native S3 locking, ephemeral resources, enabled meta-argument), and CI/CD integration. Covers when to use OpenTofu vs Terraform with decision matrix.

22 3

Explore

JosiahSiegel/claude-plugin-marketplace

terraform-tasks

Specialized Terraform task execution skill for autonomous infrastructure operations. Handles code generation, debugging, version management (1.10-1.14+), security scanning, and architecture design across all providers (AWS 6.0, AzureRM 4.x, GCP) and platforms. Covers ephemeral values, Terraform Stacks, policy-as-code, and 2025 best practices.

22 3

Explore

JosiahSiegel/claude-plugin-marketplace

shellcheck-cicd-2025

ShellCheck validation as non-negotiable 2025 workflow practice

22 3

Explore

JosiahSiegel/claude-plugin-marketplace

bash-master

Expert bash/shell scripting system across ALL platforms. PROACTIVELY activate for: (1) ANY bash/shell script task, (2) System automation, (3) DevOps/CI/CD scripts, (4) Build/deployment automation, (5) Script review/debugging, (6) Converting commands to scripts. Provides: Google Shell Style Guide compliance, ShellCheck validation, cross-platform compatibility (Linux/macOS/Windows/containers), POSIX compliance, security hardening, error handling, performance optimization, testing with BATS, and production-ready patterns. Ensures professional-grade, secure, portable scripts every time.

22 3

Explore

JosiahSiegel/claude-plugin-marketplace

process-substitution-fifos

Process substitution, named pipes (FIFOs), and advanced IPC patterns for efficient bash data streaming (2025)

22 3

Explore

JosiahSiegel/claude-plugin-marketplace

modern-automation-patterns

Modern DevOps and CI/CD automation patterns with containers and cloud (2025)

22 3

Explore

Didn't find tool you were looking for?

Search AI Tools

Install this agent skill to your Project

SKILL.md

Azure Data Factory Machine Learning & Analytics Patterns

Overview

Deprecation Notices & Platform Changes (Current March 2026)

Integration Patterns Quick Reference

Best Practices

Data Architecture

ML Orchestration

Security

Cost Optimization

Resources

Additional Reference Files

Recommended Agent Skills

opentofu-guide

terraform-tasks

shellcheck-cicd-2025

bash-master

process-substitution-fifos

modern-automation-patterns