Agent skills
Model Bias and Fairness

Agent skill

Model Bias and Fairness

Identifying, measuring, and mitigating algorithmic bias to ensure equitable outcomes in AI systems.

View SKILL.md on GitHub Repository

Stars 163

Forks 31

Install this agent skill to your Project

npx add-skill https://github.com/majiayu000/claude-skill-registry/tree/main/skills/data/model-bias-fairness

SKILL.md

Model Bias and Fairness

Overview

Model Bias occurs when an AI system produces results that are systematically prejudiced against certain individuals or groups. Fairness is the practice of ensuring that the model's predictions do not vary unfairly across protected attributes (e.g., race, gender, age).

Core Principle: "Bias is a feature of data, fairness is a requirement of the system."

1. Types of Algorithmic Bias

Bias Type	Description	Example
Historical Bias	Pre-existing prejudice in the world.	Credit scoring models reflecting historical redlining.
Representation Bias	Underrepresentation of certain groups in training data.	Facial recognition failing on darker skin tones.
Measurement Bias	Issues with how data is collected or labeled.	Using "Arrests" as a proxy for "Crime" when certain areas are over-policed.
Algorithmic Bias	The math itself favors a certain outcome.	Maximizing "Total Revenue" might favor high-income zip codes unfairly.

2. Quantitative Fairness Metrics

You cannot manage what you do not measure.

Metric	Goal	Equation
Demographic Parity	Outcome should be equal across groups.	$P(\hat{Y}=1
Equal Opportunity	True Positive Rate should be equal.	$P(\hat{Y}=1
Disparate Impact	Ratio of selection rates.	Success Rate (B) / Success Rate (A) should be > 0.8.

3. Mitigation Strategies

A. Pre-processing (Data Level)

Reweighing: Assigning higher weights to underrepresented samples.
Oversampling: Creating synthetic data for minority groups.

B. In-processing (Model Level)

Adversarial Debiasing: Training a "De-biaser" alongside the model.
Fairness Constraints: Adding a "Fairness Penalty" to the loss function.

C. Post-processing (Prediction Level)

Threshold Moving: Adjusting the binary classification threshold differently for different groups to equalize error rates.

4. Implementation with Fairlearn (Python)

Fairlearn is a popular open-source tool for measuring and mitigating bias.

Measuring Disparity

python

from fairlearn.metrics import MetricFrame, selection_rate
from sklearn.metrics import accuracy_score

# Group by 'gender'
gm = MetricFrame(
    metrics=accuracy_score,
    y_true=y_true,
    y_pred=y_pred,
    sensitive_features=X_test['gender']
)

print(gm.by_group)
print(f"Accuracy Disparity: {gm.difference()}")

Mitigating Bias (Threshold Optimization)

python

from fairlearn.postprocessing import ThresholdOptimizer

optimizer = ThresholdOptimizer(
    estimator=unconstrained_model,
    constraints="equalized_odds"
)

optimizer.fit(X_train, y_train, sensitive_features=X_train['gender'])

5. The Fairness Audit Workflow

Identify Protected Attributes: Define which groups (Gender, Race, Age) must be protected.
Baseline Measurement: Calculate fairness metrics on the current production model.
Root Cause Analysis: Is the bias coming from the dataset size or the labels?
Mitigation Application: Select a technique (e.g., Reweighing).
Trade-off Analysis: Usually, increasing fairness slightly decreases overall accuracy.
Continuous Monitoring: Alerting if fairness metrics drop over time (Bias Drift).

6. Tools Landscape

AIF360 (IBM): Comprehensive library with over 70 fairness metrics.
Fairlearn (Microsoft): Focused on mitigation and visualization.
Google What-If Tool: Interactive dashboard to explore fairness trade-offs without code.
NIST AI RMF: Framework for managing AI risks including bias.

7. Compliance: EU AI Act & NIST

Regulators are increasingly requiring "Fairness Audits" for high-risk AI (Employment, Lending, Law Enforcement).

EU AI Act: Requires data quality and bias documentation for high-risk systems.
NIST AI 100-1: Guidelines for identifying and managing bias.

8. Real-World Scenario: The Recruitment Filter

Scenario: An AI tool for screening resumes was systematically favoring male candidates.
Investigation: The model was trained on 10 years of historical hire data. Historically, the company hired mostly men. The model learned that words like "Captain" and "Competitive" (more common on men's resumes) were high-value features.
Action: Amazon eventually scrapped the tool.
Lesson: If the historical data is biased, the model will faithfully reproduce and amplify that bias.

9. Model Fairness Checklist

Protected Attributes: Have we identified and explicitly tagged sensitive features?
Disparate Impact: Is our selection rate ratio > 0.8 for all groups?
Equal Opportunity: Do we have the same "False Negative" rate for all ethnicities?
Representation: Does our training set match the demographic distribution of our real users?
Trade-off: Have we documented the accuracy cost of our fairness mitigation?
Governance: Has the Ethics Review Board approved the model's fairness report?

Related Skills

44-ai-governance/ai-ethics-compliance
44-ai-governance/model-explainability
43-data-reliability/data-quality-monitoring

Maintainer

majiayu000 Core maintainer

Source details

Full Name: majiayu000/claude-skill-registry
Branch: main
Path in repo: skills/data/model-bias-fairness
License: MIT License

Featured Tools

Join Our Newsletter

Stay updated with the latest AI tools, news, and offers by subscribing to our weekly newsletter.

Recommended Agent Skills

Expand your agent's capabilities with these related and highly-rated skills.

majiayu000/claude-skill-registry

agent-ops-spec

Manage specification documents in .agent/specs/. Use when user provides requirements, acceptance criteria, or feature descriptions that need to be tracked and validated against implementation.

163 31

Explore

majiayu000/claude-skill-registry

agent-ops-state

Maintain .agent state files. Use at session start, after meaningful steps, and before concluding: read/update constitution/memory/focus/issues/baseline consistently.

163 31

Explore

majiayu000/claude-skill-registry

agent-ops-spec

Manage specification documents in .agent/specs/. Use when user provides requirements, acceptance criteria, or feature descriptions that need to be tracked and validated against implementation.

163 31

Explore

majiayu000/claude-skill-registry

agent-ops-testing

Test strategy, execution, and coverage analysis. Use when designing tests, running test suites, or analyzing test results beyond baseline checks.

163 31

Explore

majiayu000/claude-skill-registry

agent-ops-testing

Test strategy, execution, and coverage analysis. Use when designing tests, running test suites, or analyzing test results beyond baseline checks.

163 31

Explore

majiayu000/claude-skill-registry

agent-ops-state

Maintain .agent state files. Use at session start, after meaningful steps, and before concluding: read/update constitution/memory/focus/issues/baseline consistently.

163 31

Explore

Didn't find tool you were looking for?

Search AI Tools

Install this agent skill to your Project

SKILL.md

Model Bias and Fairness

Overview

1. Types of Algorithmic Bias

2. Quantitative Fairness Metrics

3. Mitigation Strategies

A. Pre-processing (Data Level)

B. In-processing (Model Level)

C. Post-processing (Prediction Level)

4. Implementation with Fairlearn (Python)

Measuring Disparity

Mitigating Bias (Threshold Optimization)

5. The Fairness Audit Workflow

6. Tools Landscape

7. Compliance: EU AI Act & NIST

8. Real-World Scenario: The Recruitment Filter

9. Model Fairness Checklist

Related Skills

Recommended Agent Skills

agent-ops-spec

agent-ops-state

agent-ops-spec

agent-ops-testing

agent-ops-testing

agent-ops-state