Agent skill

design-auditor

Comprehensive UI/UX design audit with 12-category scoring, AI slop detection, WCAG accessibility validation, design system compliance, and automated fix recommendations

Stars 71
Forks 21

Install this agent skill to your Project

npx add-skill https://github.com/borghei/Claude-Skills/tree/main/engineering/design-auditor

Metadata

Additional technical details for this skill

tags
design-audit ai-slop color-contrast accessibility
author
borghei
domain
design-engineering
updated
1773792000
version
2.0.0
category
engineering
tech stack
python, css, accessibility, wcag, design-systems
python tools
design_scorer.py, ai_slop_detector.py, color_contrast_checker.py, design_system_validator.py

SKILL.md

Design Auditor

The most comprehensive design audit skill available for AI coding assistants. Combines systematic 12-category UI/UX evaluation, AI slop detection, WCAG accessibility validation, design system compliance checking, and responsive design auditing into a single unified workflow with three independent grades and prioritized fix recommendations.

Keywords

design audit, UI review, UX evaluation, accessibility, WCAG, design system, AI slop detection, color contrast, responsive design, usability heuristics, interaction states, visual hierarchy, typography, spacing, motion design, information architecture, design tokens, cross-device testing

Quick Start

bash
# 1. Run a full design audit (produces 3 grades)
# Prepare audit findings as JSON, then score:
python scripts/design_scorer.py --input findings.json --output report.json

# 2. Detect AI-generated slop in HTML/CSS
python scripts/ai_slop_detector.py --input page.html --css styles.css

# 3. Check color contrast against WCAG
python scripts/color_contrast_checker.py --foreground "#333333" --background "#ffffff"
python scripts/color_contrast_checker.py --input colors.json --level AAA

# 4. Validate design system compliance
python scripts/design_system_validator.py --tokens tokens.json --input styles.css

Core Workflows

Workflow 1: Full Design Audit (12-Category Systematic Review)

Execute a top-to-bottom audit across all 12 categories plus overall coherence. Each category is scored 0-10, weighted, and aggregated into a Design Grade (A-F).

Steps:

  1. Preparation — Gather all screens, components, and interaction flows. Identify the design system (if any) and target platforms.
  2. Visual Pass — Evaluate categories 1-4: Visual Hierarchy, Typography, Color & Contrast, Spacing & Layout Grid.
  3. Interaction Pass — Evaluate categories 5-6: Interaction States, Navigation & Information Architecture.
  4. Platform Pass — Evaluate categories 7-8: Responsive Design, Accessibility (WCAG).
  5. Polish Pass — Evaluate categories 9-10: Motion & Animation, Content & Microcopy.
  6. Integrity Pass — Evaluate categories 11-12: AI Slop Indicators, Performance as Design.
  7. Coherence Assessment — Rate overall design coherence across all categories.
  8. Scoring — Run design_scorer.py to compute weighted grades.
  9. Remediation Plan — Prioritize fixes by severity (Critical > High > Medium > Low > Cosmetic).
bash
python scripts/design_scorer.py --input audit_findings.json --output full_report.json --verbose

Workflow 2: AI Slop Detection

Identify generic, template-driven, or AI-generated UI patterns that lack intentionality and reduce design quality.

Steps:

  1. HTML Analysis — Scan markup for structural slop patterns (generic hero sections, 3-column grids, cookie-cutter pricing tables).
  2. CSS Analysis — Detect visual slop (stock gradients, excessive box-shadows, blur overuse, inconsistent spacing values).
  3. Copy Analysis — Flag generic microcopy, vague CTAs, buzzword density, lorem ipsum residue.
  4. Confidence Scoring — Each finding gets a confidence score (0.0-1.0) indicating how likely it is AI-generated vs intentional design.
  5. Remediation — Provide specific suggestions for making each flagged element more intentional.
bash
python scripts/ai_slop_detector.py --input index.html --css styles.css --threshold 0.6

Workflow 3: Accessibility Compliance Check (WCAG 2.1 AA/AAA)

Systematic validation against WCAG 2.1 success criteria with automated color contrast checking.

Steps:

  1. Color Contrast Audit — Run color_contrast_checker.py against all foreground/background combinations.
  2. Semantic Structure — Verify heading hierarchy, landmark regions, ARIA labels.
  3. Keyboard Navigation — Validate focus order, skip links, focus indicators.
  4. Screen Reader Compatibility — Check alt text, form labels, live regions, error announcements.
  5. Motor Accessibility — Verify touch target sizes (minimum 44x44px), gesture alternatives.
  6. Cognitive Accessibility — Evaluate reading level, consistent navigation, error prevention.
bash
python scripts/color_contrast_checker.py --input color_pairs.json --level AA --suggest-fixes

Workflow 4: Design System Compliance

Validate that implementation matches the project's design token definitions.

Steps:

  1. Token Extraction — Load design tokens (colors, spacing, typography, shadows, radii).
  2. CSS Scanning — Parse stylesheets for hardcoded values that should use tokens.
  3. Deviation Detection — Flag any value not in the token set.
  4. Compliance Scoring — Calculate percentage of values using tokens vs hardcoded.
  5. Migration Plan — Generate token replacement suggestions for each violation.
bash
python scripts/design_system_validator.py --tokens design-tokens.json --input styles.css --output compliance.json

Workflow 5: Responsive & Cross-Device Audit

Validate design across breakpoints and device categories.

Steps:

  1. Breakpoint Analysis — Verify media query coverage (mobile: 320-480, tablet: 481-1024, desktop: 1025+).
  2. Touch Target Validation — Ensure interactive elements meet 44x44px minimum on touch devices.
  3. Content Reflow — Check that content reflows properly without horizontal scrolling at 320px.
  4. Image Responsiveness — Verify srcset usage, art direction, lazy loading.
  5. Typography Scaling — Check fluid typography or appropriate size adjustments per breakpoint.
  6. Navigation Patterns — Validate mobile navigation patterns (hamburger, bottom nav, etc.).

Tools

Tool Purpose Input Output
design_scorer.py Score audit findings across 12 categories, generate 3 grades Audit JSON Scored report JSON
ai_slop_detector.py Detect AI-generated UI patterns in HTML/CSS HTML + CSS files Findings JSON
color_contrast_checker.py WCAG color contrast validation Color pairs Pass/fail + suggestions
design_system_validator.py Design token compliance checking Tokens + CSS Compliance report

12 Audit Categories

1. Visual Hierarchy & Composition (Weight: 10%)

Evaluate how effectively the design guides the user's eye through content.

  • Primary focal point — Is there a clear visual entry point on each screen?
  • Z-pattern / F-pattern — Does content flow match established reading patterns?
  • Size contrast — Do important elements have sufficient size differentiation?
  • Whitespace usage — Is negative space used intentionally to create groupings?
  • Visual weight balance — Is the composition balanced or intentionally asymmetric?

2. Typography System (Weight: 8%)

Assess typographic consistency, hierarchy, and readability.

  • Type scale — Is there a consistent modular scale (e.g., 1.25, 1.333, 1.5)?
  • Heading hierarchy — H1 > H2 > H3 clearly differentiated in size and weight?
  • Body readability — Line height 1.4-1.6, line length 45-75 characters?
  • Font pairing — Maximum 2-3 font families with clear roles?
  • Responsive typography — Font sizes adapt appropriately across breakpoints?

3. Color & Contrast (Weight: 8%)

Validate color system integrity and accessibility compliance.

  • Color palette — Defined primary, secondary, neutral, semantic (error/warning/success/info)?
  • Contrast ratios — All text meets WCAG AA minimum (4.5:1 normal, 3:1 large)?
  • Color independence — Information not conveyed by color alone?
  • Dark mode — If applicable, proper color inversion with maintained contrast?
  • Color consistency — Same semantic meaning for same colors throughout?

4. Spacing & Layout Grid (Weight: 8%)

Assess spatial consistency and grid system usage.

  • Spacing scale — Consistent spacing values (4px/8px base or similar)?
  • Grid system — Defined column grid with consistent gutters?
  • Component spacing — Consistent internal padding and external margins?
  • Alignment — Elements aligned to grid or intentionally offset?
  • Density balance — Appropriate information density for the context?

5. Interaction States (Weight: 10%)

Validate completeness of interactive element states.

  • Default — Clear affordance that element is interactive
  • Hover — Visual feedback on cursor hover (desktop)
  • Focus — Visible focus indicator for keyboard navigation
  • Active/Pressed — Feedback during click/tap
  • Disabled — Clearly non-interactive appearance with reduced opacity
  • Loading — Skeleton screens, spinners, or progress indicators
  • Error — Clear error state with actionable messaging
  • Empty — Meaningful empty states with guidance
  • Success — Confirmation feedback after successful actions

6. Navigation & Information Architecture (Weight: 8%)

Evaluate wayfinding and content organization.

  • Navigation clarity — User always knows where they are and can get where they need
  • Breadcrumbs/indicators — Current location visible in navigation hierarchy
  • Menu organization — Logical grouping with 7-plus-or-minus-2 items per level
  • Search — Available and functional when content volume warrants it
  • Deep linking — Key content reachable within 3 clicks/taps

7. Responsive Design (Weight: 8%)

Validate cross-device adaptation quality.

  • Breakpoint coverage — Smooth transitions at mobile/tablet/desktop breakpoints
  • Content priority — Most important content visible without scrolling on mobile
  • Touch targets — Minimum 44x44px on touch devices with 8px gaps
  • Orientation — Layouts work in both portrait and landscape
  • No horizontal scroll — Content contained at 320px minimum width

8. Accessibility — WCAG (Weight: 12%)

Comprehensive accessibility validation (highest weight category).

  • Perceivable — Alt text, captions, sufficient contrast, text resizing
  • Operable — Keyboard accessible, sufficient time, no seizure triggers, navigable
  • Understandable — Readable, predictable, input assistance
  • Robust — Compatible with assistive technologies, valid markup
  • ARIA usage — Correct roles, states, and properties when needed
  • Focus management — Logical focus order, focus trapping in modals

9. Motion & Animation (Weight: 5%)

Assess animation quality and appropriateness.

  • Purpose — Animations serve functional purpose (feedback, orientation, guidance)
  • Performance — Animations use GPU-accelerated properties (transform, opacity)
  • Duration — Appropriate timing (100-300ms for micro-interactions, 300-500ms for transitions)
  • Easing — Natural easing curves, no linear animations for UI elements
  • Reduced motion — Respects prefers-reduced-motion media query

10. Content & Microcopy (Weight: 5%)

Evaluate written content quality within the UI.

  • Button labels — Action-oriented, specific (not "Submit" or "Click Here")
  • Error messages — Explain what happened and how to fix it
  • Empty states — Guide users toward productive action
  • Tooltips/help — Contextual without overwhelming
  • Consistency — Same terminology throughout (not "delete" in one place, "remove" in another)

11. AI Slop Indicators (Weight: 8%)

Detect patterns suggesting generic AI-generated design without human refinement.

  • Generic hero sections — Full-width hero with centered text over stock gradient
  • Stock gradient backgrounds — Linear gradients with trending color combinations (purple-to-blue, etc.)
  • 3-column feature grids — Icon + heading + paragraph repeated exactly 3 times
  • Meaningless illustrations — Abstract SVG decorations with no semantic connection
  • Cookie-cutter pricing tables — 3-tier layout with highlighted "recommended" plan
  • Generic testimonial layouts — Circular avatar + quote + name/title in a carousel
  • Over-use of shadows/blur — Excessive drop shadows and backdrop blur on every element
  • Inconsistent icon styles — Mixed icon sets (some outlined, some filled, different weights)
  • Lorem ipsum residue — Placeholder text or suspiciously generic content
  • Template section ordering — Hero > Features > Testimonials > Pricing > CTA > Footer

12. Performance as Design (Weight: 5%)

Evaluate how performance impacts the design experience.

  • Perceived performance — Skeleton screens, optimistic UI, progressive loading
  • Image optimization — Proper formats (WebP/AVIF), sizing, lazy loading
  • Font loading — FOUT/FOIT strategy, font-display property
  • Layout stability — No Cumulative Layout Shift (CLS) from late-loading elements
  • Interaction responsiveness — UI responds within 100ms of user action

Overall Coherence Bonus (Weight: 5%)

  • Design unity — All elements feel like they belong to the same design system
  • Brand alignment — Design reflects brand personality and values
  • Consistency — Patterns learned in one area apply throughout
  • Polish level — Attention to detail in edge cases and transitions

Scoring System

Three Independent Grades

Unlike single-grade systems, the Design Auditor produces three separate grades for comprehensive evaluation:

1. Design Grade (A-F)

Weighted aggregate of all 12 categories plus coherence bonus.

Grade Score Range Meaning
A+ 95-100 Exceptional — production-ready, sets new standards
A 90-94 Excellent — minor polish items only
B+ 85-89 Very Good — few notable issues
B 80-84 Good — solid foundation with improvement areas
C+ 75-79 Adequate — functional but needs refinement
C 70-74 Below Average — multiple issues need attention
D 60-69 Poor — significant problems across categories
F 0-59 Failing — fundamental redesign needed

2. AI Slop Grade (A-F)

Based on AI Slop Indicators category (inverted — fewer slop patterns = higher grade).

Grade Meaning
A Highly original — no detectable AI patterns
B Mostly original — 1-2 minor patterns detected
C Mixed — several AI patterns present but customized
D Heavily templated — many generic AI patterns
F Pure slop — design appears entirely AI-generated without refinement

3. Accessibility Grade (A-F)

Based on WCAG compliance level achieved.

Grade Meaning
A+ AAA compliant — exceeds all requirements
A AA compliant — meets all standard requirements
B Mostly AA — 1-3 minor violations
C Partial AA — several violations, core content accessible
D Significant gaps — multiple barriers to access
F Inaccessible — fundamental accessibility failures

AI Slop Detection Patterns

Visual Slop

Pattern Description Confidence
Generic hero section Full-width hero, centered heading, subtitle, CTA over gradient/image 0.7-0.9
Stock gradient Linear gradient with trending colors (purple-blue, pink-orange) 0.5-0.8
3-column feature grid Exactly 3 columns: icon + heading + paragraph 0.6-0.8
Shadow overuse box-shadow on > 60% of card/container elements 0.5-0.7
Blur overuse backdrop-filter: blur on multiple overlapping elements 0.5-0.7
Rounded everything border-radius > 12px on most elements 0.4-0.6

Copy Slop

Pattern Description Confidence
Vague CTAs "Get Started", "Learn More", "Try It Free" without context 0.5-0.7
Buzzword density "Seamless", "Powerful", "Revolutionary", "Next-gen" clustering 0.6-0.8
Generic testimonials Vague praise without specific product/feature references 0.6-0.8
Lorem ipsum residue Placeholder text fragments or suspiciously uniform paragraph lengths 0.8-1.0

Structural Slop

Pattern Description Confidence
Template ordering Hero > Features > Social Proof > Pricing > CTA > Footer 0.5-0.7
Cookie-cutter pricing 3 tiers, middle highlighted, feature checkmark lists 0.6-0.8
Predictable footer 4-column footer with Newsletter, Social, Links, Legal 0.4-0.6

Design Philosophy & Taste Anchors

These 10 principles are not optional guidelines — they are the audit's operating system. Every finding is evaluated against them.

  1. Specificity over vibes — "Clean UI" is banned. Name the font, the spacing scale, the color system. If you cannot name it, it is not a design decision.
  2. Empty states are features — "No items found" is a bug. Empty states guide users to their first action, explain what will appear, and build confidence.
  3. Subtraction default — As little design as possible, but not less. Every element must earn its place. When in doubt, remove it. (Dieter Rams)
  4. Edge cases are user experiences — 47-character names, zero search results, slow networks, stale state. These are not exceptions — they are the product for some users, every time.
  5. Four shadow paths — For every interaction, test: happy path, nil input, empty input, error upstream. If any path produces a blank screen or unhandled state, that is a Critical finding.
  6. Loading states earn trust — Skeleton screens > spinners > blank pages. Progressive disclosure of content signals responsiveness even before data arrives.
  7. Consistency compounds — One off-system color erodes the entire design language. Design tokens are contracts, not suggestions. Every deviation is a finding.
  8. Motion has meaning — Every animation must communicate state change (enter, exit, reorder, feedback). Decorative motion without purpose is noise — and noise is a finding.
  9. Accessibility is not a feature — It is a baseline. WCAG AA is the floor, not the ceiling. Accessibility findings are never "Low" priority.
  10. Performance is perceived design — A 3-second load feels broken regardless of visual quality. LCP, CLS, and INP are design metrics.

Interaction Model & Safety Controls

Fix Session Rules

  • One issue = one question — never batch multiple design issues into a single prompt. Each finding is evaluated and addressed independently.
  • AUTO-FIX (no confirmation): Cosmetic issues — spacing token mismatches, off-system colors swapped to nearest token, minor border-radius corrections
  • ASK (requires confirmation): Structural issues — layout changes, component swaps, new patterns, navigation restructuring
  • Maximum 30 fixes per session — hard stop. After 30 fixes, generate report and exit.
  • Risk accumulator — component file changes (+5 risk), global style changes (+8), layout changes (+10), reverts (+15). Stop if cumulative risk exceeds 20% of budget (100).
  • Revert on regression — if a fix breaks existing visual tests or introduces a new Critical finding, immediately git revert and flag for manual review.

Tool Input/Output Documentation

design_scorer.py — Expected Input Format

json
{
  "categories": {
    "visual_hierarchy": {
      "score": 7.5,
      "findings": [
        {
          "description": "Homepage lacks clear visual entry point",
          "priority": "high",
          "fix": "Establish single primary focal point using size contrast",
          "impact": 7
        }
      ]
    },
    "typography": { "score": 8.0, "findings": [...] },
    "color_contrast": { "score": 6.5, "findings": [...] },
    "spacing_layout": { "score": 7.0, "findings": [...] },
    "interaction_states": { "score": 5.0, "findings": [...] },
    "navigation_ia": { "score": 7.5, "findings": [...] },
    "responsive": { "score": 6.0, "findings": [...] },
    "accessibility": { "score": 5.0, "findings": [...] },
    "motion_animation": { "score": 8.0, "findings": [...] },
    "content_microcopy": { "score": 7.0, "findings": [...] },
    "ai_slop": { "score": 6.0, "findings": [...] },
    "performance_design": { "score": 7.5, "findings": [...] }
  }
}

design_scorer.py — Output

Design Grade:         D (67.2/100)
AI Slop Grade:        C (6.0/10)
Accessibility Grade:  C (5.0/10)

Category Breakdown:
  Visual Hierarchy      10%   7.5/10   1 finding
  Typography             8%   8.0/10   1 finding
  Color & Contrast       8%   6.5/10   2 findings
  Spacing & Layout       8%   7.0/10   1 finding
  Interaction States    10%   5.0/10   3 findings  ← attention
  Navigation & IA        8%   7.5/10   0 findings
  Responsive             8%   6.0/10   2 findings
  Accessibility         12%   5.0/10   4 findings  ← attention
  Motion & Animation     5%   8.0/10   0 findings
  Content & Microcopy    5%   7.0/10   1 finding
  AI Slop Indicators     8%   6.0/10   2 findings
  Performance            5%   7.5/10   1 finding

Recommendations (by impact):
  1. [Critical] Add missing form labels — accessibility
  2. [High] Implement hover/focus/disabled states for all buttons
  3. [High] Fix contrast ratio on secondary text (3.2:1 → 4.5:1)

color_contrast_checker.py — Input/Output

bash
# Single pair
python scripts/color_contrast_checker.py --fg "#333" --bg "#fff" --level AA

# Batch mode from JSON
python scripts/color_contrast_checker.py --input color-pairs.json --suggest-fixes
json
// color-pairs.json input format
[
  {"foreground": "#666666", "background": "#ffffff", "label": "body text"},
  {"foreground": "#999999", "background": "#f5f5f5", "label": "muted text"}
]

design_system_validator.py — Input

bash
python scripts/design_system_validator.py --tokens design-tokens.json --input src/styles/
json
// design-tokens.json format
{
  "colors": {"primary": "#1a73e8", "secondary": "#5f6368", "error": "#d93025"},
  "spacing": [0, 4, 8, 12, 16, 24, 32, 48, 64],
  "typography": {"body": "16px", "h1": "32px", "h2": "24px", "h3": "20px"},
  "radii": [0, 4, 8, 16, 9999]
}

CI/CD Integration

yaml
# .github/workflows/design-audit.yml
name: Design Audit Gate
on:
  pull_request:
    paths:
      - '**/*.css'
      - '**/*.scss'
      - '**/*.html'
      - '**/*.tsx'
      - '**/*.vue'
jobs:
  design-checks:
    runs-on: ubuntu-latest
    steps:
      - uses: actions/checkout@v4
      - name: Color Contrast Check
        run: python scripts/color_contrast_checker.py --input color-pairs.json --level AA
      - name: Design System Compliance
        run: |
          python scripts/design_system_validator.py \
            --tokens design-tokens.json \
            --input src/styles/ \
            --format json > compliance.json
          # Fail if compliance below 90%
          python -c "import json; c=json.load(open('compliance.json')); exit(0 if c.get('compliance_percentage',0)>=90 else 1)"
      - name: AI Slop Detection
        run: |
          python scripts/ai_slop_detector.py --input dist/index.html --threshold 0.7 --format json > slop.json

Fix Priority System

Priority Definition Response Time Examples
Critical Breaks core UX, blocks user tasks, accessibility barrier Immediate Missing form labels, contrast below 2:1, broken navigation
High Degrades experience significantly, affects many users Before launch Inconsistent interaction states, poor mobile layout, missing error states
Medium Creates inconsistency, affects some users Next sprint Off-system colors, spacing violations, minor responsive issues
Low Polish and refinement, minor impact Backlog Animation timing, icon consistency, microcopy improvements
Cosmetic Nice-to-have, no functional impact Optional Pixel alignment, hover animation easing, shadow refinement

Integration Points

  • Code Review — Run design auditor alongside code review for frontend PRs
  • CI/CD — Integrate color_contrast_checker.py and design_system_validator.py into pipelines
  • Design Handoff — Use full audit before developer handoff to catch issues early
  • Sprint Review — Run abbreviated audit (categories 1-8) for sprint demo review
  • Release Gate — Require minimum B grade for Design and A for Accessibility before release
  • Design System Updates — Re-run design_system_validator.py after token updates

References

  • references/design_audit_methodology.md — Systematic audit approach, heuristics, Gestalt principles
  • references/ai_slop_patterns.md — Comprehensive AI pattern catalog with remediation strategies
  • references/accessibility_checklist.md — Complete WCAG 2.1 A/AA/AAA checklist

Assets

  • assets/design_audit_template.md — Ready-to-use audit report template
  • assets/sample_audit_data.json — Sample JSON input for Python tools

Troubleshooting

Problem Cause Solution
design_scorer.py exits with "Missing category" Input JSON is missing one or more of the 12 required category keys under categories Ensure all 12 keys are present: visual_hierarchy, typography, color_contrast, spacing_layout, interaction_states, navigation_ia, responsive, accessibility, motion_animation, content_microcopy, ai_slop, performance_design
Color contrast checker rejects a color value Unsupported color format passed to --foreground or --background Use hex (#RGB/#RRGGBB), rgb(R,G,B), hsl(H,S%,L%), or a named color (black, white, red, etc.)
AI slop detector reports zero findings on a clearly templated page Confidence threshold is set too high, filtering out valid detections Lower --threshold (default 0.5); try 0.3 for a broader sweep
Design system validator finds no violations but CSS clearly has hardcoded values Token JSON is missing the relevant category (e.g., spacing or border_radii) Add all applicable token categories to the tokens file; the validator skips any category with no defined tokens
design_system_validator.py reports "No CSS files found" Path points to a directory but files use .scss extension Pass --glob "*.scss" or --glob "*" to include SCSS files
Scores seem inflated despite many findings Category scores in the input JSON are set too high relative to the findings listed Scores are author-provided (0-10); re-evaluate each category score to accurately reflect the number and severity of findings
Baseline trend comparison shows unexpected "unchanged" across all categories Baseline JSON file has the same scores as the current file Verify you are passing the correct previous audit JSON via --baseline, not a copy of the current file

Success Criteria

  • WCAG AA color contrast compliance rate above 95% across all foreground/background pairs in the project.
  • Design system token compliance rate at or above 90%, with zero off-system color values in production CSS.
  • AI Slop Grade of B or higher (two or fewer high-confidence AI patterns detected above the 0.5 threshold).
  • Overall Design Grade of B (80/100) or higher before release, with no Critical-priority findings remaining open.
  • Accessibility Grade of A (score 8.5+/10), confirming full WCAG 2.1 AA compliance with no perceivable or operable barriers.
  • All interactive elements have complete state coverage: default, hover, focus, active, disabled, loading, error, empty, and success.
  • Zero lorem ipsum or placeholder text residue detected by the AI slop detector in any production build.

Scope & Limitations

Covers:

  • Visual and structural auditing across 12 weighted categories with three independent grades (Design, AI Slop, Accessibility).
  • Automated WCAG 2.1 AA/AAA color contrast validation with compliant color suggestions for failing pairs.
  • Design token compliance checking for colors, spacing, typography, radii, shadows, z-indices, transitions, and breakpoints.
  • AI-generated pattern detection in HTML markup, CSS stylesheets, inline styles, and page copy.

Does NOT cover:

  • Runtime accessibility testing (screen reader behavior, live ARIA state changes, keyboard trap detection) — see senior-qa for end-to-end testing workflows.
  • Performance profiling (LCP, CLS, INP measurement) — see senior-frontend and senior-devops for Lighthouse and Core Web Vitals tooling.
  • User research or usability testing with real participants — see product-team/ux-researcher for research frameworks.
  • Design file inspection (Figma, Sketch, Adobe XD source files) — this skill operates on implemented HTML/CSS output only.

Integration Points

Skill Integration Data Flow
senior-frontend Run design audit on frontend component library after build Frontend CSS/HTML output -> design_system_validator.py + color_contrast_checker.py -> compliance gates
senior-qa Incorporate accessibility and design regression checks into QA pipelines design_scorer.py report JSON -> QA test assertions on grade thresholds
code-reviewer Attach design audit findings to frontend PR reviews ai_slop_detector.py + color_contrast_checker.py findings -> PR comment annotations
senior-devops Gate deployments on minimum design compliance scores in CI/CD design_system_validator.py compliance percentage -> pipeline pass/fail exit code
product-team/ux-researcher Feed audit findings into usability research prioritization design_scorer.py recommendations list -> research backlog items ranked by impact
senior-secops Validate that accessibility compliance meets regulatory requirements (ADA, EAA) Accessibility Grade from design_scorer.py -> compliance evidence artifacts

Tool Reference

design_scorer.py

Purpose: Computes three independent grades (Design A-F, AI Slop A-F, Accessibility A-F) from a 12-category audit findings JSON file. Supports baseline trend comparison and prioritized fix recommendations.

Usage:

bash
python scripts/design_scorer.py --input findings.json
python scripts/design_scorer.py --input findings.json --output report.json --verbose
python scripts/design_scorer.py --input findings.json --baseline previous.json --format text

Flags/Parameters:

Flag Short Required Default Description
--input -i Yes Path to audit findings JSON file
--output -o No stdout Path to write report output
--baseline -b No Path to baseline findings JSON for trend comparison
--format -f No json Output format: json or text
--verbose -v No off Include detailed per-category findings in text output

Example:

bash
python scripts/design_scorer.py --input audit_findings.json --baseline last_sprint.json --format text --verbose

Output Formats:

  • JSON (default) — Full report object with grades, category_breakdown, coherence, recommendations, summary, and optional trend data.
  • Text — Human-readable report with grade summary, category score table, findings summary, top issues, trend comparison (if baseline provided), and all recommendations grouped by priority.
  • Text + Verbose — Text report plus a detailed section listing every finding per category with priority, description, suggested fix, and impact score.

ai_slop_detector.py

Purpose: Analyzes HTML and CSS files for AI-generated UI patterns including stock gradients, generic hero sections, cookie-cutter layouts, buzzword-heavy copy, lorem ipsum residue, and inconsistent icon styles. Each finding receives a confidence score (0.0-1.0).

Usage:

bash
python scripts/ai_slop_detector.py --input page.html
python scripts/ai_slop_detector.py --input page.html --css styles.css --threshold 0.6
python scripts/ai_slop_detector.py --input page.html --format text --output report.txt

Flags/Parameters:

Flag Short Required Default Description
--input -i Yes Path to HTML file to analyze
--css -c No Path to external CSS file (inline and embedded styles are always extracted from HTML)
--threshold -t No 0.5 Confidence threshold (0.0-1.0); findings below this are reported but counted separately
--output -o No stdout Path to write report output
--format -f No json Output format: json or text

Example:

bash
python scripts/ai_slop_detector.py --input dist/index.html --css dist/styles.css --threshold 0.7 --format json

Output Formats:

  • JSON (default) — Report with grade (A-F), summary (total patterns, above/below threshold counts, by-category breakdown), findings sorted by confidence descending, and findings_by_category grouped into visual, structural, and copy.
  • Text — Human-readable report with grade, category totals, and each finding showing confidence bar, type, description, evidence, line number, and remediation suggestion.

color_contrast_checker.py

Purpose: Validates color contrast ratios against WCAG 2.1 AA and AAA standards. Supports hex, rgb, hsl, and named color formats. Can check a single pair or batch-process from a JSON file. Optionally suggests the closest compliant color alternative for failing pairs.

Usage:

bash
# Single pair
python scripts/color_contrast_checker.py --foreground "#333" --background "#fff"

# Batch mode
python scripts/color_contrast_checker.py --input color_pairs.json --level AAA --suggest-fixes

# With text size context
python scripts/color_contrast_checker.py --fg "#666" --bg "#eee" --size large --format text

Flags/Parameters:

Flag Short Required Default Description
--foreground --fg Yes (single mode) Foreground color (hex, rgb, hsl, or named)
--background --bg Yes (single mode) Background color (hex, rgb, hsl, or named)
--size No normal Text size context: normal, large, or ui
--input -i Yes (batch mode) Path to JSON file with color pairs
--level -l No AA WCAG level to check: AA or AAA
--suggest-fixes -s No off Suggest closest compliant foreground and background alternatives for failing pairs
--output -o No stdout Path to write report output
--format -f No json Output format: json or text

Example:

bash
python scripts/color_contrast_checker.py --input brand-colors.json --level AA --suggest-fixes --format text

Output Formats:

  • JSON (default) — Report with accessibility_grade (A+ through F), summary (total pairs, passing, failing, compliance rate), results array with per-pair ratio, pass/fail status across all WCAG levels, and optional suggestions with adjusted foreground/background hex values and their new ratios.
  • Text — Human-readable report with grade, compliance rate, per-pair pass/fail status, ratio details, WCAG level results (AA normal, AA large, AAA normal, AAA large), suggested fixes, and a failing pairs summary.

design_system_validator.py

Purpose: Validates CSS files against a design token JSON definition. Detects hardcoded colors, spacing, font sizes, font families, font weights, line heights, border radii, shadows, breakpoints, z-indices, and transitions that deviate from the token set. Reports compliance percentage and suggests the closest matching token for each violation.

Usage:

bash
python scripts/design_system_validator.py --tokens tokens.json --input styles.css
python scripts/design_system_validator.py --tokens tokens.json --input src/ --glob "*.scss"
python scripts/design_system_validator.py --tokens tokens.json --input styles.css --output report.json --format text

Flags/Parameters:

Flag Short Required Default Description
--tokens -t Yes Path to design tokens JSON file
--input -i Yes Path to CSS file or directory to scan
--glob -g No *.css Glob pattern for filtering files when input is a directory (e.g., *.scss, *)
--output -o No stdout Path to write report output
--format -f No json Output format: json or text

Example:

bash
python scripts/design_system_validator.py --tokens design-tokens.json --input src/styles/ --glob "*" --format text

Output Formats:

  • JSON (default) — Report with compliance_grade (A-F), compliance_rate percentage, summary (total values checked, compliant, violations), category_breakdown with per-category stats, violations_by_category grouped with property/value/line/selector/suggested token, and all_violations flat list.
  • Text — Human-readable report with grade, compliance rate, category breakdown table (total/pass/fail/rate per category), and violations listed per category with line numbers, property values, and suggested token replacements (limited to 20 per category in display).

Expand your agent's capabilities with these related and highly-rated skills.

borghei/Claude-Skills

churn-prevention

SaaS churn reduction covering cancel flow design, dynamic save offers, exit survey architecture, dunning sequences, payment recovery, win-back campaigns, and churn impact modeling.

71 21
Explore
borghei/Claude-Skills

popup-cro

Popup and modal optimization for conversion. Covers exit-intent, slide-ins, banners, timing optimization, frequency capping, audience targeting, compliance, and A/B testing frameworks for lead capture, promotions, and announcements.

71 21
Explore
borghei/Claude-Skills

competitor-alternatives

Competitor comparison and alternative page creation for SEO and sales enablement. Covers 4 page formats (singular alternative, plural alternatives, vs pages, competitor vs competitor), content architecture, research methodology, and centralized competitor data management.

71 21
Explore
borghei/Claude-Skills

contract-and-proposal-writer

Generate production-ready business documents including freelance contracts, project proposals, SOWs, NDAs, and MSAs with jurisdiction-aware clauses. Covers US (Delaware), EU (GDPR), UK, and DACH (German law) legal frameworks. Includes contract templates, clause libraries, and DOCX conversion. Use when starting client engagements, writing proposals, drafting partnership agreements, or needing GDPR-compliant data processing addenda.

71 21
Explore
borghei/Claude-Skills

pricing-strategy

SaaS pricing design and optimization covering value metric selection, tier architecture, price point research, pricing page design, price increase execution, and competitive pricing analysis.

71 21
Explore
borghei/Claude-Skills

referral-program

Referral and affiliate program design covering referral loop architecture, incentive design, trigger moment optimization, viral coefficient modeling, affiliate program structure, and optimization playbook.

71 21
Explore

Didn't find tool you were looking for?

Be as detailed as possible for better results