Agent skill

design-auditor

Comprehensive UI/UX design audit with 12-category scoring, AI slop detection, WCAG accessibility validation, design system compliance, and automated fix recommendations

View SKILL.md on GitHub Repository

Stars 71

Forks 21

Install this agent skill to your Project

npx add-skill https://github.com/borghei/Claude-Skills/tree/main/engineering/design-auditor

Metadata

Additional technical details for this skill

tags: design-audit ai-slop color-contrast accessibility
author: borghei
domain: design-engineering
updated: 1773792000
version: 2.0.0
category: engineering
tech stack: python, css, accessibility, wcag, design-systems
python tools: design_scorer.py, ai_slop_detector.py, color_contrast_checker.py, design_system_validator.py

SKILL.md

Design Auditor

The most comprehensive design audit skill available for AI coding assistants. Combines systematic 12-category UI/UX evaluation, AI slop detection, WCAG accessibility validation, design system compliance checking, and responsive design auditing into a single unified workflow with three independent grades and prioritized fix recommendations.

Keywords

design audit, UI review, UX evaluation, accessibility, WCAG, design system, AI slop detection, color contrast, responsive design, usability heuristics, interaction states, visual hierarchy, typography, spacing, motion design, information architecture, design tokens, cross-device testing

Quick Start

bash

# 1. Run a full design audit (produces 3 grades)
# Prepare audit findings as JSON, then score:
python scripts/design_scorer.py --input findings.json --output report.json

# 2. Detect AI-generated slop in HTML/CSS
python scripts/ai_slop_detector.py --input page.html --css styles.css

# 3. Check color contrast against WCAG
python scripts/color_contrast_checker.py --foreground "#333333" --background "#ffffff"
python scripts/color_contrast_checker.py --input colors.json --level AAA

# 4. Validate design system compliance
python scripts/design_system_validator.py --tokens tokens.json --input styles.css

Core Workflows

Workflow 1: Full Design Audit (12-Category Systematic Review)

Execute a top-to-bottom audit across all 12 categories plus overall coherence. Each category is scored 0-10, weighted, and aggregated into a Design Grade (A-F).

Steps:

Preparation — Gather all screens, components, and interaction flows. Identify the design system (if any) and target platforms.
Visual Pass — Evaluate categories 1-4: Visual Hierarchy, Typography, Color & Contrast, Spacing & Layout Grid.
Interaction Pass — Evaluate categories 5-6: Interaction States, Navigation & Information Architecture.
Platform Pass — Evaluate categories 7-8: Responsive Design, Accessibility (WCAG).
Polish Pass — Evaluate categories 9-10: Motion & Animation, Content & Microcopy.
Integrity Pass — Evaluate categories 11-12: AI Slop Indicators, Performance as Design.
Coherence Assessment — Rate overall design coherence across all categories.
Scoring — Run design_scorer.py to compute weighted grades.
Remediation Plan — Prioritize fixes by severity (Critical > High > Medium > Low > Cosmetic).

bash

python scripts/design_scorer.py --input audit_findings.json --output full_report.json --verbose

Workflow 2: AI Slop Detection

Identify generic, template-driven, or AI-generated UI patterns that lack intentionality and reduce design quality.

Steps:

HTML Analysis — Scan markup for structural slop patterns (generic hero sections, 3-column grids, cookie-cutter pricing tables).
CSS Analysis — Detect visual slop (stock gradients, excessive box-shadows, blur overuse, inconsistent spacing values).
Copy Analysis — Flag generic microcopy, vague CTAs, buzzword density, lorem ipsum residue.
Confidence Scoring — Each finding gets a confidence score (0.0-1.0) indicating how likely it is AI-generated vs intentional design.
Remediation — Provide specific suggestions for making each flagged element more intentional.

bash

python scripts/ai_slop_detector.py --input index.html --css styles.css --threshold 0.6

Workflow 3: Accessibility Compliance Check (WCAG 2.1 AA/AAA)

Systematic validation against WCAG 2.1 success criteria with automated color contrast checking.

Steps:

Color Contrast Audit — Run color_contrast_checker.py against all foreground/background combinations.
Semantic Structure — Verify heading hierarchy, landmark regions, ARIA labels.
Keyboard Navigation — Validate focus order, skip links, focus indicators.
Screen Reader Compatibility — Check alt text, form labels, live regions, error announcements.
Motor Accessibility — Verify touch target sizes (minimum 44x44px), gesture alternatives.
Cognitive Accessibility — Evaluate reading level, consistent navigation, error prevention.

bash

python scripts/color_contrast_checker.py --input color_pairs.json --level AA --suggest-fixes

Workflow 4: Design System Compliance

Validate that implementation matches the project's design token definitions.

Steps:

Token Extraction — Load design tokens (colors, spacing, typography, shadows, radii).
CSS Scanning — Parse stylesheets for hardcoded values that should use tokens.
Deviation Detection — Flag any value not in the token set.
Compliance Scoring — Calculate percentage of values using tokens vs hardcoded.
Migration Plan — Generate token replacement suggestions for each violation.

bash

python scripts/design_system_validator.py --tokens design-tokens.json --input styles.css --output compliance.json

Workflow 5: Responsive & Cross-Device Audit

Validate design across breakpoints and device categories.

Steps:

Breakpoint Analysis — Verify media query coverage (mobile: 320-480, tablet: 481-1024, desktop: 1025+).
Touch Target Validation — Ensure interactive elements meet 44x44px minimum on touch devices.
Content Reflow — Check that content reflows properly without horizontal scrolling at 320px.
Image Responsiveness — Verify srcset usage, art direction, lazy loading.
Typography Scaling — Check fluid typography or appropriate size adjustments per breakpoint.
Navigation Patterns — Validate mobile navigation patterns (hamburger, bottom nav, etc.).

Tools

Tool	Purpose	Input	Output
`design_scorer.py`	Score audit findings across 12 categories, generate 3 grades	Audit JSON	Scored report JSON
`ai_slop_detector.py`	Detect AI-generated UI patterns in HTML/CSS	HTML + CSS files	Findings JSON
`color_contrast_checker.py`	WCAG color contrast validation	Color pairs	Pass/fail + suggestions
`design_system_validator.py`	Design token compliance checking	Tokens + CSS	Compliance report

12 Audit Categories

1. Visual Hierarchy & Composition (Weight: 10%)

Evaluate how effectively the design guides the user's eye through content.

Primary focal point — Is there a clear visual entry point on each screen?
Z-pattern / F-pattern — Does content flow match established reading patterns?
Size contrast — Do important elements have sufficient size differentiation?
Whitespace usage — Is negative space used intentionally to create groupings?
Visual weight balance — Is the composition balanced or intentionally asymmetric?

2. Typography System (Weight: 8%)

Assess typographic consistency, hierarchy, and readability.

Type scale — Is there a consistent modular scale (e.g., 1.25, 1.333, 1.5)?
Heading hierarchy — H1 > H2 > H3 clearly differentiated in size and weight?
Body readability — Line height 1.4-1.6, line length 45-75 characters?
Font pairing — Maximum 2-3 font families with clear roles?
Responsive typography — Font sizes adapt appropriately across breakpoints?

3. Color & Contrast (Weight: 8%)

Validate color system integrity and accessibility compliance.

Color palette — Defined primary, secondary, neutral, semantic (error/warning/success/info)?
Contrast ratios — All text meets WCAG AA minimum (4.5:1 normal, 3:1 large)?
Color independence — Information not conveyed by color alone?
Dark mode — If applicable, proper color inversion with maintained contrast?
Color consistency — Same semantic meaning for same colors throughout?

4. Spacing & Layout Grid (Weight: 8%)

Assess spatial consistency and grid system usage.

Spacing scale — Consistent spacing values (4px/8px base or similar)?
Grid system — Defined column grid with consistent gutters?
Component spacing — Consistent internal padding and external margins?
Alignment — Elements aligned to grid or intentionally offset?
Density balance — Appropriate information density for the context?

5. Interaction States (Weight: 10%)

Validate completeness of interactive element states.

Default — Clear affordance that element is interactive
Hover — Visual feedback on cursor hover (desktop)
Focus — Visible focus indicator for keyboard navigation
Active/Pressed — Feedback during click/tap
Disabled — Clearly non-interactive appearance with reduced opacity
Loading — Skeleton screens, spinners, or progress indicators
Error — Clear error state with actionable messaging
Empty — Meaningful empty states with guidance
Success — Confirmation feedback after successful actions

6. Navigation & Information Architecture (Weight: 8%)

Evaluate wayfinding and content organization.

Navigation clarity — User always knows where they are and can get where they need
Breadcrumbs/indicators — Current location visible in navigation hierarchy
Menu organization — Logical grouping with 7-plus-or-minus-2 items per level
Search — Available and functional when content volume warrants it
Deep linking — Key content reachable within 3 clicks/taps

7. Responsive Design (Weight: 8%)

Validate cross-device adaptation quality.

Breakpoint coverage — Smooth transitions at mobile/tablet/desktop breakpoints
Content priority — Most important content visible without scrolling on mobile
Touch targets — Minimum 44x44px on touch devices with 8px gaps
Orientation — Layouts work in both portrait and landscape
No horizontal scroll — Content contained at 320px minimum width

8. Accessibility — WCAG (Weight: 12%)

Comprehensive accessibility validation (highest weight category).

Perceivable — Alt text, captions, sufficient contrast, text resizing
Operable — Keyboard accessible, sufficient time, no seizure triggers, navigable
Understandable — Readable, predictable, input assistance
Robust — Compatible with assistive technologies, valid markup
ARIA usage — Correct roles, states, and properties when needed
Focus management — Logical focus order, focus trapping in modals

9. Motion & Animation (Weight: 5%)

Assess animation quality and appropriateness.

Purpose — Animations serve functional purpose (feedback, orientation, guidance)
Performance — Animations use GPU-accelerated properties (transform, opacity)
Duration — Appropriate timing (100-300ms for micro-interactions, 300-500ms for transitions)
Easing — Natural easing curves, no linear animations for UI elements
Reduced motion — Respects prefers-reduced-motion media query

10. Content & Microcopy (Weight: 5%)

Evaluate written content quality within the UI.

Button labels — Action-oriented, specific (not "Submit" or "Click Here")
Error messages — Explain what happened and how to fix it
Empty states — Guide users toward productive action
Tooltips/help — Contextual without overwhelming
Consistency — Same terminology throughout (not "delete" in one place, "remove" in another)

11. AI Slop Indicators (Weight: 8%)

Detect patterns suggesting generic AI-generated design without human refinement.

Generic hero sections — Full-width hero with centered text over stock gradient
Stock gradient backgrounds — Linear gradients with trending color combinations (purple-to-blue, etc.)
3-column feature grids — Icon + heading + paragraph repeated exactly 3 times
Meaningless illustrations — Abstract SVG decorations with no semantic connection
Cookie-cutter pricing tables — 3-tier layout with highlighted "recommended" plan
Generic testimonial layouts — Circular avatar + quote + name/title in a carousel
Over-use of shadows/blur — Excessive drop shadows and backdrop blur on every element
Inconsistent icon styles — Mixed icon sets (some outlined, some filled, different weights)
Lorem ipsum residue — Placeholder text or suspiciously generic content
Template section ordering — Hero > Features > Testimonials > Pricing > CTA > Footer

12. Performance as Design (Weight: 5%)

Evaluate how performance impacts the design experience.

Perceived performance — Skeleton screens, optimistic UI, progressive loading
Image optimization — Proper formats (WebP/AVIF), sizing, lazy loading
Font loading — FOUT/FOIT strategy, font-display property
Layout stability — No Cumulative Layout Shift (CLS) from late-loading elements
Interaction responsiveness — UI responds within 100ms of user action

Overall Coherence Bonus (Weight: 5%)

Design unity — All elements feel like they belong to the same design system
Brand alignment — Design reflects brand personality and values
Consistency — Patterns learned in one area apply throughout
Polish level — Attention to detail in edge cases and transitions

Scoring System

Three Independent Grades

Unlike single-grade systems, the Design Auditor produces three separate grades for comprehensive evaluation:

1. Design Grade (A-F)

Weighted aggregate of all 12 categories plus coherence bonus.

Grade	Score Range	Meaning
A+	95-100	Exceptional — production-ready, sets new standards
A	90-94	Excellent — minor polish items only
B+	85-89	Very Good — few notable issues
B	80-84	Good — solid foundation with improvement areas
C+	75-79	Adequate — functional but needs refinement
C	70-74	Below Average — multiple issues need attention
D	60-69	Poor — significant problems across categories
F	0-59	Failing — fundamental redesign needed

2. AI Slop Grade (A-F)

Based on AI Slop Indicators category (inverted — fewer slop patterns = higher grade).

Grade	Meaning
A	Highly original — no detectable AI patterns
B	Mostly original — 1-2 minor patterns detected
C	Mixed — several AI patterns present but customized
D	Heavily templated — many generic AI patterns
F	Pure slop — design appears entirely AI-generated without refinement

3. Accessibility Grade (A-F)

Based on WCAG compliance level achieved.

Grade	Meaning
A+	AAA compliant — exceeds all requirements
A	AA compliant — meets all standard requirements
B	Mostly AA — 1-3 minor violations
C	Partial AA — several violations, core content accessible
D	Significant gaps — multiple barriers to access
F	Inaccessible — fundamental accessibility failures

AI Slop Detection Patterns

Visual Slop

Pattern	Description	Confidence
Generic hero section	Full-width hero, centered heading, subtitle, CTA over gradient/image	0.7-0.9
Stock gradient	Linear gradient with trending colors (purple-blue, pink-orange)	0.5-0.8
3-column feature grid	Exactly 3 columns: icon + heading + paragraph	0.6-0.8
Shadow overuse	box-shadow on > 60% of card/container elements	0.5-0.7
Blur overuse	backdrop-filter: blur on multiple overlapping elements	0.5-0.7
Rounded everything	border-radius > 12px on most elements	0.4-0.6

Copy Slop

Pattern	Description	Confidence
Vague CTAs	"Get Started", "Learn More", "Try It Free" without context	0.5-0.7
Buzzword density	"Seamless", "Powerful", "Revolutionary", "Next-gen" clustering	0.6-0.8
Generic testimonials	Vague praise without specific product/feature references	0.6-0.8
Lorem ipsum residue	Placeholder text fragments or suspiciously uniform paragraph lengths	0.8-1.0

Structural Slop

Pattern	Description	Confidence
Template ordering	Hero > Features > Social Proof > Pricing > CTA > Footer	0.5-0.7
Cookie-cutter pricing	3 tiers, middle highlighted, feature checkmark lists	0.6-0.8
Predictable footer	4-column footer with Newsletter, Social, Links, Legal	0.4-0.6

Design Philosophy & Taste Anchors

These 10 principles are not optional guidelines — they are the audit's operating system. Every finding is evaluated against them.

Specificity over vibes — "Clean UI" is banned. Name the font, the spacing scale, the color system. If you cannot name it, it is not a design decision.
Empty states are features — "No items found" is a bug. Empty states guide users to their first action, explain what will appear, and build confidence.
Subtraction default — As little design as possible, but not less. Every element must earn its place. When in doubt, remove it. (Dieter Rams)
Edge cases are user experiences — 47-character names, zero search results, slow networks, stale state. These are not exceptions — they are the product for some users, every time.
Four shadow paths — For every interaction, test: happy path, nil input, empty input, error upstream. If any path produces a blank screen or unhandled state, that is a Critical finding.
Loading states earn trust — Skeleton screens > spinners > blank pages. Progressive disclosure of content signals responsiveness even before data arrives.
Consistency compounds — One off-system color erodes the entire design language. Design tokens are contracts, not suggestions. Every deviation is a finding.
Motion has meaning — Every animation must communicate state change (enter, exit, reorder, feedback). Decorative motion without purpose is noise — and noise is a finding.
Accessibility is not a feature — It is a baseline. WCAG AA is the floor, not the ceiling. Accessibility findings are never "Low" priority.
Performance is perceived design — A 3-second load feels broken regardless of visual quality. LCP, CLS, and INP are design metrics.

Interaction Model & Safety Controls

Fix Session Rules

One issue = one question — never batch multiple design issues into a single prompt. Each finding is evaluated and addressed independently.
AUTO-FIX (no confirmation): Cosmetic issues — spacing token mismatches, off-system colors swapped to nearest token, minor border-radius corrections
ASK (requires confirmation): Structural issues — layout changes, component swaps, new patterns, navigation restructuring
Maximum 30 fixes per session — hard stop. After 30 fixes, generate report and exit.
Risk accumulator — component file changes (+5 risk), global style changes (+8), layout changes (+10), reverts (+15). Stop if cumulative risk exceeds 20% of budget (100).
Revert on regression — if a fix breaks existing visual tests or introduces a new Critical finding, immediately git revert and flag for manual review.

Tool Input/Output Documentation

design_scorer.py — Expected Input Format

json

{
  "categories": {
    "visual_hierarchy": {
      "score": 7.5,
      "findings": [
        {
          "description": "Homepage lacks clear visual entry point",
          "priority": "high",
          "fix": "Establish single primary focal point using size contrast",
          "impact": 7
        }
      ]
    },
    "typography": { "score": 8.0, "findings": [...] },
    "color_contrast": { "score": 6.5, "findings": [...] },
    "spacing_layout": { "score": 7.0, "findings": [...] },
    "interaction_states": { "score": 5.0, "findings": [...] },
    "navigation_ia": { "score": 7.5, "findings": [...] },
    "responsive": { "score": 6.0, "findings": [...] },
    "accessibility": { "score": 5.0, "findings": [...] },
    "motion_animation": { "score": 8.0, "findings": [...] },
    "content_microcopy": { "score": 7.0, "findings": [...] },
    "ai_slop": { "score": 6.0, "findings": [...] },
    "performance_design": { "score": 7.5, "findings": [...] }
  }
}

design_scorer.py — Output

Design Grade:         D (67.2/100)
AI Slop Grade:        C (6.0/10)
Accessibility Grade:  C (5.0/10)

Category Breakdown:
  Visual Hierarchy      10%   7.5/10   1 finding
  Typography             8%   8.0/10   1 finding
  Color & Contrast       8%   6.5/10   2 findings
  Spacing & Layout       8%   7.0/10   1 finding
  Interaction States    10%   5.0/10   3 findings  ← attention
  Navigation & IA        8%   7.5/10   0 findings
  Responsive             8%   6.0/10   2 findings
  Accessibility         12%   5.0/10   4 findings  ← attention
  Motion & Animation     5%   8.0/10   0 findings
  Content & Microcopy    5%   7.0/10   1 finding
  AI Slop Indicators     8%   6.0/10   2 findings
  Performance            5%   7.5/10   1 finding

Recommendations (by impact):
  1. [Critical] Add missing form labels — accessibility
  2. [High] Implement hover/focus/disabled states for all buttons
  3. [High] Fix contrast ratio on secondary text (3.2:1 → 4.5:1)

color_contrast_checker.py — Input/Output

bash

# Single pair
python scripts/color_contrast_checker.py --fg "#333" --bg "#fff" --level AA

# Batch mode from JSON
python scripts/color_contrast_checker.py --input color-pairs.json --suggest-fixes

json

// color-pairs.json input format
[
  {"foreground": "#666666", "background": "#ffffff", "label": "body text"},
  {"foreground": "#999999", "background": "#f5f5f5", "label": "muted text"}
]

design_system_validator.py — Input

bash

python scripts/design_system_validator.py --tokens design-tokens.json --input src/styles/

json

// design-tokens.json format
{
  "colors": {"primary": "#1a73e8", "secondary": "#5f6368", "error": "#d93025"},
  "spacing": [0, 4, 8, 12, 16, 24, 32, 48, 64],
  "typography": {"body": "16px", "h1": "32px", "h2": "24px", "h3": "20px"},
  "radii": [0, 4, 8, 16, 9999]
}

CI/CD Integration

yaml

# .github/workflows/design-audit.yml
name: Design Audit Gate
on:
  pull_request:
    paths:
      - '**/*.css'
      - '**/*.scss'
      - '**/*.html'
      - '**/*.tsx'
      - '**/*.vue'
jobs:
  design-checks:
    runs-on: ubuntu-latest
    steps:
      - uses: actions/checkout@v4
      - name: Color Contrast Check
        run: python scripts/color_contrast_checker.py --input color-pairs.json --level AA
      - name: Design System Compliance
        run: |
          python scripts/design_system_validator.py \
            --tokens design-tokens.json \
            --input src/styles/ \
            --format json > compliance.json
          # Fail if compliance below 90%
          python -c "import json; c=json.load(open('compliance.json')); exit(0 if c.get('compliance_percentage',0)>=90 else 1)"
      - name: AI Slop Detection
        run: |
          python scripts/ai_slop_detector.py --input dist/index.html --threshold 0.7 --format json > slop.json

Fix Priority System

Priority	Definition	Response Time	Examples
Critical	Breaks core UX, blocks user tasks, accessibility barrier	Immediate	Missing form labels, contrast below 2:1, broken navigation
High	Degrades experience significantly, affects many users	Before launch	Inconsistent interaction states, poor mobile layout, missing error states
Medium	Creates inconsistency, affects some users	Next sprint	Off-system colors, spacing violations, minor responsive issues
Low	Polish and refinement, minor impact	Backlog	Animation timing, icon consistency, microcopy improvements
Cosmetic	Nice-to-have, no functional impact	Optional	Pixel alignment, hover animation easing, shadow refinement

Integration Points

Code Review — Run design auditor alongside code review for frontend PRs
CI/CD — Integrate color_contrast_checker.py and design_system_validator.py into pipelines
Design Handoff — Use full audit before developer handoff to catch issues early
Sprint Review — Run abbreviated audit (categories 1-8) for sprint demo review
Release Gate — Require minimum B grade for Design and A for Accessibility before release
Design System Updates — Re-run design_system_validator.py after token updates

References

references/design_audit_methodology.md — Systematic audit approach, heuristics, Gestalt principles
references/ai_slop_patterns.md — Comprehensive AI pattern catalog with remediation strategies
references/accessibility_checklist.md — Complete WCAG 2.1 A/AA/AAA checklist

Assets

assets/design_audit_template.md — Ready-to-use audit report template
assets/sample_audit_data.json — Sample JSON input for Python tools

Troubleshooting

Problem	Cause	Solution
`design_scorer.py` exits with "Missing category"	Input JSON is missing one or more of the 12 required category keys under `categories`	Ensure all 12 keys are present: `visual_hierarchy`, `typography`, `color_contrast`, `spacing_layout`, `interaction_states`, `navigation_ia`, `responsive`, `accessibility`, `motion_animation`, `content_microcopy`, `ai_slop`, `performance_design`
Color contrast checker rejects a color value	Unsupported color format passed to `--foreground` or `--background`	Use hex (`#RGB`/`#RRGGBB`), `rgb(R,G,B)`, `hsl(H,S%,L%)`, or a named color (`black`, `white`, `red`, etc.)
AI slop detector reports zero findings on a clearly templated page	Confidence threshold is set too high, filtering out valid detections	Lower `--threshold` (default `0.5`); try `0.3` for a broader sweep
Design system validator finds no violations but CSS clearly has hardcoded values	Token JSON is missing the relevant category (e.g., `spacing` or `border_radii`)	Add all applicable token categories to the tokens file; the validator skips any category with no defined tokens
`design_system_validator.py` reports "No CSS files found"	Path points to a directory but files use `.scss` extension	Pass `--glob ".scss"` or `--glob ""` to include SCSS files
Scores seem inflated despite many findings	Category scores in the input JSON are set too high relative to the findings listed	Scores are author-provided (0-10); re-evaluate each category score to accurately reflect the number and severity of findings
Baseline trend comparison shows unexpected "unchanged" across all categories	Baseline JSON file has the same scores as the current file	Verify you are passing the correct previous audit JSON via `--baseline`, not a copy of the current file

Success Criteria

WCAG AA color contrast compliance rate above 95% across all foreground/background pairs in the project.
Design system token compliance rate at or above 90%, with zero off-system color values in production CSS.
AI Slop Grade of B or higher (two or fewer high-confidence AI patterns detected above the 0.5 threshold).
Overall Design Grade of B (80/100) or higher before release, with no Critical-priority findings remaining open.
Accessibility Grade of A (score 8.5+/10), confirming full WCAG 2.1 AA compliance with no perceivable or operable barriers.
All interactive elements have complete state coverage: default, hover, focus, active, disabled, loading, error, empty, and success.
Zero lorem ipsum or placeholder text residue detected by the AI slop detector in any production build.

Scope & Limitations

Covers:

Visual and structural auditing across 12 weighted categories with three independent grades (Design, AI Slop, Accessibility).
Automated WCAG 2.1 AA/AAA color contrast validation with compliant color suggestions for failing pairs.
Design token compliance checking for colors, spacing, typography, radii, shadows, z-indices, transitions, and breakpoints.
AI-generated pattern detection in HTML markup, CSS stylesheets, inline styles, and page copy.

Does NOT cover:

Runtime accessibility testing (screen reader behavior, live ARIA state changes, keyboard trap detection) — see senior-qa for end-to-end testing workflows.
Performance profiling (LCP, CLS, INP measurement) — see senior-frontend and senior-devops for Lighthouse and Core Web Vitals tooling.
User research or usability testing with real participants — see product-team/ux-researcher for research frameworks.
Design file inspection (Figma, Sketch, Adobe XD source files) — this skill operates on implemented HTML/CSS output only.

Integration Points

Skill	Integration	Data Flow
`senior-frontend`	Run design audit on frontend component library after build	Frontend CSS/HTML output -> `design_system_validator.py` + `color_contrast_checker.py` -> compliance gates
`senior-qa`	Incorporate accessibility and design regression checks into QA pipelines	`design_scorer.py` report JSON -> QA test assertions on grade thresholds
`code-reviewer`	Attach design audit findings to frontend PR reviews	`ai_slop_detector.py` + `color_contrast_checker.py` findings -> PR comment annotations
`senior-devops`	Gate deployments on minimum design compliance scores in CI/CD	`design_system_validator.py` compliance percentage -> pipeline pass/fail exit code
`product-team/ux-researcher`	Feed audit findings into usability research prioritization	`design_scorer.py` recommendations list -> research backlog items ranked by impact
`senior-secops`	Validate that accessibility compliance meets regulatory requirements (ADA, EAA)	Accessibility Grade from `design_scorer.py` -> compliance evidence artifacts

Tool Reference

design_scorer.py

Purpose: Computes three independent grades (Design A-F, AI Slop A-F, Accessibility A-F) from a 12-category audit findings JSON file. Supports baseline trend comparison and prioritized fix recommendations.

Usage:

bash

python scripts/design_scorer.py --input findings.json
python scripts/design_scorer.py --input findings.json --output report.json --verbose
python scripts/design_scorer.py --input findings.json --baseline previous.json --format text

Flags/Parameters:

Flag	Short	Required	Default	Description
`--input`	`-i`	Yes	—	Path to audit findings JSON file
`--output`	`-o`	No	stdout	Path to write report output
`--baseline`	`-b`	No	—	Path to baseline findings JSON for trend comparison
`--format`	`-f`	No	`json`	Output format: `json` or `text`
`--verbose`	`-v`	No	off	Include detailed per-category findings in text output

Example:

bash

python scripts/design_scorer.py --input audit_findings.json --baseline last_sprint.json --format text --verbose

Output Formats:

JSON (default) — Full report object with grades, category_breakdown, coherence, recommendations, summary, and optional trend data.
Text — Human-readable report with grade summary, category score table, findings summary, top issues, trend comparison (if baseline provided), and all recommendations grouped by priority.
Text + Verbose — Text report plus a detailed section listing every finding per category with priority, description, suggested fix, and impact score.

ai_slop_detector.py

Purpose: Analyzes HTML and CSS files for AI-generated UI patterns including stock gradients, generic hero sections, cookie-cutter layouts, buzzword-heavy copy, lorem ipsum residue, and inconsistent icon styles. Each finding receives a confidence score (0.0-1.0).

Usage:

bash

python scripts/ai_slop_detector.py --input page.html
python scripts/ai_slop_detector.py --input page.html --css styles.css --threshold 0.6
python scripts/ai_slop_detector.py --input page.html --format text --output report.txt

Flags/Parameters:

Flag	Short	Required	Default	Description
`--input`	`-i`	Yes	—	Path to HTML file to analyze
`--css`	`-c`	No	—	Path to external CSS file (inline and embedded styles are always extracted from HTML)
`--threshold`	`-t`	No	`0.5`	Confidence threshold (0.0-1.0); findings below this are reported but counted separately
`--output`	`-o`	No	stdout	Path to write report output
`--format`	`-f`	No	`json`	Output format: `json` or `text`

Example:

bash

python scripts/ai_slop_detector.py --input dist/index.html --css dist/styles.css --threshold 0.7 --format json

Output Formats:

JSON (default) — Report with grade (A-F), summary (total patterns, above/below threshold counts, by-category breakdown), findings sorted by confidence descending, and findings_by_category grouped into visual, structural, and copy.
Text — Human-readable report with grade, category totals, and each finding showing confidence bar, type, description, evidence, line number, and remediation suggestion.

color_contrast_checker.py

Purpose: Validates color contrast ratios against WCAG 2.1 AA and AAA standards. Supports hex, rgb, hsl, and named color formats. Can check a single pair or batch-process from a JSON file. Optionally suggests the closest compliant color alternative for failing pairs.

Usage:

bash

# Single pair
python scripts/color_contrast_checker.py --foreground "#333" --background "#fff"

# Batch mode
python scripts/color_contrast_checker.py --input color_pairs.json --level AAA --suggest-fixes

# With text size context
python scripts/color_contrast_checker.py --fg "#666" --bg "#eee" --size large --format text

Flags/Parameters:

Flag	Short	Required	Default	Description
`--foreground`	`--fg`	Yes (single mode)	—	Foreground color (hex, rgb, hsl, or named)
`--background`	`--bg`	Yes (single mode)	—	Background color (hex, rgb, hsl, or named)
`--size`	—	No	`normal`	Text size context: `normal`, `large`, or `ui`
`--input`	`-i`	Yes (batch mode)	—	Path to JSON file with color pairs
`--level`	`-l`	No	`AA`	WCAG level to check: `AA` or `AAA`
`--suggest-fixes`	`-s`	No	off	Suggest closest compliant foreground and background alternatives for failing pairs
`--output`	`-o`	No	stdout	Path to write report output
`--format`	`-f`	No	`json`	Output format: `json` or `text`

Example:

bash

python scripts/color_contrast_checker.py --input brand-colors.json --level AA --suggest-fixes --format text

Output Formats:

JSON (default) — Report with accessibility_grade (A+ through F), summary (total pairs, passing, failing, compliance rate), results array with per-pair ratio, pass/fail status across all WCAG levels, and optional suggestions with adjusted foreground/background hex values and their new ratios.
Text — Human-readable report with grade, compliance rate, per-pair pass/fail status, ratio details, WCAG level results (AA normal, AA large, AAA normal, AAA large), suggested fixes, and a failing pairs summary.

design_system_validator.py

Purpose: Validates CSS files against a design token JSON definition. Detects hardcoded colors, spacing, font sizes, font families, font weights, line heights, border radii, shadows, breakpoints, z-indices, and transitions that deviate from the token set. Reports compliance percentage and suggests the closest matching token for each violation.

Usage:

bash

python scripts/design_system_validator.py --tokens tokens.json --input styles.css
python scripts/design_system_validator.py --tokens tokens.json --input src/ --glob "*.scss"
python scripts/design_system_validator.py --tokens tokens.json --input styles.css --output report.json --format text

Flags/Parameters:

Flag	Short	Required	Default	Description
`--tokens`	`-t`	Yes	—	Path to design tokens JSON file
`--input`	`-i`	Yes	—	Path to CSS file or directory to scan
`--glob`	`-g`	No	`*.css`	Glob pattern for filtering files when input is a directory (e.g., `.scss`, ``)
`--output`	`-o`	No	stdout	Path to write report output
`--format`	`-f`	No	`json`	Output format: `json` or `text`

Example:

bash

python scripts/design_system_validator.py --tokens design-tokens.json --input src/styles/ --glob "*" --format text

Output Formats:

JSON (default) — Report with compliance_grade (A-F), compliance_rate percentage, summary (total values checked, compliant, violations), category_breakdown with per-category stats, violations_by_category grouped with property/value/line/selector/suggested token, and all_violations flat list.
Text — Human-readable report with grade, compliance rate, category breakdown table (total/pass/fail/rate per category), and violations listed per category with line numbers, property values, and suggested token replacements (limited to 20 per category in display).

Maintainer

borghei Core maintainer

Source details

Full Name: borghei/Claude-Skills
Branch: main
Path in repo: engineering/design-auditor
License: Other
Topics: claude-code automation ai-agents cursor developer-tools agentic-coding github-copilot prompt-engineering llm python ai-coding-assistant ai-skills windsurf openai-codex compliance-automation eu-ai-act gdpr-compliance iso-27001 role-based-agents soc2

Featured Tools

Join Our Newsletter

Referral and affiliate program design covering referral loop architecture, incentive design, trigger moment optimization, viral coefficient modeling, affiliate program structure, and optimization playbook.

71 21

Explore

Didn't find tool you were looking for?

Search AI Tools

Install this agent skill to your Project

Metadata

SKILL.md

Design Auditor

Keywords

Quick Start

Core Workflows

Workflow 1: Full Design Audit (12-Category Systematic Review)

Workflow 2: AI Slop Detection

Workflow 3: Accessibility Compliance Check (WCAG 2.1 AA/AAA)

Workflow 4: Design System Compliance

Workflow 5: Responsive & Cross-Device Audit

Tools

12 Audit Categories

1. Visual Hierarchy & Composition (Weight: 10%)

2. Typography System (Weight: 8%)

3. Color & Contrast (Weight: 8%)

4. Spacing & Layout Grid (Weight: 8%)

5. Interaction States (Weight: 10%)

6. Navigation & Information Architecture (Weight: 8%)

7. Responsive Design (Weight: 8%)

8. Accessibility — WCAG (Weight: 12%)

9. Motion & Animation (Weight: 5%)

10. Content & Microcopy (Weight: 5%)

11. AI Slop Indicators (Weight: 8%)

12. Performance as Design (Weight: 5%)

Overall Coherence Bonus (Weight: 5%)

Scoring System

Three Independent Grades

1. Design Grade (A-F)

2. AI Slop Grade (A-F)

3. Accessibility Grade (A-F)

AI Slop Detection Patterns

Visual Slop

Copy Slop

Structural Slop

Design Philosophy & Taste Anchors

Interaction Model & Safety Controls

Fix Session Rules

Tool Input/Output Documentation

design_scorer.py — Expected Input Format

design_scorer.py — Output

color_contrast_checker.py — Input/Output

design_system_validator.py — Input

CI/CD Integration

Fix Priority System

Integration Points

References

Assets

Troubleshooting

Success Criteria

Scope & Limitations

Integration Points

Tool Reference

design_scorer.py

ai_slop_detector.py

color_contrast_checker.py

design_system_validator.py

Recommended Agent Skills

churn-prevention

popup-cro

competitor-alternatives

contract-and-proposal-writer

pricing-strategy

referral-program