AAgentic Design School
Course
Intermediate
Available

Design Review and Critique with Agents

Critique is where design quality is actually decided, and it is the part of the process most teams under-resource. This course covers using agents to make review sharper and more frequent without outsourcing taste: structured critique loops, heuristic evaluations and cognitive walkthroughs run at a scale no team could staff, visual regression evidence, accessibility and content review, and design review wired into every pull request.

Modules5 modules

Last updated2026-06-02

Who it is for

Audience and outcomes

Design leads, senior designers, and design-minded engineers responsible for review quality — particularly teams where critique currently depends on whoever has spare time that week.

By the end of this course you can
  • Run critique loops where the agent applies named dimensions and the human makes the calls.
  • Execute heuristic evaluations and cognitive walkthroughs across whole products with evidence per finding.
  • Use screenshot-based regression evidence so visual review stops relying on memory.
  • Run accessibility and UX-writing reviews as standard passes rather than special projects.
  • Wire design review into every pull request with clear severity levels and human escalation.
  • Keep taste and final judgment human while making the mechanical parts of review continuous.
Curriculum

Modules

Modules marked Available have full slide decks with speaker notes and narration scripts. The rest show their planned outline until production catches up.

1

Critique Loops with Agents

Available
40–50 minutes

The structure of a useful critique loop: named dimensions instead of vibes, the agent as a tireless first reviewer, and the difference between feedback an agent can act on and feedback that needs a human decision.

  • Define critique dimensions — hierarchy, density, tone, states, accessibility — for your product.
  • Run an agent critique pass that produces findings with evidence, not opinions.
  • Separate actionable feedback from judgment calls and route each correctly.
Open module 1
2

Heuristic Evaluation and Cognitive Walkthroughs at Scale

Available
45–55 minutes

Two classic methods most teams skip because of cost, run across an entire product by agents: standard heuristics applied screen by screen, task-based walkthroughs step by step, and the human pass that turns findings into decisions.

  • Set up a heuristic evaluation across a full product surface with consistent criteria.
  • Run cognitive walkthroughs against defined user tasks rather than screens in isolation.
  • Triage large finding sets without drowning the team in low-severity noise.
Open module 2
3

Visual QA and Regression Evidence

Available
40–50 minutes

Replacing it looked fine last time I checked with evidence: screenshot baselines, agent-run sweeps across states and breakpoints, and regression reports a designer can act on in minutes rather than re-checking everything by hand.

  • Establish screenshot baselines for the surfaces that matter.
  • Run regression sweeps that compare against the baseline with diffs as evidence.
  • Distinguish genuine regressions from intended change without manual re-review of everything.
Open module 3
4

Accessibility and Content Review

Available
40–50 minutes

Two reviews that are chronically deferred, made routine: accessibility sweeps that go beyond automated contrast checks, and UX-writing review that holds copy to the product's voice with the same rigour as the visuals.

  • Run accessibility reviews covering structure, keyboard flow, and screen-reader sense, not just contrast.
  • Audit interface copy for voice, clarity, and consistency against a written standard.
  • Decide which findings agents can fix directly and which need design or legal judgment.
Open module 4
5

Design Review on Every PR

Available
40–50 minutes

The end state: every change that touches the interface gets a design review automatically — token and component checks, screenshot evidence, severity levels — with humans reviewing what matters instead of everything or nothing.

  • Define what a per-PR design review checks and what it deliberately ignores.
  • Set severity levels and escalation so humans see the findings that need them.
  • Roll the practice out without turning it into a blocking bureaucracy.
Open module 5
Related material

Books and articles behind this course

The course teaches the practice; the books and articles carry the depth, the sources, and the worked runs it draws on.

The Agentic Designer cover
Curriculum
The Agentic Designer
How AI agents are transforming product design.

The operating model for product designers, design leads, and builders who need to understand what changes when agents join design work.

  • Visual QA With Agents

    A long-form workflow for using agents to inspect screenshots, compare implementation against intent, run accessibility checks, and produce fix-ready review notes — traced against a real review of this site's own article page.

  • Design Critique Loops With Agents

    A practical workflow for turning agent critique from vague opinion into structured findings, evidence, severity, revision passes, and human design decisions.

  • Design System Audits With Agents

    A deep workflow for finding design-system drift across tokens, components, screenshots, copy, interaction states, and product meaning.

  • Four Agentic Design Workflows in Production: What Actually Happened

    Four real agentic design workflows traced from their own repositories — an editorial site, a multi-agent book pipeline, a book-to-video pipeline, and the design system underneath the site — with the timelines git can prove, the configs that did the steering, the failures the gates caught, and the patterns that repeat across all four.