AAgentic Design School
Module 2 of 6
40–50 minutes

Agentic Prototyping

From Reference to Brief

References are the richest input designers have and the easiest to misuse. This module covers turning screenshots, competitor flows, and research findings into briefs that borrow behaviour rather than cloning surfaces.

Duration40–50 minutes

Slides13 slides with notes and narration

Learning objectives

  • Annotate references so the agent borrows the right things: density, rhythm, hierarchy.
  • Combine research findings and references into a single executable brief.
  • Avoid the copy-this-screenshot failure and the legal and taste problems behind it.
Slide deck

Work through the module

Each slide is shown in its 16:9 frame, exactly as it appears in the video version. Open the notes under any slide for the longer explanation, and the narration if you prefer to read along.

Slide 1 of 1316:9

From Reference to Brief

Agentic Prototyping · Module 2 of 6

  • What references are actually good for — and what they are not
  • Annotating a reference: match this, ignore that, never do this
  • Folding research findings into the same brief
  • The clone trap, and the legal, taste, and product reasons to avoid it

References carry behaviour worth borrowing. The brief is where that behaviour becomes something an agent can execute without cloning the surface.

Slide notes

Module 1 established the prototype-first mindset and the discipline around scope and disposability. This module is about the input side: most prototypes do not start from a blank page, they start from a pile of references — screenshots someone admired, a competitor flow the product manager keeps mentioning, a handful of research findings, and the product's own constraints. The skill is converting that pile into a brief an agent can execute.

Name the failure mode up front, because everyone in the room has either done it or been asked to do it: pasting a screenshot into the agent with the words make something like this. The agent will oblige, and what comes back is somewhere between a derivative layout and a near-clone — generic where the screenshot was specific, and uncomfortably close where the screenshot was distinctive. Both outcomes are bad, for different reasons this module covers.

The module is structured as a pipeline: what to extract from each kind of input, how to annotate a reference so the extraction is explicit, how research findings join the brief, and then a worked example tracing a competitor flow all the way to a prototype. The exercise at the end has participants annotate one real reference from their own work.

Narration for this slide

Welcome back. Module 1 was about scoping the prototype; this module is about feeding it. Almost no prototype starts from nothing — you start with references. A screenshot you admire, a competitor flow that works, research notes about what users struggled with, and your own product's constraints. The question is how all of that becomes a brief an agent can execute. The lazy answer is to paste the screenshot in and say make something like this — and we will spend some time on why that fails. The better answer is to extract behaviour from each input, annotate it, and put it where it belongs in the brief. That is what this module teaches.

Slide 2 of 1316:9

What references carry: behaviour, not pixels

A good reference is evidence that a design behaviour works. The behaviour transfers; the surface should not.

  • Density — how much information sits on screen before it feels crowded
  • Rhythm — spacing, grouping, and the cadence of rows, cards, or bands
  • Hierarchy — what the screen makes you read first, second, and third
  • Navigation and flow patterns — how steps are ordered and how users move
  • States — how the reference handles empty, error, loading, and long content

If you cannot name the trait you want from a reference, the agent will pick one for you — and it usually picks the palette.

Slide notes

The list on this slide is the extraction vocabulary for the rest of the module. Each item is a behaviour: something the reference does that your prototype could also do, in your own components, your own tokens, and your own voice. Density is the most underused one — designers feel density immediately but rarely name it, and unnamed it never reaches the agent. Rhythm covers the spacing and grouping decisions that make a dense screen feel calm or a sparse screen feel intentional. Hierarchy is the reading order. Flow patterns cover step order and navigation models. States are the quiet gold in competitor references: a mature product's empty states and error recovery represent years of support tickets you did not have to receive.

Contrast that with what a reference also carries that you do not want: brand colour, typography, copy voice, illustration style, logo treatments, and content. Those are the parts an agent reaches for first when given a screenshot with no instruction, because they are the most visually salient and the easiest to imitate.

The practical implication is that a reference is only as useful as the sentence you write about it. The school's prompting article makes the same point about screenshots generally: a screenshot is evidence when the prompt says what it is evidence of, and decoration when it does not.

Narration for this slide

Start with what a reference is actually good for. A reference is evidence that some design behaviour works — and behaviour is the part you can borrow honestly. Density: how much information the screen carries before it feels crowded. Rhythm: the spacing and grouping that make that density readable. Hierarchy: what you read first, second, third. Flow patterns: the order of steps and how people move between them. And states — how a mature product handles empty, error, and loading, which is years of someone else's support tickets distilled into screens. What a reference also carries is brand, copy, and palette. That part does not transfer. And if you do not name the trait you want, the agent will default to imitating the most visible thing — which is usually the palette.

Slide 3 of 1316:9

Cloning a surface vs borrowing behaviour

The same screenshot, used two ways. The difference is entirely in what the brief asks for.

Clone the surfaceBorrow the behaviour
The instruction"Make something like this screenshot""Match its table density and row rhythm; ignore brand, copy, and colour"
What comes backA near-copy in the reference's visual languageYour components and tokens, arranged with the reference's discipline
Fit with your productFights your design system and your voiceReads as part of your product on the first draft
RiskTrade dress, brand confusion, and a taste ceiling set by someone elseNone beyond ordinary design influence
What you learnedNothing — the thinking stayed in the screenshotA named trait you can reuse in the next brief

Cloning outsources the design decision to the reference. Borrowing keeps the decision yours and uses the reference as evidence for it.

Slide notes

Walk the columns rather than the rows. The clone column is what happens by default: the instruction is one line, the agent imitates what it can see, and what it can see is the surface. The output fights your design system because it was never asked to use it, and it carries a risk profile nobody signed up for — visual similarity to a competitor is exactly the kind of thing that surfaces late, in a legal review or a stakeholder meeting, when the prototype has already gathered momentum.

The borrow column costs one more sentence and changes everything downstream. Naming the trait — table density, row rhythm, the way filters collapse — gives the agent something it can implement in your components. The output reads as part of your product because the brief said to build it from your product's materials. And the named trait becomes reusable: once you have written match the density of reference A, ignore its brand once, you have a pattern you will use in every reference-driven brief from now on.

The last row is the one to linger on. Cloning feels efficient but teaches nothing; the design reasoning stays locked in the screenshot. Borrowing forces you to articulate why the reference works, and that articulation is the design work.

Narration for this slide

Here is the same screenshot used two different ways. On the left, the clone: make something like this. The agent imitates what it can see, which is the surface — colours, type, copy register. The result fights your design system, and it carries a risk nobody asked for, because looking too much like a competitor is a problem that surfaces late and expensively. On the right, the borrow: match its table density and row rhythm, ignore its brand, copy, and colour. One extra sentence. The output comes back in your components, with the reference's discipline. And you have learned something — you have named why the reference works, and that named trait is reusable. Cloning outsources the decision. Borrowing keeps it yours.

Slide 4 of 1316:9

Annotating a reference: match, ignore, never

An annotated reference is a screenshot plus three short lists. Without the lists, the agent decides what to copy.

  • Match this — the named traits: density, rhythm, hierarchy, a specific interaction pattern
  • Ignore that — brand, palette, copy, illustration style, anything owned by someone else
  • Never do this — the parts of the reference that are wrong for your product or users
  • One reference, one job — say what each screenshot is in the brief to demonstrate
Reference annotation, as it goes into the brief
Reference A — competitor's plan-comparison screen (screenshots/ref-a-pricing.png)
Match: the row density of the comparison table; limits visible on the cards
       before the table; one recommended plan with a stated reason.
Ignore: colours, type, brand, copy voice, the testimonial band.
Never: the dark-pattern countdown timer; hiding the cheapest plan behind a toggle.

The annotation is the design judgment. The screenshot is just the evidence attached to it.

Slide notes

This is the core technique of the module, and it is deliberately small: three labelled lists per reference, written in the brief next to a pointer to the image file. Match is the positive extraction — the traits from the previous slides, named specifically enough that the agent can act on them and a reviewer can check them. Ignore is the firewall: everything that belongs to the reference's owner or the reference's brand stays out. Never is the often-forgotten third list — the parts of the reference that are actively wrong for your product, like a dark pattern, an accessibility failure, or a density level your audience cannot handle. References are not endorsements; a competitor screen can be worth studying and still contain choices you refuse to repeat.

The one reference, one job line matters more than it looks. When a brief contains four screenshots with no roles assigned, the agent averages them, and the average of four references is usually the generic centre this whole course is trying to escape. Assigning each reference a job — this one is for the table rhythm, this one is for the empty state, this one is what not to do — keeps the extraction sharp.

The code excerpt is the format to copy. It lives in the brief or in the briefing packet beside the screenshots folder, so the annotation and the evidence travel together into the agent's context.

Narration for this slide

Here is the technique that does most of the work in this module. For every reference, write three short lists. Match this: the traits you actually want — the row density, the way limits sit on the cards, the recommended-plan pattern. Ignore that: colours, type, brand, copy — anything owned by someone else. And never do this: the parts of the reference that are wrong for your product, like that countdown timer or the hidden cheapest plan. References are not endorsements; you can learn from a screen and still refuse half of it. Give each reference one job, write the three lists into the brief next to the file path, and the screenshot stops being decoration and becomes evidence.

Slide 5 of 1316:9

Research findings as briefing input

References tell you what could work. Research tells you what your users actually struggled with. The brief needs both.

  • State findings as observations, not solutions — "users could not find export" rather than "add a bigger export button"
  • Separate what the evidence says from what you infer from it
  • Attach a confidence level: tested and observed, reported second-hand, or hunch
  • Turn the strongest findings into review criteria the prototype must answer
  • Keep the findings short — three to five lines in the brief, with the full notes in the packet

A finding earns its place in the brief when it would change what the agent builds or how the prototype is judged.

Slide notes

Research findings and references answer different questions, and the brief needs the distinction kept clean. A reference says this behaviour works somewhere; a finding says this is what our users actually did, said, or failed to do. When the two conflict, the finding usually wins, because it is about your users rather than someone else's.

The discipline borrowed from the school's research-packet practice is the separation of observation from inference, with a confidence level attached. A finding written as users could not find the export action is an observation; add a bigger export button is an inference, and a premature one — it pre-empts the design exploration the prototype exists to do. Confidence matters because findings arrive in very different grades: something you watched five users fail at is not the same as something one stakeholder remembers a customer saying. Marking the difference stops a hunch from quietly hardening into a constraint.

The most useful destination for a strong finding is the review criteria. If the research says users could not tell which plan fits a small team, then a participant in a corridor test can name the right plan within thirty seconds becomes a pass/fail question for the prototype. That closes the loop: the evidence that motivated the work is also the standard the work is judged against. Keep the brief's findings section short and point at the fuller notes in the briefing packet rather than pasting transcripts.

Narration for this slide

References tell you what could work. Research tells you what your users actually struggled with — and the brief needs both. Three habits keep research useful here. First, state findings as observations, not solutions: users could not find export, not add a bigger export button — the solution is what the prototype is for. Second, mark your confidence: something you watched five users fail at is not the same as something one stakeholder half-remembers. Third, turn the strongest findings into review criteria. If users could not tell which plan fits a small team, then the prototype passes when a test participant names the right plan in thirty seconds. The evidence that motivated the work becomes the standard it is judged against.

Slide 6 of 1316:9

From reference to brief

Four kinds of input, what gets extracted from each, and where the extraction lands in the brief.

Diagram with four rows showing how inputs become brief content. A reference screenshot yields density, spacing rhythm, and hierarchy, landing in the brief's design direction. A competitor flow yields step order, navigation pattern, and the states it handles, landing in the user job and states. Research findings yield what users struggled with, stated as observations with confidence levels, landing in the user job and review criteria. Product context yields components, tokens, constraints, and real content, landing in the constraints and content lines. A bottom strip lists what is never extracted: brand, copy, colour palette, logos, and proprietary illustrations — the clone trap.
Each input contributes a different kind of evidence, and each lands in a different line of the brief. The bottom strip is the firewall: brand, copy, palette, and proprietary content are never extracted.

Nothing flows from a reference into the brief without passing through an extraction you wrote down. That step is where the design judgment lives.

Slide notes

Walk the diagram row by row. Reference screenshots contribute density, rhythm, and hierarchy, and those land in the design direction line of the brief — phrased as match and ignore instructions, the way the annotation slide showed. Competitor flows contribute step order, navigation pattern, and state coverage, which land in the user job and the states list; the gaps in a competitor's flow are often as informative as its strengths, because a state the competitor skipped is a place your prototype can be better. Research findings contribute what users actually struggled with, stated as observations with confidence levels, and they land in the user job and the review criteria. Product context — your components, tokens, constraints, and real content — lands in the constraints and content lines, including the explicit ban on inventing facts.

The vertical reading matters as much as the horizontal one: nothing crosses from the left column to the right column directly. Every arrow passes through the middle — the extraction you wrote down. That middle column is the design work of this module, and it is the part teams skip when they paste screenshots straight into the prompt.

The bottom strip is the firewall and it is deliberately blunt. Brand, copy, palette, logos, and proprietary content do not get extracted, full stop. Putting it on the diagram makes it a shared rule rather than an individual judgment call.

Narration for this slide

Here is the whole module in one picture. Four kinds of input on the left: reference screenshots, competitor flows, research findings, and your own product context. From each one you extract something different — density and rhythm from screenshots, step order and state coverage from flows, what users struggled with from research, and components, constraints, and real content from your product. And each extraction lands in a specific line of the brief: design direction, user job and states, review criteria, constraints and content. Notice that nothing crosses this diagram without passing through the middle column — the extraction you wrote down. That is the design judgment. And the strip at the bottom is the firewall: brand, copy, palette, and proprietary content never make the trip.

Slide 7 of 1316:9

Writing the brief: the same canvas, fed by references

The briefing canvas does not change for reference-driven work. What changes is that most lines now arrive pre-filled by the extraction.

  • Situation and user job — sharpened by the research findings and the competitor flow
  • Design direction — the match / ignore / never annotations, one block per reference
  • Constraints and content — your components, tokens, and a content packet with a ban on inventing more
  • Output shape — a prototype in a scratch folder, at the fidelity you scoped in Module 1
  • Review criteria — executable checks plus the pass/fail questions drawn from the research

End the brief the same way every time: plan first, restate the job, name the gaps, do not build until the plan is approved.

Slide notes

This slide connects the module to the briefing canvas the school teaches everywhere else: situation, user job, audience, direction, constraints, output shape, review criteria, with a plan-first instruction at the end. Nothing about that structure changes when the work is reference-driven. What changes is where the content of each line comes from — and after the extraction work of the previous slides, most lines are already written. The findings sharpen the user job. The annotations are the design direction. The product context fills the constraints, including the real content the agent must use and the explicit instruction not to invent more — the prompting article's pricing-page run is the cautionary tale here, where a structurally excellent page shipped with confidently fabricated prices because no real content was supplied.

Remind people what does not belong in the brief: anything the harness already enforces. Tokens, the component inventory, banned patterns, and verification commands live in DESIGN.md, AGENTS.md or CLAUDE.md, and skills; the brief points at them rather than restating them. Reference-driven briefs are particularly prone to bloat because the screenshots invite description — keep the annotations tight and let the images carry the detail.

The closing line is non-negotiable for prototype work of any real size: plan first, in a plan mode where the platform supports one, so the extraction can be checked against the plan before anything is built. The plan is where you find out whether the agent read match the density as clone the layout.

Narration for this slide

Now the brief itself — and the good news is that it is the same canvas you already know: situation, user job, direction, constraints, output shape, review criteria. What changes is that the extraction work has pre-filled most of it. The research findings sharpen the user job. The match, ignore, never annotations are the design direction. Your product context fills the constraints — including the real content, with a ban on inventing more, because a beautifully structured prototype full of fabricated facts is worse than useless in a stakeholder review. Keep harness material out of the brief; point at it instead. And end the same way every time: plan first, restate the job, name the gaps, do not build until I approve. The plan is where you catch the agent reading match the density as clone the layout.

Slide 8 of 1316:9

The clone trap

Three separate reasons not to clone, and any one of them is sufficient on its own.

  • Legal and ethical — distinctive layouts, icons, illustrations, and copy are someone's property; close imitation invites trade-dress and brand-confusion problems no prototype is worth
  • Taste — a clone inherits the reference's compromises and caps your ceiling at someone else's decisions
  • Product fit — the reference was designed for different users, content, and constraints; the closer the copy, the worse it serves yours
  • Process — cloning skips the extraction step, so the team learns nothing it can reuse
  • If a stakeholder asks for "exactly like theirs", the answer is an annotated reference, not a refusal

The fix for the clone trap is not avoiding references. It is the one paragraph of annotation that turns a screenshot into named, borrowable behaviour.

Slide notes

Take the three main reasons in order, and keep the legal one conservative: this course is not legal advice, and the point is not that every resemblance is actionable. The point is that distinctive visual identity — layouts that function as brand, icon sets, illustration styles, copy — is the kind of thing trade-dress and unfair-competition arguments are made of, and a prototype that drifts into that territory creates a conversation with legal that arrives at the worst possible time, after stakeholders have seen and liked the work. Agents make this easier to stumble into, not harder, because they are very good at faithful imitation when that is what the prompt implies.

The taste argument is the one designers feel most directly: a clone inherits every compromise the reference made for reasons you do not know, and it caps your quality at the level of someone else's decisions. The product-fit argument is the practical one — the reference was built for different users, different content lengths, different constraints, and the closer you copy it the more of that mismatch you import. The process point rounds it out: cloning skips the articulation step, so nothing is learned and the next brief starts from zero again.

The last bullet is the one people will actually use. Stakeholders ask for exactly like theirs constantly, and a flat refusal reads as preciousness. The productive answer is to do the extraction with them: what specifically about theirs do you want — the density, the flow, the way it explains the plans? That conversation produces the annotation, and the annotation produces a better prototype than the clone would have been.

Narration for this slide

Why not just clone the reference, if it works? Three reasons, and any one of them is enough. Legal and ethical: distinctive layouts, icons, illustrations, and copy are someone's property, and close imitation buys you a trade-dress conversation no prototype is worth — agents make this easier to stumble into because they imitate faithfully when that is what the prompt implies. Taste: a clone inherits the reference's compromises and caps your ceiling at someone else's decisions. And product fit: that screen was designed for their users and their content, not yours. When a stakeholder asks for exactly like theirs, do not refuse — extract. Ask what specifically they want from it. That conversation produces the annotation, and the annotation produces something better than the clone.

Slide 9 of 1316:9

Multiple references with conflicting lessons

Real briefs cite more than one reference, and the references rarely agree. The brief has to resolve the conflict, not forward it.

  • Assign each reference a role: density from A, flow from B, the empty state from C
  • When two references conflict on the same trait, decide — do not ask the agent to average them
  • Use your research to break ties: the users' struggle outranks either reference
  • Name one reference as the anti-pattern when that is its real job
  • Two or three annotated references beat six unannotated ones

An unresolved conflict between references becomes the agent's decision by default — and the agent resolves it with the most generic answer available.

Slide notes

The single-reference case is the easy one. In practice a prototype brief draws on several: a competitor's flow, a different product's table density, an internal screen whose empty state the team likes. The failure mode is handing all of them to the agent with equal weight and no roles. The agent cannot tell which trait you wanted from which source, so it averages, and averaged references converge on the generic centre — which is the gravitational pull Module 3 deals with at the level of whole design directions.

The technique is role assignment plus explicit tie-breaking. Each reference gets one job, written in its annotation. When two references genuinely conflict on the same trait — one argues for a dense comparison table, the other for one-plan-at-a-time progressive disclosure — that is a design decision, and it belongs to you, made before the brief is sent. Research is the natural tie-breaker: if your findings say users were overwhelmed by the existing table, the disclosure pattern wins regardless of how much you admire the dense reference. If you genuinely cannot decide, that is a signal the question deserves the multi-direction treatment in the next module rather than a coin flip inside one brief.

The count matters too. Two or three references, each annotated and each with a role, give the agent more usable signal than six raw screenshots. Past three or four, you are usually decorating the brief rather than informing it.

Narration for this slide

Real briefs almost always cite more than one reference, and the references rarely agree. One argues for a dense comparison table; another argues for showing one plan at a time. The mistake is forwarding that conflict to the agent — give it both screenshots with equal weight and it will average them, and the average of two good references is usually generic. So assign each reference a role: density from this one, flow from that one, the empty state from the third. When two references conflict on the same trait, decide before the brief goes out, and use your research to break the tie — what your users struggled with outranks either screenshot. And keep the count low. Two or three annotated references beat six raw ones every time.

Slide 10 of 1316:9

Worked example: a competitor flow becomes a brief and a prototype

A plan-selection flow for a design workflow product, briefed from a competitor walkthrough, two screenshots, and three research findings.

StageWhat was producedWhat it changed downstream
Walk the competitor flowFive-step trace: entry, compare, recommend, confirm, upgrade — with two states it never handledThe two missing states became part of the user job, not an afterthought
Annotate the referencesMatch: card-plus-table comparison, limits on the cards. Ignore: brand, copy. Never: the countdown timerThe plan reused the school's own components; no competitor styling appeared
Fold in the researchThree findings — trial users could not name the right plan for a small team (observed, high confidence)Became the headline review criterion for the corridor test
Write the brief, plan firstOne-page brief, two annotated screenshots, content packet with real plans and limitsThe plan flagged that mobile comparison was unspecified — fixed before any code
Build and checkPrototype in a scratch folder, two iterations, token audit and type check cleanFirst corridor test ran the day after the brief was written

The competitor flow contributed structure and gaps. The research contributed the pass/fail question. Neither contributed a single pixel of the surface.

Slide notes

This worked example composes runs the school has published rather than inventing a new benchmark, so present it as an illustrative trace, not a controlled study. The shape of the task mirrors the pricing-page case study from the prompting article and the briefing canvas from the briefing article: a plan-selection surface for a design workflow product, where the real problem was that trial users could not tell which plan fitted them.

Walk the rows as a sequence. The competitor walkthrough is done by hand, screen by screen, and produces two kinds of value: the step structure worth borrowing and the states the competitor never handled — in this trace, what happens when a team is mid-trial and what the downgrade path looks like. The annotation row shows the match, ignore, never lists doing their job: the comparison pattern transferred, the brand did not, and the dark-pattern countdown was explicitly banned. The research row shows a finding with stated confidence becoming the headline review criterion. The brief row shows the plan-first instruction earning its keep — the plan surfaced that mobile comparison behaviour was unspecified, the same residual gap the published pricing run hit, and fixing it cost a sentence instead of a revision round. The build row is deliberately boring: two iterations, clean checks, and a corridor test the next day, which is the speed the prototype-first module promised when the inputs are prepared properly.

The highlight line is the summary of the whole module: structure and gaps from the flow, the pass/fail question from the research, and not one pixel of the competitor's surface in the output.

Narration for this slide

Let's trace one end to end. The task: a plan-selection flow for a design workflow product, where trial users could not tell which plan fitted a small team. First, walk the competitor's flow by hand — five steps, and two states it never handles, which immediately become part of our user job. Second, annotate the references: match the card-plus-table comparison, ignore the brand, never use the countdown timer. Third, fold in the research — the observed finding becomes the headline review criterion: can a test participant name the right plan in thirty seconds. Fourth, write the brief and ask for a plan first; the plan catches that mobile comparison was unspecified, before any code exists. Then build: two iterations, clean checks, corridor test the next day. The competitor contributed structure and gaps. The research contributed the question. Neither contributed a pixel.

Slide 11 of 1316:9

Where reference-driven briefs still go wrong

The technique has limits, and most of them show up after the brief looks finished.

  • The annotation names a trait the agent cannot see in the screenshot — pair the claim with a crop or a measurement
  • Reference admiration overrides research — the evidence about your users wins
  • The brief borrows structure but forgets content, and the agent invents plausible facts
  • Whatever no input covered, the agent resolves with a default — mobile behaviour is the usual casualty
  • Annotation is not a review: the plan and the rendered prototype still need checking against the brief

A reference-driven brief is only as good as the decisions it records. The inputs inform those decisions; they do not make them.

Slide notes

Close the teaching section with the honest limits, because the failure modes here are subtle — they produce briefs that look thorough and still miss. The first is annotation that asserts what the evidence does not show: writing match its generous spacing when the screenshot is too small or too cropped for the agent to read the spacing at all. If the trait matters, give the agent a usable crop or state the measurement directly.

The second is the seductive reference: a beautiful screen from a product whose users, content, and constraints have nothing in common with yours. Admiration is not evidence. When the reference and the research point in different directions, the research wins, and the published pricing-page run is the standing reminder of the third failure — structure without content. The specific prompt produced an excellent layout with confidently invented prices; only the real content packet fixed it. Reference-driven briefs are especially prone to this because the references supply so much structural confidence that the missing facts go unnoticed.

The fourth limit is the residual-gap rule from the prompting article: whatever no input covered, the agent resolves with the most common default, and for reference-driven work that is almost always responsive behaviour, because screenshots are taken at one width. The fifth is the reminder that nothing in this module replaces the gates: the plan still gets reviewed against the brief, and the rendered prototype still gets looked at — at mobile width, with real content lengths — before anyone calls it done.

Narration for this slide

Before the exercise, the honest limits. Annotations can claim things the agent cannot actually see — if the spacing matters, give it a crop or a number, not just an adjective. Admiration can override evidence — when a beautiful reference and your own research disagree, the research wins, because it is about your users. Structure can arrive without content — references give you so much layout confidence that nobody notices the prices are invented, until a stakeholder does. Whatever no input covered, the agent fills with a default, and the usual casualty is mobile behaviour, because screenshots come at one width. And none of this replaces the gates: the plan still gets reviewed against the brief, and the rendered prototype still gets looked at properly. The inputs inform your decisions. They do not make them.

Slide 12 of 1316:9

Exercise: annotate one reference for a current project

Take one reference you are already carrying around — a screenshot, a competitor flow, a screen someone sent you — and turn it into briefing input.

  • Name the prototype it would feed and the user job it relates to
  • Write the three lists: match this, ignore that, never do this — specific enough that a colleague could check the output against them
  • Add one research finding, stated as an observation with a confidence level, and turn it into a pass/fail review question
  • Note the content the agent must use and must not invent
  • Mark one trait you want that the screenshot alone cannot show — and how you would supply it

Keep the annotation. In Module 3 you will write three direction briefs for the same task, and this reference will feed at least one of them.

Slide notes

The exercise is paper-only and should take ten to fifteen minutes. The constraint that matters is using a real reference the participant already has — the screenshot saved on their desktop, the competitor link in the team channel — because the difficulty of the exercise is articulation, and articulation is only hard when the material is real.

The second bullet carries the standard: the lists are specific enough when a colleague could look at a generated prototype and say whether it matched, ignored, and avoided what the annotation said. Match its clean layout fails that test; match the two-column settings pattern with the section nav on the left passes it. The research bullet deliberately reuses the observation-plus-confidence format from earlier in the module, and the review-question conversion is the part most participants find newly useful.

The last bullet pre-empts the most common annotation failure — claiming traits the screenshot cannot communicate — and asks for the fix: a crop, a measurement, a second reference, or a sentence of description. If running this live, have two or three people read their never list aloud; it is the list people skip when working alone and the one that generates the best discussion about taste and product fit. The page carries forward: Module 3's multi-direction exercise asks for three direction briefs on one task, and a well-annotated reference is ready-made input for at least one of them.

Narration for this slide

Your turn. Take one reference you already have — the screenshot on your desktop, the competitor flow your product manager keeps mentioning. Name the prototype it would feed and the user job it relates to. Then write the three lists: match this, ignore that, never do this — specific enough that a colleague could check an output against them. Add one research finding, stated as an observation with a confidence level, and turn it into a pass/fail question. Note the real content the agent must use and must not invent. And mark one trait you want that the screenshot cannot show on its own, plus how you would supply it. Keep the page — Module 3 will use it.

Slide 13 of 1316:9

Summary, and what comes next

  • References carry behaviour worth borrowing — density, rhythm, hierarchy, flows, states — not surfaces worth cloning
  • Every reference gets an annotation: match this, ignore that, never do this, with one job per reference
  • Research findings join the brief as observations with confidence levels, and the strongest become review criteria
  • The brief stays the same canvas — the extraction pre-fills it, real content stops invention, and the plan-first gate stays
  • The clone trap has legal, taste, and product-fit costs; the fix is extraction, not avoiding references

Module 3 takes the brief you can now write and multiplies it: genuinely different directions explored in parallel, without collapsing into five versions of the same idea.

Slide notes

Recap by walking the pipeline rather than the bullet list: inputs, extraction, brief, gates. The inputs are screenshots, competitor flows, research, and product context. The extraction is the written step in the middle — the annotations and the findings — and it is where the design judgment of this module lives. The brief is the same canvas the school teaches everywhere, pre-filled by the extraction and closed with the plan-first instruction. The gates do not change: the plan gets reviewed against the brief, the prototype gets checked against the criteria, and the criteria themselves now trace back to evidence about real users.

Preview Module 3 concretely. A single good brief produces a single direction, and the gravitational pull of the common pattern means even good briefs drift toward the generic centre. The next module is about deliberately writing several briefs for the same task — different information densities, navigation models, and tones — running them in parallel, and converging through a comparison board with named criteria rather than whichever version looked best in the standup. The annotated reference from this module's exercise and the research findings format both carry straight into that work.

If participants kept their Module 1 fidelity-scoping page, point out that the two pages now pair: one says how real the prototype needs to be, the other says what evidence is shaping it.

Narration for this slide

Let's close. References carry behaviour — density, rhythm, hierarchy, flows, states — and that behaviour is worth borrowing; the surface is not. Every reference gets its annotation: match this, ignore that, never do this, one job per screenshot. Research joins the brief as observations with confidence levels, and the strongest findings become the questions the prototype is judged by. The brief itself is the same canvas you already use — the extraction just pre-fills it, real content stops the agent inventing facts, and the plan-first gate stays exactly where it was. Next module, we multiply this. One brief gives you one direction; Module 3 is about exploring genuinely different directions in parallel, and converging on evidence rather than on whichever one looked best in the standup. See you there.

Module transcript
Module 2, narrated slide by slide

Slide 1From Reference to Brief

Welcome back. Module 1 was about scoping the prototype; this module is about feeding it. Almost no prototype starts from nothing — you start with references. A screenshot you admire, a competitor flow that works, research notes about what users struggled with, and your own product's constraints. The question is how all of that becomes a brief an agent can execute. The lazy answer is to paste the screenshot in and say make something like this — and we will spend some time on why that fails. The better answer is to extract behaviour from each input, annotate it, and put it where it belongs in the brief. That is what this module teaches.

Slide 2What references carry: behaviour, not pixels

Start with what a reference is actually good for. A reference is evidence that some design behaviour works — and behaviour is the part you can borrow honestly. Density: how much information the screen carries before it feels crowded. Rhythm: the spacing and grouping that make that density readable. Hierarchy: what you read first, second, third. Flow patterns: the order of steps and how people move between them. And states — how a mature product handles empty, error, and loading, which is years of someone else's support tickets distilled into screens. What a reference also carries is brand, copy, and palette. That part does not transfer. And if you do not name the trait you want, the agent will default to imitating the most visible thing — which is usually the palette.

Slide 3Cloning a surface vs borrowing behaviour

Here is the same screenshot used two different ways. On the left, the clone: make something like this. The agent imitates what it can see, which is the surface — colours, type, copy register. The result fights your design system, and it carries a risk nobody asked for, because looking too much like a competitor is a problem that surfaces late and expensively. On the right, the borrow: match its table density and row rhythm, ignore its brand, copy, and colour. One extra sentence. The output comes back in your components, with the reference's discipline. And you have learned something — you have named why the reference works, and that named trait is reusable. Cloning outsources the decision. Borrowing keeps it yours.

Slide 4Annotating a reference: match, ignore, never

Here is the technique that does most of the work in this module. For every reference, write three short lists. Match this: the traits you actually want — the row density, the way limits sit on the cards, the recommended-plan pattern. Ignore that: colours, type, brand, copy — anything owned by someone else. And never do this: the parts of the reference that are wrong for your product, like that countdown timer or the hidden cheapest plan. References are not endorsements; you can learn from a screen and still refuse half of it. Give each reference one job, write the three lists into the brief next to the file path, and the screenshot stops being decoration and becomes evidence.

Slide 5Research findings as briefing input

References tell you what could work. Research tells you what your users actually struggled with — and the brief needs both. Three habits keep research useful here. First, state findings as observations, not solutions: users could not find export, not add a bigger export button — the solution is what the prototype is for. Second, mark your confidence: something you watched five users fail at is not the same as something one stakeholder half-remembers. Third, turn the strongest findings into review criteria. If users could not tell which plan fits a small team, then the prototype passes when a test participant names the right plan in thirty seconds. The evidence that motivated the work becomes the standard it is judged against.

Slide 6From reference to brief

Here is the whole module in one picture. Four kinds of input on the left: reference screenshots, competitor flows, research findings, and your own product context. From each one you extract something different — density and rhythm from screenshots, step order and state coverage from flows, what users struggled with from research, and components, constraints, and real content from your product. And each extraction lands in a specific line of the brief: design direction, user job and states, review criteria, constraints and content. Notice that nothing crosses this diagram without passing through the middle column — the extraction you wrote down. That is the design judgment. And the strip at the bottom is the firewall: brand, copy, palette, and proprietary content never make the trip.

Slide 7Writing the brief: the same canvas, fed by references

Now the brief itself — and the good news is that it is the same canvas you already know: situation, user job, direction, constraints, output shape, review criteria. What changes is that the extraction work has pre-filled most of it. The research findings sharpen the user job. The match, ignore, never annotations are the design direction. Your product context fills the constraints — including the real content, with a ban on inventing more, because a beautifully structured prototype full of fabricated facts is worse than useless in a stakeholder review. Keep harness material out of the brief; point at it instead. And end the same way every time: plan first, restate the job, name the gaps, do not build until I approve. The plan is where you catch the agent reading match the density as clone the layout.

Slide 8The clone trap

Why not just clone the reference, if it works? Three reasons, and any one of them is enough. Legal and ethical: distinctive layouts, icons, illustrations, and copy are someone's property, and close imitation buys you a trade-dress conversation no prototype is worth — agents make this easier to stumble into because they imitate faithfully when that is what the prompt implies. Taste: a clone inherits the reference's compromises and caps your ceiling at someone else's decisions. And product fit: that screen was designed for their users and their content, not yours. When a stakeholder asks for exactly like theirs, do not refuse — extract. Ask what specifically they want from it. That conversation produces the annotation, and the annotation produces something better than the clone.

Slide 9Multiple references with conflicting lessons

Real briefs almost always cite more than one reference, and the references rarely agree. One argues for a dense comparison table; another argues for showing one plan at a time. The mistake is forwarding that conflict to the agent — give it both screenshots with equal weight and it will average them, and the average of two good references is usually generic. So assign each reference a role: density from this one, flow from that one, the empty state from the third. When two references conflict on the same trait, decide before the brief goes out, and use your research to break the tie — what your users struggled with outranks either screenshot. And keep the count low. Two or three annotated references beat six raw ones every time.

Slide 10Worked example: a competitor flow becomes a brief and a prototype

Let's trace one end to end. The task: a plan-selection flow for a design workflow product, where trial users could not tell which plan fitted a small team. First, walk the competitor's flow by hand — five steps, and two states it never handles, which immediately become part of our user job. Second, annotate the references: match the card-plus-table comparison, ignore the brand, never use the countdown timer. Third, fold in the research — the observed finding becomes the headline review criterion: can a test participant name the right plan in thirty seconds. Fourth, write the brief and ask for a plan first; the plan catches that mobile comparison was unspecified, before any code exists. Then build: two iterations, clean checks, corridor test the next day. The competitor contributed structure and gaps. The research contributed the question. Neither contributed a pixel.

Slide 11Where reference-driven briefs still go wrong

Before the exercise, the honest limits. Annotations can claim things the agent cannot actually see — if the spacing matters, give it a crop or a number, not just an adjective. Admiration can override evidence — when a beautiful reference and your own research disagree, the research wins, because it is about your users. Structure can arrive without content — references give you so much layout confidence that nobody notices the prices are invented, until a stakeholder does. Whatever no input covered, the agent fills with a default, and the usual casualty is mobile behaviour, because screenshots come at one width. And none of this replaces the gates: the plan still gets reviewed against the brief, and the rendered prototype still gets looked at properly. The inputs inform your decisions. They do not make them.

Slide 12Exercise: annotate one reference for a current project

Your turn. Take one reference you already have — the screenshot on your desktop, the competitor flow your product manager keeps mentioning. Name the prototype it would feed and the user job it relates to. Then write the three lists: match this, ignore that, never do this — specific enough that a colleague could check an output against them. Add one research finding, stated as an observation with a confidence level, and turn it into a pass/fail question. Note the real content the agent must use and must not invent. And mark one trait you want that the screenshot cannot show on its own, plus how you would supply it. Keep the page — Module 3 will use it.

Slide 13Summary, and what comes next

Let's close. References carry behaviour — density, rhythm, hierarchy, flows, states — and that behaviour is worth borrowing; the surface is not. Every reference gets its annotation: match this, ignore that, never do this, one job per screenshot. Research joins the brief as observations with confidence levels, and the strongest findings become the questions the prototype is judged by. The brief itself is the same canvas you already use — the extraction just pre-fills it, real content stops the agent inventing facts, and the plan-first gate stays exactly where it was. Next module, we multiply this. One brief gives you one direction; Module 3 is about exploring genuinely different directions in parallel, and converging on evidence rather than on whichever one looked best in the standup. See you there.