Slide 1 — The Designer–Agent Loop
Welcome back. In Module 1 we drew the loop: brief, plan, review gate, generate, critique, revise, ship. In this module we slow it down to working speed. Not the diagram — the actual session. What you type, what comes back, how long each step takes, where you have to pay attention and where you can walk away. We will look at each step as a concrete activity, see how plan modes enforce the gates across the four platforms, trace one real session end to end with timestamps, and then look honestly at the three places the loop most often breaks. By the end, the loop should feel like a rhythm you could start tomorrow.
Slide 2 — The agent as a junior designer
Before we walk the steps, fix the mental model. The agent is best treated as a junior design partner: fast, literal, eager, and with no taste memory. Fast means a wrong run costs minutes, not days. Literal means it does what your words say, not what you meant — it cannot infer the politics or the polish level. No taste memory means yesterday's correction is gone unless it lives in a file. And eager means it would rather hand you something plausible than ask a clarifying question. You would never give a junior a one-line request and then judge them on the output. The agent needs the same working agreement, and the loop is that agreement.
Slide 3 — Step 1 — Brief: what the step produces
Step one: the brief. In session terms it is a short written artifact, not a long chat message — usually under a page, often seven lines. It does two jobs. It gives the agent the facts it needs to work inside your project: the page, the user job, the audience, the components and tokens to reuse. And it gives the agent the standards to judge its own output: the direction, the constraints, and the review criteria you will hold it to. It does not restate your design system — that lives in the harness. And it ends with one instruction that changes everything downstream: plan first, restate the task, do not build until I approve.
Slide 4 — The plan gate: how the four platforms enforce it
The plan gate used to be a habit you had to remember. Now it is a feature. Claude Code has plan mode as a read-only permission mode. Codex CLI has plan mode plus the PLANS.md pattern, where the plan lives as a file beside the work. OpenCode gives you a separate read-only Plan agent. Gemini CLI's Plan Mode keeps the session read-only and asks for explicit approval. In every case, the platform — not your prompt — stops the agent from building until you approve. What you review is the agent's judgment: did it restate the right job, the right structure, the right components? One caution: an approved plan does not guarantee the artifact. That is what the second gate is for.
Slide 5 — Step 2 — Generate: scoping the run
Step two: generate. This is the part the agent does mostly on its own, so your work happens before it starts. Two decisions matter. First, scope: one run, one bounded outcome — a section, a component, an audit. If you cannot say in one sentence what the run should produce, split it. Second, where the output lands: a scratch branch or a scratch folder, never directly on main. That is what lets you critique honestly and throw away a bad run without ceremony. The brief already named the output shape and the components to reuse, so while the agent works, you do not need to watch every keystroke. You need to be ready to look hard at what comes back.
Slide 6 — Step 3 — Critique: criteria, checks, and judgment
Step three: critique, and it is not a vibe check. You compare the artifact against the criteria the brief wrote down before anything existed. Run the executable checks first — the token audit, the type check, the verify script — because they are free and they catch real problems. Then look at screenshot evidence: desktop, mobile, real content lengths. Then apply the judgment no check covers: hierarchy, tone, whether the thing serves the user job. Two rules keep this useful. Every finding gets a severity — blocker, important, polish, or question. And critique stays read-only: findings first, fixes only after you approve them. The agent inspects. You judge.
Slide 7 — Step 4 — Revise: what the agent can act on
Step four: revise. The quality of this step is set entirely by the quality of your feedback. Look at the two columns. On the left, feedback the agent can act on: move this above that, replace these hex values with tokens, use the prop that actually exists. On the right, feedback that hides an unmade decision: it doesn't feel trustworthy, maybe rethink the flow, is this on-brand? The agent cannot make those decisions — if you hand them over raw, it will guess. So translate: figure out why it does not feel trustworthy, then hand over the specific change. Keep the revision scoped to approved findings only. And when you notice the same fix recurring, that is a rule asking to live in the harness.
Slide 8 — Step 5 — Ship: the decision and the record
Step five: ship. The checks passing gets you to the decision; it is not the decision. You look at the artifact, the evidence, and the brief, and you choose: ship it, hold it, or send it back for one more bounded round. Then two minutes of recording. Write a short note beside the work — what shipped, what was cut, why. And promote the recurring feedback into the harness: the anti-pattern you rejected for the third time, the component the agent keeps forgetting. That is the dashed line on the loop diagram, and it is why next month's first drafts are better than this month's. The loop ends when the learning is written down, not when the code merges.
Slide 9 — Show early, iterate small
Here is the habit that keeps the loop healthy: show early, iterate small. Long unsupervised runs drift — a chain of small, individually reasonable judgment calls that adds up to something that is not what you meant. And the bottleneck is not the agent's speed, it is your review capacity. A forty-file change is not four times harder to review than a ten-file change; it is effectively unreviewable, and unreviewable work either gets rubber-stamped or thrown away. So size the run to what you can honestly look at. Ask for the riskiest part first, correct it, and let the rest follow the corrected pattern. The agent does not mind. Your taste stays in the work.
Slide 10 — Where the loop breaks
Three failures account for most bad sessions. First, vague briefs: the agent fills the gaps with generic patterns, and you spend the session re-supplying context one correction at a time. The fix is writing the user job and review criteria before you prompt. Second, skipped gates: fast, plausible output goes straight to stakeholders, and the problems surface where they are most expensive. The fix is structural — plan mode and a scratch branch enforce the gates even when you are busy. Third, feedback that never reaches the harness: the same correction, every session, because the agent has no memory of yesterday. The fix is promoting recurring fixes into rules at the ship step. The tell for all three is the same: you are repeating yourself.
Slide 11 — Worked trace: one loop session, end to end
Let's trace one real-shaped session, the field-notes task from Module 1, with timestamps. Ten minutes writing the brief — most of it deciding what the section is for. Four minutes reviewing the plan, where the agent flagged a contradiction and one sentence resolved it. Seven minutes of generation on a scratch folder while you do something else. Five minutes of screenshot critique, where the audit and the type check ran and the type check caught an invented prop the plan never could have. Three minutes of scoped revision, then the ship decision and a note for the harness. About thirty minutes total — and the agent worked for ten of them. The rest was design thinking, review, and a decision. That was always the job.
Slide 12 — Exercise: run your task through a plan-only session
Now you run the loop — but only the front half. Take the brief you sketched in Module 1, open your platform's plan mode, and paste it in. Ask for a plan, a restatement of the user job, and a list of assumptions, contradictions, and underspecified states. The agent stays read-only the whole time, so nothing can go wrong that costs more than a few minutes. Then judge the plan: what would you approve, what would you correct, what sends the brief back for revision? Write down one thing the plan got wrong because of your brief, and one thing it got wrong on its own. Keep both — Module 3 fixes the first kind, and the gates handle the second.
Slide 13 — Summary, and what comes next
Let's close. The loop is a rhythm, not a ceremony — a bounded session of twenty to forty minutes where the agent runs production and you hold the gates. Treat the agent as a junior partner: fast, literal, eager, and with no taste memory, which is why the brief and the harness matter so much. Let plan mode and scratch branches enforce the gates so they survive your busiest week. Critique against criteria you wrote before generation, revise only on approved findings, and write down what the loop learned before you call it shipped. The loop breaks at vague briefs, skipped gates, and feedback that never reaches the harness. Module 3 attacks the first of those head on: how to write a brief that carries real design intent. See you there.