Slide 1 — Orchestration Patterns
Welcome back. In Module 1 you decided whether a task genuinely needs more than one agent. This module is about what happens after you say yes. We are going to cover the three patterns that handle most real cases — fan-out for parallel independent work, pipelines for staged work with handoffs, and critic pairs where one agent produces and another reviews against written criteria. We will define the orchestrator role properly, write worker prompts and merge rules before anything runs, look hard at the failure modes, and trace a real four-worker run end to end, with its costs. Let's start with fan-out.
Slide 2 — Fan-out: parallel zones, one merge point
The first pattern is fan-out. You split the work into zones that genuinely do not depend on each other, give each zone to one worker with one owned output, and bring everything back through a single merge review. The clearest production example is a design-system audit: one agent per component folder, sixty-odd agents in parallel, findings collected into one drift report in about an hour. Fan-out is the right shape when the bottleneck is reading volume rather than judgment. The price is the seams — the places where two zones meet, where spacing drifts and the same signal gets expressed two different ways. The merge review exists to catch exactly that, which is why you write its rules before anyone starts.
Slide 3 — Pipelines: staged work with gates between stages
The second pattern is the pipeline. Instead of splitting by place, you split by stage: a spec stage, a build stage, a verify stage, each handing its output to the next. Pipelines barely parallelise — stages wait on each other — but they merge almost for free, because there is nothing to reconcile. What you are really buying is checkpoints: every handoff is a gate with an owner. The production evidence is telling. A thirteen-role book pipeline we trace in the articles had its review loop protect the drafting stages from day one — and nearly every fix landed at the export boundary instead. Gates belong at the handoffs. The failure mode is propagation: a weak early stage feeds everything after it, and a gate that rubber-stamps is not a gate.
Slide 4 — Critic pairs: a producer and an independent reviewer
The third pattern is the critic pair. One agent produces; a second agent reviews — and the reviewer is read-only. It returns findings, each tied to a criterion you wrote down before the first draft existed, with severity and evidence, and it is forbidden from fixing anything itself. The producer revises, and the loop runs until the criteria pass or you hit the iteration cap. In the book pipeline this is literally configuration: a pass score of seven, a maximum of three revision rounds. In the five-surface run we trace later, the read-only QA worker caught both conflicts and the gaps nobody owned. The rule that keeps the pattern honest: the critic never grabs the pen, and its pass is evidence for your decision — not the decision itself.
Slide 5 — The three patterns, side by side
Here are the three patterns side by side. On the left, fan-out: one orchestrator brief, parallel workers each owning a zone, and a single merge review where the real decisions happen. In the middle, the pipeline: spec feeds build feeds verify, with a gate at every handoff — slow on purpose, because the checkpoints are the value. On the right, the critic pair: a producer and a read-only critic looping against written criteria, with an iteration cap, ending at a human gate. Notice each panel carries its own failure mode — seams and lazy merges, propagation and rubber-stamp gates, vague criteria and critics that start editing. And notice you can mix them. The worked example later is fan-out plus a critic layered on top.
Slide 6 — Choosing the pattern: what each one buys and costs
Here is the choice in one table. Read the first row and most decisions make themselves: if the boundaries of your task are places — surfaces, components, regions — fan out. If they are stages, where one thing must exist before the next can start, build a pipeline. If there is one artifact and the question is whether it meets the bar, run a critic pair. The row people underestimate is merge cost. Fan-out's merge is a genuine design review and it takes real time. The pipeline barely merges but barely parallelises. The critic pair never merges but pays in rounds, which is why you cap them. And if your task does not fit any row cleanly, that is the signal to go back to Module 1 and not split at all.
Slide 7 — The orchestrator role: a contract, not a contributor
Every pattern needs an orchestrator, and the role is easy to get wrong. The orchestrator — whether that is you or a lead agent — owns four things. The brief, which states the user job once and gives each worker a one-sentence boundary. The named overlaps, so the merge expects its conflicts instead of being surprised by them. The merge log, where decisions and rejected options get written down. And the stop conditions — the gates, the iteration caps, and the point where you fall back to a single agent. What the orchestrator does not do is design content. The documented failure mode, in the product docs and in our own traced run, is the lead starting to do a worker's job itself. The fix is structural: if the orchestrator is producing anything beyond the contract and the log, your decomposition is being bypassed.
Slide 8 — Worker prompts: one job, one boundary, one reviewable output
Here is a real worker brief from the traced run, and notice how much of it is about what not to do. Scope with explicit exclusions. The exact inputs the worker may read. One task, one owned output file, and acceptance criteria the orchestrator can check in a couple of minutes. What is missing is design direction — on purpose. Taste lives in the design system and in one shared constraints file that every brief points at; the brief carries the boundary and the deliverable. That keeps briefs cheap, and cheap briefs are the ones that actually get written. One honesty note: constraints reduce violations, they do not eliminate them. One worker in this run invented an off-token tint anyway. The acceptance check at the gate is what caught it.
Slide 9 — Merge rules and conflict handling, decided up front
Now the merge, and the rule that matters most: write the merge protocol before any worker runs, because gates invented after the outputs arrive bend around whatever came back. The protocol has an order. Audit ownership first — who stayed inside their boundary. Classify the conflicts: token semantics, seams, duplicated signals, unowned gaps. Resolve against the design system and the user job, never against whichever worker wrote the most or answered last. Record what you rejected and why. And log the gaps nobody owned as open questions instead of inventing answers on the spot. The cheapest way to fail here is merge by concatenation — accepting everything because nothing is individually wrong. That is how multi-agent work fails while looking like it succeeded.
Slide 10 — Failure modes: how orchestrated runs actually go wrong
Here is how these runs actually fail, and notice none of it is exotic. Two workers racing on the same file — prevented by one owned output each, and worktrees once specs become edits. Divergent assumptions — two workers answering the same question differently because nobody named the overlap. Silent scope growth — a worker wandering into someone else's surface. The lead doing the work itself instead of delegating, which is documented in the platform docs and happened in our own run. Merge by concatenation. And teams left running after the work is done, quietly burning tokens. Every one of these has a written counterpart you have already seen in this module: boundaries, named overlaps, exclusions, a contract-only orchestrator, a merge protocol, and a shutdown step.
Slide 11 — Worked example: a four-worker fan-out, traced end to end
Let's trace a real run. Five surfaces on this school's own site: four workers each owning one surface, a read-only QA critic across all of them, and an orchestrator that wrote the contract, the constraints, and the briefs, then ran the merge. The same brief was also run once with a single agent as a baseline. The baseline took about twenty-eight minutes and produced a usable spec. The orchestrated run took about seventy-seven minutes as executed — roughly forty-six if the workers had genuinely run in parallel, and that is an estimate — at three and a half to four times the tokens. So what did the tax buy? Two conflicts surfaced and resolved on the record before implementation, an independent QA pass, two gaps nobody owned logged for a human, and reusable briefs. Not speed. The honest conclusion: orchestration pays in decision quality and records first, and in speed only when the work truly runs in parallel.
Slide 12 — Exercise: choose and sketch a pattern for one large task
Your turn. Take one large design task — something that genuinely passed the Module 1 decision map — and design the run on paper. Name the pattern and say why: what kind of boundary does the task actually have — places, stages, or criteria? Write the orchestration brief with a one-sentence boundary per worker and the overlaps named. Draft one worker brief in full, exclusions included. Write the merge rules before you are tempted to skip them. And set the stop conditions, including the point where you would fall back to a single agent. Do not run it yet. Keep the pages — in Module 3, this exact sketch gets real tools wired into it.
Slide 13 — Summary, and the bridge to MCP chaining
Let's close. Three patterns cover most of what you will need: fan-out when the boundaries are places, pipelines when they are stages, critic pairs when the point is independent review — and mixing them is normal. The orchestrator owns the contract, the named overlaps, the merge log, and the stop conditions, and nothing else. Worker briefs are mostly exclusions, merge rules are written before anyone runs, and the traced run was honest about the bill: three and a half to four times the tokens, paid back in decision records and conflicts caught early rather than in speed. In Module 3 we put real tools inside these patterns — canvas, browser, tickets, and code connected over MCP — and the boundaries you sketched in this exercise become tool scopes. See you there.