Slide 1 — When One Agent Is Not Enough
Welcome to Orchestrating Design Agent Teams. Before we talk about patterns, pipelines, or shared canvases, we need to settle the question the whole course hinges on: when is one agent genuinely not enough? The honest answer is — less often than you would think. Multi-agent setups carry a real coordination tax, and most design tasks do not repay it. So this first module is a decision module. We will name the costs people underestimate, identify the signals that a task actually splits, and walk the single-agent alternatives you should exhaust first. If you finish this module deciding to keep most of your work with one agent, that is the module working.
Slide 2 — The coordination tax
Let's start with the costs, because they are the part people skip. When you split work across agents, you do not just multiply the output — you multiply the work around the output. You write briefs and constraints before anything starts. Every worker re-reads the design system and the project from scratch, which is where the token bill comes from. Workers that cannot see each other's decisions make conflicting assumptions — that is not a risk, it is the default. And at the end there is a merge review you cannot skip, plus debugging that now spans separate histories. Here is the uncomfortable part: that bill arrives whether or not the split helped. So before you pay it, you check the price list.
Slide 3 — Signals a task genuinely splits
So when does a task genuinely split? Not when it sounds big — when it has real boundaries. Independent zones: pages, bands, dashboard panels, canvas regions that do not share files. Independent stages: research feeding a spec, a spec feeding implementation, implementation feeding QA. Distinct expertise: a builder, plus a read-only reviewer that checks the work without touching it. The test that settles most cases is simple: can you state each worker's ownership in one sentence, including what it must not touch? If you can, delegation is safe. If you cannot, the merge will eat whatever time the split saved. Boundary quality is the whole decision.
Slide 4 — Signals it does not split
Now the other side — the signals that a task should stay together, even when more agents feel tempting. Tightly coupled taste work is the big one: a polish pass where spacing, tone, and hierarchy are tuned against each other has no clean boundary to hand a worker, and a split version usually comes back less coherent than it started. One shared file or component means two writers on the same surface, and that merge becomes a rescue. If you cannot say who owns a decision, more agents just means more outputs nobody owns. Small tasks cannot repay the setup time. And stages that wait on each other pay the coordination cost without getting any parallel speed back.
Slide 5 — Single-agent alternatives to exhaust first
Before you split anything, exhaust the single-agent alternatives, because most of them fix the actual problem. If the output is inconsistent, that is harness debt — tighten the design rules and the anti-pattern list, and every future run improves. If the task feels too big, run it longer with a reviewed plan, or run it across sequenced sessions where each sitting picks up the last one's artifacts. If what you are missing is a second opinion, add a read-only critique pass against written criteria — that is a crit, not an orchestra. And remember the escape hatch: if the task later grows real boundaries, escalating is cheap, because everything you would need is just files.
Slide 6 — The decision map
Here is the whole module on one diagram. Three questions sit in the middle, and you ask them before any worker exists. Can you state each worker's ownership in one sentence? Do the slices avoid sharing files, components, or one taste loop? And is the task valuable enough to pay the coordination cost plus a merge review you cannot skip? Any no routes you left, to one well-harnessed agent — the default. All three yeses route you right, to a genuine split. Notice that both sides list costs. The single-agent path gives up parallelism and an independent reviewer. The multi-agent path pays in setup, tokens, and the merge. Most tasks fail at least one question, and that is the map doing its job.
Slide 7 — Honest cost accounting from real runs
Let's put numbers on the tax, from runs executed for this school's own articles. A two-worker spec run took about twenty-five minutes, nine of them writing briefs, and roughly doubled the text written — as a one-off, it did not pay for itself in speed. A five-surface orchestrated run took about seventy-seven minutes as executed, against a twenty-eight minute single-agent baseline on the same brief. So what did the extra cost buy? Not volume — the baseline covered everything. It bought two conflicts surfaced and resolved before implementation, an independent QA pass, and a written record of what was rejected and why. And the published figures point the same way: Anthropic reports multi-agent research systems using around fifteen times the tokens of a single chat. The tax is real. Pay it on purpose or not at all.
Slide 8 — What the platforms themselves recommend
Here is something worth noticing: the vendors selling multi-agent surfaces publish the same caution this module does. As of June 2026, Anthropic's own Agent Teams documentation says sequential and same-file work belongs in a single session or with subagents, recommends file-level ownership per teammate, and suggests three to five focused workers rather than a swarm. Codex, OpenCode, and OpenPencil all express the same operating model — bounded workers, owned outputs, and a merge. And the strongest published counter-position, from Cognition, argues that write-heavy coupled work does better with one agent holding the full context. The decision map you just saw is not contrarian. It is the consensus, written down for design work.
Slide 9 — Worked example: one task, evaluated both ways
Let's evaluate one real task both ways. The task: specify a field-notes section — a repeated card, plus the frame around it. Kept with one agent, it takes ten to fifteen minutes and comes back coherent, because one context decides everything, including how a recently reviewed note is marked. Split across two workers, it took about twenty-five minutes, twice the text, and an eight-minute merge — and the deliberate overlap produced a real conflict: a yellow badge from one worker, a green stripe from the other. The merge resolved it against the design system and wrote the reason down. Both evaluations are defensible. The split bought evidence and a decision record; the single run bought speed. The map tells you which trade this task deserves.
Slide 10 — If you do split: the artifacts that make it safe
If the map does route you to a split, the thing that makes it safe is a small set of plain files. An orchestration brief that states the user job once, gives each worker a one-sentence boundary, names the overlaps you expect, and writes the merge gates before any work exists. A shared constraints file every brief points at. One brief and one owned output file per worker. A merge log that records what was accepted, what conflicted, what was rejected and why, and what nobody owned. And a baseline, so you can tell afterwards whether the split bought anything. These files are portable across every platform — and if the split stops paying mid-run, they are exactly what you hand back to one agent to finish the job.
Slide 11 — Exercise: classify three of your current tasks
Time to apply the map to your own backlog. Pick three real tasks: one small, one medium, and one that feels too big for a single sitting. Run each through the three questions — boundaries in one sentence, no shared files or taste loop, worth the coordination cost. For the ones that fail, and most will, name the single-agent alternative you would try first: a better harness, a longer planned run, sequenced sessions, or a critique pass. For the one that passes, write the one-sentence ownership boundary for each worker, and name the overlap you expect to cause a conflict. Keep the page. In Module 2 we turn that passing task into a real decomposition.
Slide 12 — Summary, and the bridge to orchestration patterns
Let's close the module. More agents is a cost: setup, duplicated context, divergent assumptions, and a merge review you cannot skip — and the bill arrives whether or not the split helped. Tasks split when the boundaries are real: independent zones, stages, or expertise, each ownable in one sentence. They do not split for coupled taste, shared files, unclear ownership, or small tasks. Exhaust the single-agent alternatives first, and when you do split, expect to buy evidence and decision records rather than speed — at least at small scale. In Module 2 we assume the split passed the map, and cover the three patterns that carry most real cases: fan-out, pipelines, and critic pairs. See you there.