AAgentic Design School
Module 1 of 6
35–45 minutes

Agentic Design Research

Research Packets

The unit of agentic research: a packet that holds the question, the sources, the method, the findings, and the evidence trail — built by an agent, auditable by a human, and reusable as briefing input for design work.

Duration35–45 minutes

Slides12 slides with notes and narration

Learning objectives

  • Structure a research packet so the question, method, and evidence are inspectable.
  • Decide which research activities agents do well and which corrupt without human contact.
  • Set source and citation standards before the first packet is built.
Slide deck

Work through the module

Each slide is shown in its 16:9 frame, exactly as it appears in the video version. Open the notes under any slide for the longer explanation, and the narration if you prefer to read along.

Slide 1 of 1216:9

Research Packets

Agentic Design Research · Module 1 of 6 — the packet is the deliverable

  • Why the unit of agentic research is a packet, not a slide deck
  • The anatomy: question, sources, method, findings, evidence, open questions
  • What agents do well in research, and what corrupts without human contact
  • Source and citation standards set before the first packet is built

Every later module in this course — teardowns, synthesis, funnels, journeys, briefs — produces its output in this packet shape.

Slide notes

Open by naming the change in deliverable, because it is the quiet premise of the whole course. When research is done by hand, the deliverable is usually a presentation: a curated narrative of what the researcher concluded, with the evidence compressed into a few pull quotes. When an agent does the gathering and structuring, it can produce something better — a packet where the question, the sources, the method, the findings, and the evidence trail all travel together — and the human's job shifts from assembling the deck to auditing and interpreting the packet.

Be explicit that the packet is not bureaucracy. The fields exist because they are exactly what a reviewer needs to decide whether the work can be trusted, and exactly what a later agent needs to reuse the work as briefing input. Clever bespoke formats fail because the next person, or the next agent, cannot reuse them.

This module deliberately precedes the method modules. Deep desk research, teardowns, synthesis, and funnel diagnosis all change shape depending on the topic — but they all land in the same packet structure. Get the container right once and the rest of the course slots into it.

Narration for this slide

Welcome to Agentic Design Research. This first module is about the container everything else in the course lands in: the research packet. When an agent does the gathering and the structuring, the deliverable stops being a slide deck of conclusions and becomes a packet — the question, the sources, the method, the findings, and the evidence trail, all in one place, built by the agent and auditable by you. We will walk through its anatomy, look honestly at what agents do well and badly in research, and set the source standards you need before the first packet gets built. Let's start with why the packet is the deliverable at all.

Slide 2 of 1216:9

Anatomy of a packet

Six parts, always the same. Boring and repeatable beats clever and bespoke.

  • Question — what decision this research would change, and what is out of scope
  • Sources — where each claim came from, dated, with what the source says vs what is inferred
  • Method — how evidence was gathered and themed, and what was excluded
  • Findings — ranked claims, with observation kept separate from inference
  • Evidence trail — quotes, links, files, screenshots; every claim points back to a source
  • Open questions and decision — gaps, risks, and the human call: approve, defer, reject

If a part is missing, the packet is not done — it is a pile of notes wearing a folder name.

Slide notes

Walk the six parts in order, because the order is the argument. The question comes first and is human-written: it names the decision the research would change, which is the single best defence against the agent's tendency to accumulate everything vaguely related. Sources and method are where agentic research earns trust — every claim is dated, attributed, and marked for whether the source actually says it or the packet infers it, and the method section records how the evidence was gathered and what was deliberately left out.

Findings are the part most people think of as the research. In a packet they are ranked claims, not a wall of notes, and observation is kept visibly separate from inference — a discipline that Module 2 spends a whole slide on. The evidence trail is what makes the findings auditable: a reviewer should be able to pick any claim and follow it back to a quote, a link, a file, or a screenshot.

The last part is the one agents cannot own. Open questions name what the packet does not know; risk notes flag screenshots, claims, and sensitive territory before anything goes public; and the decision — approve, defer, reject, needs more research — is recorded by a person. The packet ends with judgment, not with a summary paragraph.

Narration for this slide

Here is the anatomy. Six parts, and they are always the same six. The question — what decision this research would change. The sources — where every claim came from, dated, with the source's own words separated from what we infer. The method — how the evidence was gathered and what was excluded. The findings — ranked claims, with observation kept apart from inference. The evidence trail — quotes, links, and screenshots that every claim points back to. And finally, open questions and the decision — what the packet does not know, and the human call on what happens next. Boring and repeatable, on purpose. That is what makes it auditable and reusable.

Slide 3 of 1216:9

What each part does downstream

The packet is not filed and forgotten. Each part has a job after the research run ends.

Six cards show the parts of a research packet — question, sources, method, findings, evidence trail, and open questions with the decision — with arrows pointing down to what each part does after the run: the question scopes later runs and briefs, sources are re-checked when the packet is regenerated, the method lets a reviewer audit or re-run the work, findings become constraints in design briefs, evidence citations are spot-checked at review, and the decision is the gate before anything reaches a brief or a public page. The question and decision are marked human-led, sources, method and evidence agent-built, and findings the shared output.
The question and the decision are human-led; sources, method, and the evidence trail are agent-built; findings are drafted by the agent and judged by the human. Each part keeps working after the run: scoping, re-checking, auditing, briefing, spot-checking, gating.

A packet earns its keep downstream — in the briefs it feeds and the audits it survives — not in how complete it looks on the day it is built.

Slide notes

Use the diagram to make the packet feel like infrastructure rather than paperwork. Read it in two passes. First pass, ownership: the question and the decision carry the human-led colour, the gathering parts — sources, method, evidence — are agent-built, and findings sit in between because the agent drafts them and the human decides what they mean. That ownership split is the same human-gate pattern this school uses everywhere: the agent runs production, the human holds the ends.

Second pass, the downstream band. The question keeps later runs and briefs scoped to a decision instead of general curiosity. Sources get re-checked when the packet is regenerated — packets in fast-moving areas go stale, and regeneration is cheaper than arguing about memory. The method is what lets a sceptical reviewer challenge or re-run the work. Findings become the constraints and direction lines in later design briefs, which is the subject of slide seven. The evidence trail is what reviewers spot-check rather than re-read in full. And the decision is the gate: nothing moves from research into a brief or a public page before it.

If participants take one thing from the diagram, it should be that every field in the packet exists because something downstream consumes it. Fields nobody consumes can be cut.

Narration for this slide

This diagram shows why the structure matters. Top row, the six parts. Bottom row, what each part does after the run ends. The question keeps later runs scoped to a real decision. The sources get re-checked when the packet is regenerated, instead of trusting memory. The method is what lets a reviewer challenge or re-run the work. The findings feed straight into design briefs as constraints and direction. The evidence trail is what gets spot-checked at review. And the decision is the gate — nothing reaches a brief or a public page before it. Notice the colours: you own the question and the decision; the agent builds the middle.

Slide 4 of 1216:9

What agents do well: coverage, structure, recall, traceability

Agents are strong at exactly the parts of research that exhaust people.

  • Coverage — reading forty sources, twelve competitors, or a year of tickets without fatigue
  • Structure — putting every finding into the same fields, every time
  • Recall — surfacing the source, the date, and the exact quote months later
  • Traceability — keeping the link between claim and evidence intact at scale
  • Re-running — regenerating the packet when sources change, instead of patching from memory

The agent's advantage is not insight. It is the stamina to keep evidence organised at a scale where people start summarising from memory.

Slide notes

Be specific about where the leverage actually is, because the honest list is less glamorous than the marketing version. Coverage is the obvious one: an agent can read the documentation, changelogs, reviews, and pricing pages of a dozen products, or a year of support tickets, in the time a researcher spends on two. The quality of any single judgment is lower than a good researcher's; the floor across the whole corpus is far higher than what a tired person produces at item thirty of forty.

Structure and recall are the underrated pair. People write their best source notes on day one and their worst on day five; an agent fills the same fields with the same discipline on every item. And because everything is text in a repository, recall is mechanical — the exact quote, the URL, the date checked — rather than reconstructed from memory in front of a stakeholder.

Traceability and re-running are what make packets durable. Hand-built research loses the claim-to-evidence link the moment the deck gets edited. A packet keeps it, and when the underlying sources change — a tool renames a feature, a competitor changes pricing — the cheaper move is often to re-run the packet against fresh sources rather than argue about which parts of the old one still hold. The school's own research runs work this way, and the worked example later in the module shows one.

Narration for this slide

So what do agents actually do well here? Four things, and they are the parts of research that exhaust people. Coverage — reading forty sources or a year of tickets without getting tired and starting to skim. Structure — putting every finding into the same fields, every single time, on day one and day five. Recall — surfacing the exact quote, the source, and the date you checked it, months later. And traceability — keeping the link between every claim and its evidence intact at a scale where humans quietly start summarising from memory. None of that is insight. It is stamina and bookkeeping. But it is precisely the stamina and bookkeeping that decide whether research can be trusted.

Slide 5 of 1216:9

What they do badly: novelty, contact, knowing what is missing

These are not rough edges that the next model version fixes. They are structural, and the packet has to design around them.

  • Novelty — agents reproduce the centre of what is already written; genuinely new framings come from people
  • Contact with users — an agent cannot sit in an interview, notice hesitation, or earn a candid answer
  • Knowing what is missing — absence of evidence reads as absence of a problem
  • Confidence calibration — fluent prose makes weak evidence sound as solid as strong evidence
  • Stakes and ethics — consent, privacy, and what is fair to publish are human responsibilities

Use agents for the gathering and the structuring. The moment the work needs new contact with users or a judgment about what is absent, a person has to be in it.

Slide notes

This slide is the counterweight to the previous one and deserves equal time. Novelty first: an agent synthesising desk sources will converge on the consensus view of those sources, because that is what synthesis of existing text does. If the question needs a framing nobody has written down yet, the agent can assemble the raw material but the reframe comes from a person. Treat agent output as a very well-organised literature review, not a discovery.

Contact with users is the hard limit. An agent can draft a discussion guide, transcribe, and theme — Module 3 covers all of that — but it cannot sit in the room, notice the pause before an answer, or build the trust that makes someone say what they actually think. Research programmes that replace contact with desk synthesis drift towards confident descriptions of users nobody has spoken to recently.

The last three are about epistemics. Agents do not reliably know what is missing: if no source discusses a failure mode, the packet reads as if the failure mode does not exist, which is why the open-questions field is mandatory rather than optional. Fluency flattens confidence — a low-confidence inference reads as smoothly as a verified fact unless the packet forces confidence labels. And consent, privacy, and publication ethics are accountability questions; the agent can flag them in risk notes, but a person answers for them.

Narration for this slide

Now the other half, and these are structural, not bugs. Agents are weak at novelty — they reproduce the centre of what is already written, so a genuinely new framing still comes from you. They have no contact with users — they cannot sit in an interview, notice hesitation, or earn a candid answer, and research that replaces contact with desk synthesis goes quietly stale. They do not know what is missing — if no source mentions a problem, the packet reads as if the problem does not exist. Their fluency flattens confidence, so weak evidence sounds as solid as strong evidence. And ethics — consent, privacy, what is fair to publish — stays with you. The packet is designed around these limits, not in denial of them.

Slide 6 of 1216:9

Source standards: set them before the first packet

Decide what counts as evidence, how it is cited, and how confidence is labelled — once, in writing, before any agent runs.

StandardThe ruleWhy it exists
Says vs infersRecord what the source directly says separately from what the packet infersFeature names and marketing copy sound more capable than the actual behaviour
Date everythingEvery source note carries a date checked; volatile facts are re-fetched before publicationTool behaviour, pricing, and availability change between the run and the decision
Confidence labelsHigh: source plus local test. Medium: source only. Low: anecdotal — do not publish as guidanceFluent prose hides weak evidence unless the label forces it into view
Short excerptsPoint at sources and summarise relevance; do not copy whole posts or docs into the packetThe packet stays reviewable, and rights stay clean
Screenshot rightsRecord who owns each screenshot and whether it can be used publiclyEvidence gathered for research is not automatically usable in published work

Standards set after the first packet exists become arguments about that packet. Standards set before it become rules the agent simply follows.

Slide notes

The timing in the title is the point: these standards belong in the agent's instructions — the harness, in this school's vocabulary — before the first run, because retrofitting them means re-litigating work that already exists. Once they are written down, the agent applies them with more consistency than a person would, which is one of the few places where the agent's lack of judgment is an advantage.

The says-versus-infers rule matters most in fast-moving tooling and competitive research, where a feature name or a marketing page routinely sounds more capable than the behaviour underneath. The school's own source-note template makes them separate fields: what the source directly says, and what this packet infers. Dating is the second habit: a packet built on a Monday can be quoting stale pricing by Friday, so volatile claims carry a date checked and get re-fetched before anything is published — the same conservatism this course applies to its own time-sensitive claims.

The three-level confidence scale is deliberately blunt. High means the source was checked and the behaviour was tested locally; medium means the source was checked but not tested; low means anecdotal or unclear, and low-confidence material does not get published as guidance — it gets a follow-up check or a test. The last two rows are hygiene with legal weight: keep excerpts short and pointed rather than copying whole documents into the repository, and record screenshot ownership at capture time, because an image that was fine to study is not automatically fine to publish.

Narration for this slide

Before the first packet gets built, write down the source standards, because afterwards they turn into arguments. Five rules carry most of the weight. Separate what a source says from what you infer — feature names always sound more capable than the behaviour. Date every source note, and re-check volatile facts before anything is published. Label confidence honestly: high means source plus a local test, medium means source only, low means anecdotal — and low does not become guidance. Keep excerpts short; point at sources rather than copying them in. And record screenshot rights when you capture, not when you publish. Put these in the agent's instructions once, and it follows them more consistently than any of us do.

Slide 7 of 1216:9

Packets as briefing inputs

The packet is not the end of the research. It is the start of the next brief.

  • Findings become the constraints and direction lines of a design brief
  • The evidence trail travels with the brief, so the agent and the team see the why
  • The question keeps the brief honest about which decision the work serves
  • Open questions become explicit assumptions in the brief, not silent gaps
  • One packet can feed several briefs; one brief should cite which packets it stands on

Research that ends as a deck gets presented. Research that ends as a packet gets reused — by people and by the next agent.

Slide notes

This slide is the bridge between this course and the design-side workflows the school teaches elsewhere. A design brief asks for the user job, the constraints, the references, and the review criteria. A good packet already contains most of that: the findings supply the constraints and the direction, the question names the decision the work serves, and the evidence trail explains why each constraint exists — which matters as much for the agent generating the design as for the teammates reviewing it.

The practical mechanics are unglamorous. Briefs cite packets by path, and packets sit in the same repository as the design work, so the agent generating a screen can be pointed at the packet that motivated it. That is the difference between a brief that says reduce friction at sign-up and one that says the packet at this path shows the drop happens at the email-verification step, with the evidence two clicks away.

The open-questions field earns its keep here too. Gaps the research could not close should appear in the brief as named assumptions rather than disappearing, because a named assumption is something a team can decide to test, while a silent gap becomes a confident design decision nobody remembers making. Module 6 of this course is entirely about this insight-to-brief step; this slide plants the connection.

Narration for this slide

Here is the part most research processes skip: what the packet is for. It is briefing input. The findings become the constraints and the direction lines of a design brief. The evidence trail travels with the brief, so the agent generating the work — and the team reviewing it — can see why each constraint exists. The question keeps the brief honest about which decision it serves. And the open questions become named assumptions instead of silent gaps. Research that ends as a deck gets presented once. Research that ends as a packet gets reused — by people, and by the next agent. Module 6 goes deep on this step; for now, just notice that the packet is built to be consumed.

Slide 8 of 1216:9

Folder and file conventions that make packets reusable

One dated folder per run, the same files in every folder. The repository layout is part of the method.

  • Folder name = date plus question slug, so runs sort and stale ones are obvious
  • Same file names in every run — agents and reviewers know where to look
  • Markdown and JSON, in the repo: diffable, searchable, citable by path
One research run, one folder
content/research-runs/2026-06-01-choosing-your-agent-platform/
├── research-report.md      # findings, ranked, observation vs inference
├── signals.json            # raw signals, structured for re-use
├── article-candidates.md   # proposed new public material
├── update-candidates.md    # proposed changes to existing material
├── visual-plan.md          # diagrams and screenshots, with rights
├── compliance-notes.md     # risk: claims, rights, sensitive territory
└── decision-log.md         # the human call: approve, defer, reject

Conventions are what let the next run, the next reviewer, and the next agent pick up a packet without asking how it is organised.

Slide notes

The example on the slide is a real folder from this school's own repository — one of more than twenty research runs under content/research-runs/ as of June 2026 — not a proposed ideal. The conventions are deliberately dull. The folder name is the run date plus a slug for the question, which makes runs sort chronologically and makes staleness visible at a glance. The file names are identical across every run: the report with the findings, the raw signals as structured data, candidate actions split into new material and updates to existing material, the visual plan with rights noted, the compliance or risk notes, and the decision log.

Say why this matters more in agentic research than it did before. An agent asked to build on previous research can only do so if previous research is findable and predictable: same paths, same file names, same fields. The repository layout is effectively part of the method. It is also what makes packets citable — a brief can reference a finding by path and heading, and a reviewer can follow that path months later.

Map the files loosely onto the anatomy from earlier so the structure does not feel like a second thing to learn: the question and method live at the top of the report, findings and evidence fill its body and the signals file, candidates are the packet reaching towards briefs, and the risk notes and decision log are the human end of the packet. Adapt the file names to your own context freely; what cannot vary is that every run uses the same ones.

Narration for this slide

Here is what this looks like on disk, taken from this school's own repository. One dated folder per research run — the date plus the question. Inside, the same files every time: the research report with the ranked findings, the raw signals as structured data, candidate actions for new and existing material, a visual plan with rights noted, risk notes, and a decision log. Nothing clever, and that is the point. Because every run is organised the same way, the next reviewer knows exactly where to look, a brief can cite a finding by path, and the next agent can build on previous research instead of starting cold. The folder layout is part of the method.

Slide 9 of 1216:9

Worked example: one packet, end to end

A real run from this school's repository: research for the article comparing agent platforms for design work, run on 2026-06-01.

Packet partWhat this run actually contains
QuestionWhat should the platform-comparison article say, and is any of it stale?
SourcesOfficial changelogs, release pages, and docs fetched on the run date; prior runs reused, not re-verified; each claim dated
MethodPer-platform fact check against the book chapter and the prior run; volatile facts flagged for re-fetch before publication
FindingsOne platform in the article title enters deprecation 17 days after the article date; several book-era facts are stale and must not be carried over
EvidenceEach fact carries its source and a confidence note — directly fetched, via search snippet, or re-verify before quoting
DecisionReframe the fourth platform, refresh stale facts, keep pricing claims dated — recorded in the decision log

The packet caught a finding that would have embarrassed the published article — not through insight, but because the method forced every claim to be checked and dated.

Slide notes

Walk the table as a story, because it shows the anatomy doing real work. The run is 2026-06-01-choosing-your-agent-platform in this school's research-runs folder. The question was narrow and decision-shaped: the school planned an article comparing four agent platforms for design work, and the run existed to confirm what the article could safely claim. The sources section shows two of the standards from earlier in action — every claim is dated, and findings from a previous run are explicitly reused rather than silently re-asserted.

The headline finding is the kind that desk research at agent scale is good at surfacing: one of the four platforms named in the article title was being retired for individual users roughly two and a half weeks after the planned publication date. Publishing the comparison as drafted would have dated the article almost immediately. The packet also caught several facts inherited from the book manuscript that were no longer true — capabilities the book said a platform lacked that it had since gained. None of this required insight; it required checking every claim against a current source and writing down the date.

Point at the confidence discipline in the evidence row: facts fetched directly from official pages are marked as such, facts that arrived via search snippets are marked re-verify before quoting, and pricing is treated as volatile by default. Then the decision log records the human call about what the article does. That is the whole loop of this module in one folder: agent-built coverage and traceability, human-owned question and decision.

Narration for this slide

Let's trace one real packet. On the first of June 2026, this school ran a research packet for an article comparing four agent platforms for design work. The question was simple: what can the article safely claim, and is any of it stale? The agent checked every platform against current changelogs and docs, dated every claim, and marked which facts came straight from official pages and which needed re-verifying. The headline finding: one of the four platforms in the article's title was being retired seventeen days after the planned publish date — and several facts inherited from the book were no longer true. The decision log records what the article does about it. No brilliance involved. Just every claim checked, dated, and traceable.

Slide 10 of 1216:9

Risk notes and the decision: the human end of the packet

The packet flags risk before any public copy exists, and ends with a recorded decision rather than a fade-out.

  • Flag claims about what tools or products can do unless a source or local test confirms them
  • Flag screenshots and quotes whose rights or consent are unclear
  • Flag anything touching privacy, accessibility, legal, financial, or medical territory
  • Do not let one experiment or one loud voice become universal advice without naming the scope
  • Record the decision — approve, defer, reject, needs more research — and who made it

Nothing should auto-publish from a research run. The packet exists to make the human decision easier, not to remove it.

Slide notes

Risk notes are the part of the packet that feels optional until the day they are not. The most common risks in agentic design research are mundane: stating that a tool supports a workflow when the source only implies it, overclaiming what an agent can do, using screenshots whose ownership was never recorded, exposing internal research metadata — run identifiers, internal status labels — on public pages, and writing guidance that sounds universal when it was only ever true for one stack or one team. The packet does not replace counsel or a privacy review; it keeps these questions visible at the moment they are cheapest to answer.

The scope warning deserves its own sentence. Agent-scale research makes it easy to generate confident, general-sounding claims from thin evidence — one failed experiment, one vocal reviewer, one competitor's edge case. The discipline is to name the scope in the finding itself: in this corpus, for this segment, on this date.

The decision log is the packet's full stop. Approve, defer, reject, or needs more research — recorded with a name and a date. This is the same gate pattern used everywhere in this school's workflows: the agent can prepare everything up to the decision, and the decision is a person's. Teams that skip the recorded decision end up with research that was technically done but never owned, which is how findings get re-litigated months later.

Narration for this slide

Two parts of the packet stay firmly human, and they sit at the end. Risk notes first: flag any claim about what a tool can do that a source or a test has not confirmed, any screenshot or quote whose rights are unclear, anything that touches privacy, accessibility, legal or medical territory, and any finding that is about to become universal advice on the back of one experiment. Then the decision: approve, defer, reject, or needs more research — recorded, with a name and a date. Nothing auto-publishes from a research run. The packet's job is to make that decision easy and well-evidenced, not to make it for you.

Slide 11 of 1216:9

Exercise: define the packet for a question you currently hold

Take a research question you actually have this month and define its packet on one page. Do not run it yet — the definition is the exercise.

  • Write the question as a decision: what would change depending on the answer
  • List the sources you would point an agent at, and mark which need rights or consent care
  • Note which parts an agent should build and where human contact is non-negotiable
  • Write the source standards line: dating, says-vs-infers, and your confidence labels
  • Name the folder and files the packet would live in, and who records the decision

Keep the page. In Module 2 you will run a deep research pass against this exact definition and see how much of the quality was set before the agent started.

Slide notes

Like the exercises across this school's courses, this one is deliberately analogue: a page, not a terminal. The aim is to surface how much of packet quality is decided before any agent runs — in the question, the source list, and the standards — and most participants find the first bullet hardest. Questions arrive as topics: onboarding, pricing, the competitor's new feature. Turning a topic into a decision — what would we do differently depending on the answer — is the single most valuable rewrite, and it is exactly the discipline that stops agent-scale research becoming unfocused accumulation.

The third bullet forces the honest split from earlier in the module. If the question genuinely needs new contact with users, the packet should say so and scope what desk research can and cannot answer in the meantime, rather than letting coverage substitute for contact. The fourth bullet is small but binding: writing the standards line now means the first real run inherits it.

If running this live, have a few people read their question-as-decision out loud and push on it: who makes that decision, when, and what evidence would actually move them. Questions that survive that push are worth a packet. Keep the pages — Module 2 opens by scoping a deep research run, and this definition is its starting input.

Narration for this slide

Your turn. Take a research question you actually hold this month and define its packet on one page — without running anything. First, rewrite the question as a decision: what would change depending on the answer. Then list the sources you would point an agent at, and mark the ones that need rights or consent care. Note which parts the agent should build, and where human contact is non-negotiable. Write your source standards in one line: dating, says-versus-infers, confidence labels. And name the folder, the files, and the person who records the decision. Keep the page. Module 2 starts a deep research run from exactly this definition.

Slide 12 of 1216:9

Summary, and the bridge to deep research

  • The packet is the unit of agentic research: question, sources, method, findings, evidence, decision
  • Agents supply coverage, structure, recall, and traceability — not novelty, contact, or knowing what is missing
  • Source standards — dating, says-vs-infers, confidence labels — are set before the first run, in the agent's instructions
  • Stable folder and file conventions make packets auditable, citable, and reusable as briefing input
  • The packet ends with risk notes and a recorded human decision, never an auto-publish

Module 2 puts the packet to work: deep desk research and competitive teardowns, where coverage is cheap and judgment about it is not.

Slide notes

Recap by connecting the bullets rather than repeating them. The packet structure exists because of the strengths-and-limits split: agents are good at coverage, structure, recall, and traceability, so those parts are agent-built; they are bad at novelty, contact, and knowing what is missing, so the question, the open-questions field, the risk notes, and the decision stay human. The source standards and the folder conventions are what turn a one-off good packet into a repeatable practice — they live in the harness and the repository, not in anyone's memory.

The worked example is the proof point worth restating: a real run caught a deprecation and a set of stale facts that would have undermined a published article, purely because the method forced every claim to be checked and dated. That is the realistic promise of this material — fewer embarrassments and faster decisions, not magic insight.

Preview Module 2 concretely. It takes the packet into the noisiest territory: deep desk research and competitive teardowns, where an agent can produce enormous coverage very quickly. The module covers scoping the question so breadth stays useful, capturing flows and patterns with screenshot evidence and the rights questions that come with it, and the discipline of keeping observation, inference, and recommendation visibly separate. Participants who did the exercise bring their packet definition with them.

Narration for this slide

Let's close. The unit of agentic research is the packet: question, sources, method, findings, evidence trail, and a recorded decision. Agents bring coverage, structure, recall, and traceability; they do not bring novelty, contact with users, or an awareness of what is missing — so the packet keeps those parts human. Source standards are set before the first run, and stable folder conventions make every packet auditable and reusable as briefing input. In Module 2 we put the packet to work on deep desk research and competitive teardowns — the territory where coverage is cheap and judgment about it is not. Bring the packet you defined in the exercise. See you there.

Module transcript
Module 1, narrated slide by slide

Slide 1Research Packets

Welcome to Agentic Design Research. This first module is about the container everything else in the course lands in: the research packet. When an agent does the gathering and the structuring, the deliverable stops being a slide deck of conclusions and becomes a packet — the question, the sources, the method, the findings, and the evidence trail, all in one place, built by the agent and auditable by you. We will walk through its anatomy, look honestly at what agents do well and badly in research, and set the source standards you need before the first packet gets built. Let's start with why the packet is the deliverable at all.

Slide 2Anatomy of a packet

Here is the anatomy. Six parts, and they are always the same six. The question — what decision this research would change. The sources — where every claim came from, dated, with the source's own words separated from what we infer. The method — how the evidence was gathered and what was excluded. The findings — ranked claims, with observation kept apart from inference. The evidence trail — quotes, links, and screenshots that every claim points back to. And finally, open questions and the decision — what the packet does not know, and the human call on what happens next. Boring and repeatable, on purpose. That is what makes it auditable and reusable.

Slide 3What each part does downstream

This diagram shows why the structure matters. Top row, the six parts. Bottom row, what each part does after the run ends. The question keeps later runs scoped to a real decision. The sources get re-checked when the packet is regenerated, instead of trusting memory. The method is what lets a reviewer challenge or re-run the work. The findings feed straight into design briefs as constraints and direction. The evidence trail is what gets spot-checked at review. And the decision is the gate — nothing reaches a brief or a public page before it. Notice the colours: you own the question and the decision; the agent builds the middle.

Slide 4What agents do well: coverage, structure, recall, traceability

So what do agents actually do well here? Four things, and they are the parts of research that exhaust people. Coverage — reading forty sources or a year of tickets without getting tired and starting to skim. Structure — putting every finding into the same fields, every single time, on day one and day five. Recall — surfacing the exact quote, the source, and the date you checked it, months later. And traceability — keeping the link between every claim and its evidence intact at a scale where humans quietly start summarising from memory. None of that is insight. It is stamina and bookkeeping. But it is precisely the stamina and bookkeeping that decide whether research can be trusted.

Slide 5What they do badly: novelty, contact, knowing what is missing

Now the other half, and these are structural, not bugs. Agents are weak at novelty — they reproduce the centre of what is already written, so a genuinely new framing still comes from you. They have no contact with users — they cannot sit in an interview, notice hesitation, or earn a candid answer, and research that replaces contact with desk synthesis goes quietly stale. They do not know what is missing — if no source mentions a problem, the packet reads as if the problem does not exist. Their fluency flattens confidence, so weak evidence sounds as solid as strong evidence. And ethics — consent, privacy, what is fair to publish — stays with you. The packet is designed around these limits, not in denial of them.

Slide 6Source standards: set them before the first packet

Before the first packet gets built, write down the source standards, because afterwards they turn into arguments. Five rules carry most of the weight. Separate what a source says from what you infer — feature names always sound more capable than the behaviour. Date every source note, and re-check volatile facts before anything is published. Label confidence honestly: high means source plus a local test, medium means source only, low means anecdotal — and low does not become guidance. Keep excerpts short; point at sources rather than copying them in. And record screenshot rights when you capture, not when you publish. Put these in the agent's instructions once, and it follows them more consistently than any of us do.

Slide 7Packets as briefing inputs

Here is the part most research processes skip: what the packet is for. It is briefing input. The findings become the constraints and the direction lines of a design brief. The evidence trail travels with the brief, so the agent generating the work — and the team reviewing it — can see why each constraint exists. The question keeps the brief honest about which decision it serves. And the open questions become named assumptions instead of silent gaps. Research that ends as a deck gets presented once. Research that ends as a packet gets reused — by people, and by the next agent. Module 6 goes deep on this step; for now, just notice that the packet is built to be consumed.

Slide 8Folder and file conventions that make packets reusable

Here is what this looks like on disk, taken from this school's own repository. One dated folder per research run — the date plus the question. Inside, the same files every time: the research report with the ranked findings, the raw signals as structured data, candidate actions for new and existing material, a visual plan with rights noted, risk notes, and a decision log. Nothing clever, and that is the point. Because every run is organised the same way, the next reviewer knows exactly where to look, a brief can cite a finding by path, and the next agent can build on previous research instead of starting cold. The folder layout is part of the method.

Slide 9Worked example: one packet, end to end

Let's trace one real packet. On the first of June 2026, this school ran a research packet for an article comparing four agent platforms for design work. The question was simple: what can the article safely claim, and is any of it stale? The agent checked every platform against current changelogs and docs, dated every claim, and marked which facts came straight from official pages and which needed re-verifying. The headline finding: one of the four platforms in the article's title was being retired seventeen days after the planned publish date — and several facts inherited from the book were no longer true. The decision log records what the article does about it. No brilliance involved. Just every claim checked, dated, and traceable.

Slide 10Risk notes and the decision: the human end of the packet

Two parts of the packet stay firmly human, and they sit at the end. Risk notes first: flag any claim about what a tool can do that a source or a test has not confirmed, any screenshot or quote whose rights are unclear, anything that touches privacy, accessibility, legal or medical territory, and any finding that is about to become universal advice on the back of one experiment. Then the decision: approve, defer, reject, or needs more research — recorded, with a name and a date. Nothing auto-publishes from a research run. The packet's job is to make that decision easy and well-evidenced, not to make it for you.

Slide 11Exercise: define the packet for a question you currently hold

Your turn. Take a research question you actually hold this month and define its packet on one page — without running anything. First, rewrite the question as a decision: what would change depending on the answer. Then list the sources you would point an agent at, and mark the ones that need rights or consent care. Note which parts the agent should build, and where human contact is non-negotiable. Write your source standards in one line: dating, says-versus-infers, confidence labels. And name the folder, the files, and the person who records the decision. Keep the page. Module 2 starts a deep research run from exactly this definition.

Slide 12Summary, and the bridge to deep research

Let's close. The unit of agentic research is the packet: question, sources, method, findings, evidence trail, and a recorded decision. Agents bring coverage, structure, recall, and traceability; they do not bring novelty, contact with users, or an awareness of what is missing — so the packet keeps those parts human. Source standards are set before the first run, and stable folder conventions make every packet auditable and reusable as briefing input. In Module 2 we put the packet to work on deep desk research and competitive teardowns — the territory where coverage is cheap and judgment about it is not. Bring the packet you defined in the exercise. See you there.