AAgentic Design School
Module 5 of 6
40–50 minutes

Agentic Design Research

Journey Maps and Service Blueprints from Product Data

Journey maps and blueprints built from what the product actually records — events, tickets, session paths — rather than workshop recollection, and kept current by re-running the generation instead of redrawing the poster.

Duration40–50 minutes

Slides13 slides with notes and narration

Learning objectives

  • Generate journey maps from event and support data with the gaps marked honestly.
  • Extend journeys into service blueprints covering front-stage and back-stage steps.
  • Keep maps current as living artifacts rather than one-off workshop posters.
Slide deck

Work through the module

Each slide is shown in its 16:9 frame, exactly as it appears in the video version. Open the notes under any slide for the longer explanation, and the narration if you prefer to read along.

Slide 1 of 1316:9

The Journey Map the Data Can Actually Support

Agentic Design Research · Module 5 of 6

  • What events, tickets, and session paths can and cannot tell you
  • Generating stages, friction, and pain points with evidence attached to every claim
  • Extending the journey into a service blueprint: front-stage, back-stage, systems, policies
  • Keeping the map alive: validation with operators and regeneration on a schedule

A journey map is trustworthy exactly to the degree that every claim on it traces back to something a user actually did or said.

Slide notes

This module takes the evidence discipline built in Modules 1 to 4 — packets, citations, observation kept separate from inference — and applies it to two artifacts with a bad reputation for decay: journey maps and service blueprints. Both are usually produced in workshops, from memory, and both get quoted in decks long after anyone could check them. The promise here is narrower and more useful: a map built from what the product actually records, with the gaps marked, that can be regenerated when the data changes.

Set expectations about scope. The module covers two related workflows: journey mapping from product data, and extending that journey into a service blueprint using research repositories, runbooks, and ops documentation. Both are Advanced workflows in the school's library, and both follow the same shape — per-source evidence agents, a synthesis step, a challenge step, and a human review session that is part of the workflow rather than an afterthought.

If participants have lived through a journey-mapping workshop that produced a beautiful poster nobody trusts, invite that experience into the room early. The contrast between the workshop poster and the evidence-linked map is the emotional core of the module, and it lands harder when people supply their own examples.

Narration for this slide

Welcome to Module 5. So far in this course we have built research packets, run desk research and teardowns, synthesised qualitative data, and handled surveys and funnels. This module applies all of that discipline to two artifacts that usually escape it: journey maps and service blueprints. Most journey maps are workshop recollection with good typography. We are going to build them differently — from events, tickets, session paths, and research the organisation already has, with evidence attached to every stage and the gaps marked honestly. Then we will extend the journey into a blueprint, validate it with the people who run the service, and set it up to be regenerated instead of redrawn.

Slide 2 of 1316:9

Why most journey maps are fiction with good typography

The failure mode is predictable, and it is not a drawing problem. It is a provenance problem.

  • A workshop produces a poster; the emotions row gets filled in from memory
  • Six months later nobody can say which parts were observed and which were invented
  • The map keeps getting quoted in decks, and decisions inherit its guesses
  • The fix is provenance: every claim traces to a funnel number, a ticket, a note, or a quote
  • Maps built that way look less complete — and that is the point

A stage with no evidence should look empty, not confident.

Slide notes

The opening claim is deliberately blunt because the pattern is so common. Workshop-built maps are not worthless — they surface assumptions and align teams — but they are routinely treated as evidence when they are actually a record of what the people in the room believed that day. The emotions row is the usual casualty: it gets filled in because an empty cell looks unfinished, and the invented emotion then outlives everyone's memory of having invented it.

The fix is not more workshops or better facilitation. It is provenance: the same rule that governed synthesis in Module 3, applied to a different artifact. Every stage, pain point, and emotion on the map should be traceable to something a user actually did or said — a funnel drop, a ticket, a session note, an interview quote. Maps built this way are visibly less symmetrical than the workshop version, with empty cells and unverified flags, and that asymmetry is the feature, not the flaw.

Name the reason teams skip provenance: volume. The evidence for one journey is scattered across an analytics export, a few thousand tickets, session-recording notes, and a research repository. Reading and extracting all of it by hand is weeks of work, which is exactly the kind of work agents do well — while interpretation of what the journey means stays with people. That division of labour is the rest of the module.

Narration for this slide

Here is the uncomfortable truth about most journey maps: they are fiction with good typography. A workshop produces a beautiful poster, the emotions row gets filled in from memory, and six months later nobody can tell you which parts were observed and which were invented to fill an empty cell. But the map keeps getting quoted, and decisions inherit its guesses. The fix is not a better workshop. It is provenance — every claim on the map traces back to a funnel number, a ticket, a session note, or a quote. Maps built that way look rougher and less complete, and that is exactly the point. A stage with no evidence should look empty, not confident.

Slide 3 of 1316:9

Inputs: each source answers a different question

The sources do not substitute for each other. Treating them as interchangeable is how maps go wrong.

SourceWhat it can supportBlind spot
Analytics and event exportsVolume, sequence, drop-off, timing — what happened and how oftenNever says why; uninstrumented flows read as calm
Session paths and recording notesObserved behaviour, hesitation, workarounds in contextTiny sample; only the sessions someone recorded
Support ticketsPain severe enough to report, in the user's own wordsSkews toward anger; silent abandonment is invisible
Interview themesMotivation, expectation, emotionFiltered through memory and politeness
Sales and CS notesAccount-level context and stakesSkews toward the deals already at risk

A pain point backed by a funnel drop, forty tickets, and three quotes is a different object from one backed by a single memorable quote — and the map should show the difference.

Slide notes

This table is the intellectual core of the input stage. Analytics tells you what happened and how often, but nothing about why; worse, anything that is not instrumented reads as calm, which is silence masquerading as satisfaction. Tickets tell you what hurt enough to make someone write, which over-represents anger and completely misses the people who quietly gave up. Session notes show real behaviour in context but for a handful of users. Interviews give you language, motivation, and emotion — filtered through memory and the human tendency to be polite to a researcher.

The practical consequence is that the workflow keeps sources separate during extraction, precisely so the synthesis step can say which kind of evidence supports each claim. Emotion claims need interview or verbatim support; volume claims need analytics; behaviour-in-context claims need session notes. A claim that draws on all three is strong. A claim resting on one source should carry that fact visibly.

Before any agent runs, two human decisions matter. First, define the journey's edges in one paragraph — signup to activation is a different map from first marketing touch to renewal, and an undefined edge is the first place a map starts inventing. Second, anonymise the exports before anything reaches the workflow: the agents should see behaviour and language, not identities. Both of these are designer judgement, not agent work.

Narration for this slide

Before generating anything, look hard at the inputs, because each source answers a different question. Analytics tells you what happened and how often — never why — and anything you have not instrumented looks deceptively calm. Tickets give you pain in the user's own words, but only from people angry enough to write; the silent abandoners never appear. Session notes show real behaviour for a tiny sample. Interviews give you motivation and emotion, filtered through memory. The workflow keeps these sources separate during extraction so that the final map can say which kind of evidence supports each claim. A pain point backed by a funnel drop, forty tickets, and three quotes is simply a different object from one backed by a single memorable quote.

Slide 4 of 1316:9

Generating the journey: evidence agents, a synthesiser, a challenger

The run is a dynamic workflow with three kinds of agents, governed by one rule: no claim without a citation.

  • Stage anonymised exports with a manifest: what each file is, what period, what it cannot tell you
  • Evidence agents — one per source or batch — extract moments and signals with citations to rows, ticket IDs, note IDs
  • A synthesis agent assembles stages, actions, friction, emotions, and pain points; every cell lists its evidence IDs
  • A challenge agent flags thin, single-source, or contradicted claims as UNVERIFIED before any human reads the draft
  • The output is a draft map plus a gaps list — not a finished poster

No claim without a citation; no stage without evidence or an explicit unverified flag.

Slide notes

Walk the orchestration shape, because it is the same pattern participants saw in synthesis and will see again in the blueprint workflow. The run is a Claude Code dynamic workflow: an orchestration script holds the source files and extracted evidence in its own variables, so thousands of tickets and a long event export never enter the main conversation context — only the structured extractions and the final map do. Ticket extraction alone often fans out into dozens of batches, which is why the concurrency matters here.

Three agent roles carry the discipline. Evidence agents are per-source and refuse to interpret beyond their own source: analytics gives behaviour and volume, not feelings; tickets give reported pain, not prevalence. Every extracted item carries a citation back to a file row, ticket ID, or note ID, and quotes must be verbatim. The synthesis agent assembles stages, actions, thoughts, emotions, pain points, and opportunities, and is required to attach evidence IDs to every cell and to quantify where the analytics allows it. The challenge agent then audits the draft and marks anything thin, single-source, or contradicted as UNVERIFIED, listing it in a gaps file alongside the manifest's known gaps.

Two rules in the prompt do most of the protective work: do not infer emotions from analytics alone, and do not fill empty cells for symmetry. Both are exactly the moves an agent under pressure to look complete will otherwise make. The manifest's known-gaps line — for example, mobile web not instrumented — is what stops the absence of data being read as the absence of problems.

Narration for this slide

Here is how the journey actually gets generated. You stage the anonymised exports with a manifest that says what each file is, what period it covers, and — critically — what it cannot tell you. Then the run dispatches three kinds of agents. Evidence agents work one source or batch at a time, extracting moments and signals with citations back to the exact row, ticket, or note. A synthesis agent assembles the stages, actions, friction, and pain points, and every single cell has to list the evidence that supports it. Then a challenge agent audits the draft and marks anything thin, single-source, or contradicted as unverified — before any human reads it. The rule the whole run lives by: no claim without a citation, no stage without evidence or an explicit flag.

Slide 5 of 1316:9

From product data to an evidence-linked map

The pipeline in one picture: staged sources, evidence extraction, synthesis with ranked pain points, blueprint layers, the challenge gate, and the owner review.

Pipeline diagram showing product analytics, support tickets, and research findings staged as sources, feeding per-source evidence agents that extract cited moments into a shared ledger. A synthesis step assembles journey stages with evidence per stage and ranks pain points by evidence strength, with blueprint layers — front-stage, back-stage, systems, and policies — added beneath the journey. A challenge gate marks thin or contradicted claims as unverified, and the draft then goes to an owner review where the team accepts or disputes claims, assigns owners to the top pain points, and dates the map for the next regeneration.
Staging the sources and the owner review are human-led; extraction, synthesis, and blueprint assembly are agent-run; the challenge gate sits between them so weak evidence is visible before anyone presents the map.

The agents do the reading and assembling. The humans define the edges, hold the challenge output, and make the decisions.

Slide notes

Walk the diagram left to right and name who owns each column. The left column is human work: deciding which exports represent the journey, anonymising them, and writing the manifest with its known gaps. The second column is the evidence agents — one per source or batch — extracting moments into a shared schema with verbatim quotes and citations, then merging everything into a single evidence ledger with stable IDs.

The third column is synthesis. The journey card shows what the map carries: stages, actions, friction, emotion, pain points ranked by combined evidence strength, quantified wherever the analytics allows, with empty stages left visibly empty. The blueprint card underneath previews the second half of the module: once the customer-facing journey exists, the same evidence approach extends it downward into front-stage, back-stage, systems, and policy layers drawn from runbooks and ops documentation.

The right column is where trust is earned. The challenge gate is agent-run but exists to serve the human review: it marks thin, single-source, and contradicted claims as unverified rather than letting the map silently fill itself in. The owner review is the human decision point — accept or dispute each claim, assign owners to the top pain points, date the map, and record which exports produced it so staleness is visible later. If a team adopts only one element from this module, it should be that gate-plus-review pairing.

Narration for this slide

Here is the whole pipeline in one picture. On the left, the staged sources — product analytics, support tickets, research findings — each anonymised and described in a manifest. Evidence agents read them source by source and extract cited moments into one ledger. The synthesis step assembles journey stages with evidence attached to every stage and pain points ranked by how much evidence sits behind them, and the blueprint layers — front-stage, back-stage, systems, policies — get added underneath. Then the challenge gate marks anything thin or contradicted as unverified, and the draft goes to the owner review, where people accept or dispute claims, assign owners, and date the map. Agents do the reading and assembling. People hold the edges and the decisions.

Slide 6 of 1316:9

Marking the gaps: what the data cannot see

An evidence-based map is still a map of the evidence you happened to have. The gaps list is part of the deliverable.

  • Users absent from every export — including those who left before generating any data
  • Uninstrumented flows: absence of events is not absence of problems
  • Causation: funnels and tickets locate where to investigate, they do not explain why
  • Emotion claims without interview or verbatim support — behaviour alone cannot carry them
  • Smooth experiences: tickets and reviews describe the unhappy minority, not everyone

The challenge agent makes these limits visible. It does not remove them.

Slide notes

This slide is the honesty clause of the module, and it deserves unhurried treatment because it is where evidence-based mapping most often gets oversold. The map can only describe the users who showed up in the data. People who bounced before creating an account, or who churned silently, generate nothing — and a map that does not say so will be read as if they do not exist. The same applies to uninstrumented flows: in one of the case studies behind this module, the apparent calm of a workspace-creation stage was partly an artifact of an uninstrumented mobile web flow, and only the manifest's known-gaps line stopped that calm being celebrated.

Causation is the second standing limit. A funnel drop co-located with a pile of tickets tells you where to investigate; it does not tell you why people leave, and turning that correlation into a confident causal story is exactly the laundering Module 4 warned about. Emotion is the third: the workflow's rule is that emotion claims need interview or verbatim support, because reading feelings off event data is invention wearing a chart.

Finally, remind people what the unhappy-minority skew means for the emotion row specifically. In the returns-journey case later in this module, the team labelled the emotion row explicitly as the journey of the unhappy minority, because nothing in tickets or reviews represents customers who returned an item smoothly and said nothing. That label is a small thing that keeps a stakeholder from misreading the whole artifact.

Narration for this slide

Now the part that keeps this honest. An evidence-based map is still only a map of the evidence you happened to have. It cannot speak for users who never appear in any export — including the ones who left before generating data. It cannot treat uninstrumented flows as calm; absence of events is not absence of problems. It cannot turn a funnel drop into a cause; it can only tell you where to look. It cannot read emotions off behaviour alone — emotion claims need quotes behind them. And it cannot describe the people who had a smooth experience and said nothing. The challenge agent makes these limits visible on the map itself. It does not make them go away — and neither should you.

Slide 7 of 1316:9

Good and bad journey map claims

The unit of quality is the individual claim. Audit a few cells by hand after every run, before anyone presents the map.

Bad claimGood claim
Users feel overwhelmed during setupConnect-data stage: 61% of onboarding tickets, median 9 minutes on the credentials screen across 38 session notes, funnel drop from 71% to 38% (amp, zd, sess)
Customers are frustrated waiting for refunds44% of returns tickets arrive between drop-off and refund, median 11-day gap, echoed in 312 reviews; smooth returns are not represented in this data (zd, reviews)
Renewal depends on the champion relationshipEscalated accounts (14) cite support responsiveness; across all 220 accounts, active-user breadth tracks renewal more closely — divergence flagged UNVERIFIED pending churn interviews

A trustworthy claim names its evidence and its limits. A plausible uncited claim reads well and decays fast.

Slide notes

These pairs come from the school's journey-mapping workflow and the traced runs behind it. The pattern in the good column is consistent: a specific stage, a quantity the analytics actually supports, the source identifiers in brackets, and — in two of the three — an explicit statement of what the data does not cover. The bad column is not wrong so much as unfalsifiable: users feel overwhelmed during setup could be true of any product in any year, which is exactly why it survives unchallenged in decks.

The third row deserves a comment because it shows the workflow doing something a workshop cannot. The sales narrative and the usage analytics genuinely disagreed about what drives renewal, and the synthesis did not average them into something nobody had observed. It carried both strands, showed which evidence each rested on — fourteen escalated accounts versus all two hundred and twenty — and flagged the divergence as the map's central open question. Locating a contradiction precisely is often more valuable than resolving it prematurely.

The practical habit to take away: after every run, pick a handful of cells across different stages and check the citations by hand before the map goes anywhere near a stakeholder. If a citation does not hold, treat the surrounding synthesis as suspect. This is the same spot-check discipline from earlier modules, and it takes minutes, not hours.

Narration for this slide

Quality lives at the level of the individual claim, so let's compare. 'Users feel overwhelmed during setup' reads fine and tells you nothing — you cannot check it, and you cannot act on it. The good version names the stage, gives the numbers the data actually supports — sixty-one percent of onboarding tickets, nine minutes on the credentials screen, the funnel drop — and cites its sources. The refunds example goes further and states what the data cannot show: smooth returns are not represented. And the renewal example shows the workflow holding a contradiction honestly instead of averaging it away. After every run, spot-check a few cells by hand before the map gets presented. A trustworthy claim names its evidence and its limits.

Slide 8 of 1316:9

From journey to blueprint: front-stage, back-stage, systems, policies

The journey map shows what users experience. The blueprint shows what the organisation does to produce that experience — and where it fails to.

  • Lanes: customer actions, front-stage, back-stage, support processes, physical and digital evidence
  • The evidence already exists: research repositories, ticket samples, ops runbooks, process docs, stakeholder notes
  • Lane agents draft each layer from extracted moments, citing sources per step and flagging single-source steps
  • Back-stage steps must name the front-stage moment they support and any handoff across the line
  • Never invent a step to make the grid look complete — empty cells are workshop input, not failures

Most service failures live at the lines of visibility and internal interaction. The blueprint exists to make those lines inspectable.

Slide notes

The move from journey to blueprint is a change of question. The journey map asks what users experience; the blueprint asks what the organisation does — front-stage and back-stage — to produce that experience, and where the handoffs sit. Most organisations already hold the answer in scattered form: the research repository has the customer's side, the support queue has the failure modes, the ops runbooks describe back-stage steps nobody outside operations has read, and the rest lives in the heads of the people who run the service. The traditional blueprinting workshop ignores all of it and starts from a blank wall, which is why the people who know the back-stage best spend day one transcribing things that are already written down.

The agentic version flips the order. Per-source evidence agents extract moments from each staged file, then one lane agent per blueprint layer drafts customer actions, front-stage, back-stage, support processes, and evidence, pulling only from the extracted moments and citing source files for every step. Back-stage steps must name which front-stage moment they support and any handoff across the line of internal interaction, because that is where most service failures live. Keeping the blueprint as a structured file rather than a drawing means every step can carry citations and the rendered grid can be regenerated when the data changes.

The rule that protects the output is the same as the journey rule, stated for a different temptation: never invent a process step to make the grid look complete. An agent under pressure to fill cells will reach for how services like this usually work, and those plausible invented steps are exactly the ones a workshop participant will not think to challenge. A gap in the blueprint is information; an invented step is a liability someone will repeat in a steering committee.

Narration for this slide

Once the journey exists, extend it downward into a service blueprint. The journey shows what users experience; the blueprint shows what the organisation does to produce it — customer actions, front-stage, back-stage, support processes, and the evidence customers actually see. The raw material already exists: research repositories, ticket samples, ops runbooks, process documents, stakeholder notes. Lane agents draft each layer from the extracted evidence, citing sources for every step, and back-stage steps have to name the front-stage moment they support and any handoff across the line — because that is where services break. One rule protects everything: never invent a step to make the grid look complete. Empty cells are workshop input, not failures.

Slide 9 of 1316:9

Validating with the people who do the work

The draft is workshop preparation, not the deliverable. If the validation session never happens, you have a plausible document nobody has confirmed — which is worse than no map at all.

  • Read the gaps list before the map: it tells you where the draft is least trustworthy
  • Spot-check six citations across stages and lanes; if one fails, treat that lane as suspect
  • Run the session with frontline staff and ops — the people behind the line of visibility
  • Structure the session around gaps and conflicts, not a lane-by-lane read-through
  • For each unverified cell: gather evidence, schedule research, or accept the gap in writing
  • Pick the top pain points by combined evidence strength and assign owners

Documentation describes what is supposed to happen. Only the people who run the service can say whether it does.

Slide notes

This is the step that turns a plausible draft into something the organisation can act on, and it is the step most tempting to skip because the draft already looks finished. Resist that. Runbooks drift from practice, tickets over-represent failure, and stakeholder notes carry their authors' vantage points. The validation session — with frontline staff, operations, and the teams behind the line of visibility — is what confirms which documented steps still match reality and which exist only on paper.

The structure of the session matters as much as holding it. Walking the grid lane by lane wastes the room's scarce time on things the evidence already supports. Structure it around the gaps file and the flagged conflicts instead: the cells marked unverified, the steps where two sources describe the same moment differently, the back-stage steps that appear in a runbook but never in anything customer-facing. In one of the case studies behind this module, a B2B onboarding blueprint surfaced nine steps where sales, implementation, and support documents disagreed about owners and timelines — and the validation workshop took ninety minutes instead of the planned half day, because the argument was already on the wall with citations.

Close the session with decisions, not vibes. Every unverified cell gets one of three outcomes: gather more evidence, schedule research, or accept the uncertainty explicitly and in writing. The top pain points get owners. The map gets a date and a record of which exports produced it, so its staleness is visible later. That last administrative-sounding step is what makes the next slide possible.

Narration for this slide

The run ends with a draft and a gaps list — not a finished map. The validation session is part of the workflow, and it belongs to the people who actually run the service: frontline staff, operations, the teams behind the line of visibility. Read the gaps list before the map, spot-check a handful of citations by hand, and then structure the session around the gaps and conflicts rather than a polite read-through. For every unverified cell, the team makes a call: gather evidence, schedule research, or accept the gap in writing. The top pain points get owners. And the map gets a date. Documentation tells you what is supposed to happen. Only the people doing the work can tell you whether it does.

Slide 10 of 1316:9

Keeping it current: regeneration as a scheduled run

The poster decays because updating it means redrawing it. A generated map is updated by re-running the generation against fresh exports.

  • Save the run as a reusable workflow command; keep the schema and agent definitions in version control
  • Keep the evidence ledger next to the map — challenges become lookups, not memory contests
  • Re-run quarterly, or before each planning cycle, against fresh exports
  • The re-run also measures change: did the pain point you funded actually move?
  • An unmaintained blueprint decays into the same confident, outdated documentation it was built to interrogate

The map stays alive because regenerating it costs an afternoon, not another workshop.

Slide notes

The economic argument for this whole approach lands on this slide. A workshop poster decays because updating it requires re-convening the workshop, so it never gets updated; it just gets quoted with decreasing accuracy. A generated map decays at the same rate, but updating it costs a re-run against fresh exports — the saved workflow, the same schema, the same agents, new data. That difference in update cost is what turns the journey map from a one-off artifact into a living one.

The mechanics are mundane and worth stating. Once the prompt and agent definitions are stable, the run gets saved as a project-level workflow command so the team shares it, and the agent definitions and evidence schema live in version control alongside the map and its ledger. Keeping the ledger next to the map permanently changes how the artifact behaves in meetings: when someone challenges a claim four months later, the answer is a lookup, not a contest of recollections.

The re-run also closes a loop the original poster never could: measurement. In the returns case study coming up next, the team shipped proactive status emails and the following quarter's re-run picked up a thirty-one percent drop in where-is-my-refund tickets from the fresh export. That number fed straight back into the prioritisation conversation. Date every version, record which exports produced it, and treat a map without a date the way you would treat an undated screenshot of a dashboard — as decoration.

Narration for this slide

Here is what makes this durable. The workshop poster decays because updating it means redrawing it, so nobody does. A generated map gets updated by re-running the generation. Save the run as a workflow command, keep the evidence schema and the agents in version control, and keep the ledger next to the map — so when someone challenges a claim months later, the answer is a lookup, not an argument. Re-run it each quarter or before each planning cycle against fresh exports. The re-run does double duty: it refreshes the map, and it tells you whether the pain point you funded actually moved. The map stays alive because regenerating it costs an afternoon, not another workshop.

Slide 11 of 1316:9

Worked example: a returns journey rebuilt from twelve months of tickets

An ecommerce retailer mapped its returns journey with almost no analytics — the return flow ran partly through a third-party logistics portal the team could not instrument.

StepWhat happened
Inputs4,800 support tickets tagged returns or refund, 1,900 product reviews mentioning returns, the carrier's status export
ExtractionPer-source evidence agents; ticket extraction fanned out into 24 batches
Key finding44% of return tickets were 'where is my refund', sent while the parcel was still in transit; median 11-day gap from drop-off to refund, only 4 of it processing
Challenge gateConfirmed the claim was multi-source and strong; flagged that smooth returns appear nowhere in the data
DecisionProactive status emails at carrier-scan and refund-issued; the next quarterly re-run measured a 31% drop in where-is-my-refund tickets

The worst moment in the journey was not the return itself — it was the silence after the parcel was dropped off. The data located it; the team decided what to do about it.

Slide notes

This trace comes from the school's journey-mapping workflow and is worth walking slowly because it shows the method working under awkward conditions: almost no product analytics. The returns flow ran partly through a third-party logistics portal the team could not instrument, so the evidence was a year's worth of support tickets, product reviews that mentioned returns, and the carrier's status export. The per-source agents handled the volume — ticket extraction alone fanned out into twenty-four batches — which is precisely the reading work no researcher was ever going to do by hand.

The synthesis put the journey's worst moment somewhere the team had not been focusing: not the return request, but the silence after drop-off. Forty-four percent of return-related tickets were where-is-my-refund messages sent while the parcel was still in transit, and the median gap between drop-off and refund confirmation was eleven days, of which only four were actual processing. Reviews told the same story in harsher language, and the challenge agent confirmed the claim was multi-source and strong — which is what gave the team confidence to act on it.

Notice what the challenge agent flagged on the other side: nothing in tickets or reviews represents customers who returned an item smoothly and said nothing, so the emotion row was explicitly labelled as the journey of the unhappy minority. And notice where the humans were: defining the journey's edges, deciding that proactive status emails were the right response, and reading the next quarter's re-run — a thirty-one percent drop in those tickets — as evidence the fix worked. The agents located the problem. The team designed the answer.

Narration for this slide

Let's trace one real run. An ecommerce retailer mapped its returns journey with almost no analytics, because part of the flow ran through a logistics portal they could not instrument. The evidence was twelve months of data: forty-eight hundred support tickets, nineteen hundred reviews mentioning returns, and the carrier's status export. Ticket extraction fanned out into twenty-four batches. The synthesis found the worst moment was not requesting the return — it was the silence afterwards. Forty-four percent of return tickets were 'where is my refund', sent while the parcel was still in transit, across a median eleven-day gap. The challenge agent confirmed the claim was strong, and flagged that smooth returns appear nowhere in this data. The team shipped proactive status emails, and the next quarterly re-run measured a thirty-one percent drop in those tickets.

Slide 12 of 1316:9

Exercise: inventory the data behind one journey you own

Pick one journey your team argues about. Do not run anything yet — spend twenty minutes finding out what evidence the organisation actually has.

  • Define the journey's edges in one paragraph: where it starts, where it ends, what counts as success
  • List every data source that touches it: analytics, tickets, session notes, research, runbooks, CS notes
  • For each source, note what it can support, what period it covers, and its main blind spot
  • Write the known-gaps line: what would be invisible to a map built only from these sources
  • Note who would need to be in the validation session — the people who do the work, not just the stakeholders

Keep the inventory. It becomes the manifest for the run, and Module 6 will ask which of its findings would actually change a brief.

Slide notes

The exercise is deliberately a paper exercise, the same way the Module 1 exercise of this course's companion was: no agents, no exports, just an honest inventory. Most participants discover two things in the first ten minutes. First, the organisation has more relevant evidence than they expected — tickets, old research, runbooks — scattered across tools nobody has connected. Second, the journey's edges are fuzzier than they assumed, and two colleagues asked to define the same journey will give different start and end points. Both discoveries are the point.

Steer people towards a journey that is contested or expensive: one where teams disagree about where users struggle, where a leadership group is about to fund work based on an assumed journey, or where the existing map is old enough that nobody trusts it. The exercise is much less useful on a journey everyone already agrees about.

The known-gaps line is the part to push hardest on, because it is the part teams skip. Uninstrumented surfaces, users who never generate data, the silent majority who had a fine experience — writing these down before any run is what stops the eventual map being read as more complete than it is. And the last bullet seeds the validation habit: if the participant cannot name the frontline people who would check the back-stage lanes, that is a gap in itself.

Narration for this slide

Time to apply this to your own work. Pick one journey your team argues about — onboarding, renewal, returns, support — and spend twenty minutes on an inventory, not a run. Define the edges in one paragraph: where the journey starts, where it ends, what success means. List every data source that touches it, and for each one note what it can support, what period it covers, and its blind spot. Then write the known-gaps line: what would a map built only from these sources be unable to see? Finally, name the people who would need to be in the validation session. Keep the page — it becomes the manifest when you run this for real, and the next module will ask what you would do with the findings.

Slide 13 of 1316:9

Summary, and what comes next

  • Most journey maps are workshop recollection; the fix is provenance, not better facilitation
  • Each source answers a different question — analytics, tickets, session paths, and interviews do not substitute for each other
  • The run is evidence agents, a synthesiser, and a challenger; no claim without a citation, no cell filled for symmetry
  • Blueprints extend the journey into front-stage, back-stage, systems, and policies — validated with the people who do the work
  • The map stays current because regeneration is a scheduled run, not another workshop

Module 6 takes the pain points this map ranked and turns them into briefs — the step that decides whether any of this research changes what gets built.

Slide notes

Recap by connecting the bullets rather than repeating them. Provenance is the through-line: the same evidence discipline that governed packets, teardowns, and synthesis earlier in the course, applied to two artifacts that traditionally escape it. The source table explains why extraction stays per-source; the three-agent pattern explains how the volume gets handled without the discipline collapsing; the challenge gate and the validation session are the two places weak evidence gets caught — one automated, one human. And regeneration is what makes the whole thing worth doing more than once.

Be explicit that this module produced an artifact full of ranked pain points, owners, and evidence — and that none of it has changed the product yet. That is deliberate framing for Module 6, From Insight to Brief, which covers the step most research decks skip: converting findings into design briefs that carry their evidence with them, prioritising by decision impact rather than by how interesting a finding is, and tracking what actually shipped because of the research.

If participants did the exercise, ask them to bring the inventory page to Module 6. The pain points they expect to find in their own journey are exactly the raw material for the brief-writing exercise there, and the contrast between a ranked pain point and a brief that an agent or a team could act on is the gap that module closes.

Narration for this slide

Let's close the module. Most journey maps are fiction with good typography, and the fix is provenance — every claim traced to something a user did or said. The sources answer different questions, so the workflow keeps them separate during extraction, then synthesises with citations and challenges its own draft before any human sees it. The blueprint extends the journey into the organisation's own machinery — front-stage, back-stage, systems, policies — and gets validated by the people who actually run the service. And the map stays alive because regeneration is a scheduled run, not another workshop. You now have ranked pain points with evidence and owners. Module 6 is about the step that makes all of it matter: turning insight into briefs that change what gets built. See you there.

Module transcript
Module 5, narrated slide by slide

Slide 1The Journey Map the Data Can Actually Support

Welcome to Module 5. So far in this course we have built research packets, run desk research and teardowns, synthesised qualitative data, and handled surveys and funnels. This module applies all of that discipline to two artifacts that usually escape it: journey maps and service blueprints. Most journey maps are workshop recollection with good typography. We are going to build them differently — from events, tickets, session paths, and research the organisation already has, with evidence attached to every stage and the gaps marked honestly. Then we will extend the journey into a blueprint, validate it with the people who run the service, and set it up to be regenerated instead of redrawn.

Slide 2Why most journey maps are fiction with good typography

Here is the uncomfortable truth about most journey maps: they are fiction with good typography. A workshop produces a beautiful poster, the emotions row gets filled in from memory, and six months later nobody can tell you which parts were observed and which were invented to fill an empty cell. But the map keeps getting quoted, and decisions inherit its guesses. The fix is not a better workshop. It is provenance — every claim on the map traces back to a funnel number, a ticket, a session note, or a quote. Maps built that way look rougher and less complete, and that is exactly the point. A stage with no evidence should look empty, not confident.

Slide 3Inputs: each source answers a different question

Before generating anything, look hard at the inputs, because each source answers a different question. Analytics tells you what happened and how often — never why — and anything you have not instrumented looks deceptively calm. Tickets give you pain in the user's own words, but only from people angry enough to write; the silent abandoners never appear. Session notes show real behaviour for a tiny sample. Interviews give you motivation and emotion, filtered through memory. The workflow keeps these sources separate during extraction so that the final map can say which kind of evidence supports each claim. A pain point backed by a funnel drop, forty tickets, and three quotes is simply a different object from one backed by a single memorable quote.

Slide 4Generating the journey: evidence agents, a synthesiser, a challenger

Here is how the journey actually gets generated. You stage the anonymised exports with a manifest that says what each file is, what period it covers, and — critically — what it cannot tell you. Then the run dispatches three kinds of agents. Evidence agents work one source or batch at a time, extracting moments and signals with citations back to the exact row, ticket, or note. A synthesis agent assembles the stages, actions, friction, and pain points, and every single cell has to list the evidence that supports it. Then a challenge agent audits the draft and marks anything thin, single-source, or contradicted as unverified — before any human reads it. The rule the whole run lives by: no claim without a citation, no stage without evidence or an explicit flag.

Slide 5From product data to an evidence-linked map

Here is the whole pipeline in one picture. On the left, the staged sources — product analytics, support tickets, research findings — each anonymised and described in a manifest. Evidence agents read them source by source and extract cited moments into one ledger. The synthesis step assembles journey stages with evidence attached to every stage and pain points ranked by how much evidence sits behind them, and the blueprint layers — front-stage, back-stage, systems, policies — get added underneath. Then the challenge gate marks anything thin or contradicted as unverified, and the draft goes to the owner review, where people accept or dispute claims, assign owners, and date the map. Agents do the reading and assembling. People hold the edges and the decisions.

Slide 6Marking the gaps: what the data cannot see

Now the part that keeps this honest. An evidence-based map is still only a map of the evidence you happened to have. It cannot speak for users who never appear in any export — including the ones who left before generating data. It cannot treat uninstrumented flows as calm; absence of events is not absence of problems. It cannot turn a funnel drop into a cause; it can only tell you where to look. It cannot read emotions off behaviour alone — emotion claims need quotes behind them. And it cannot describe the people who had a smooth experience and said nothing. The challenge agent makes these limits visible on the map itself. It does not make them go away — and neither should you.

Slide 7Good and bad journey map claims

Quality lives at the level of the individual claim, so let's compare. 'Users feel overwhelmed during setup' reads fine and tells you nothing — you cannot check it, and you cannot act on it. The good version names the stage, gives the numbers the data actually supports — sixty-one percent of onboarding tickets, nine minutes on the credentials screen, the funnel drop — and cites its sources. The refunds example goes further and states what the data cannot show: smooth returns are not represented. And the renewal example shows the workflow holding a contradiction honestly instead of averaging it away. After every run, spot-check a few cells by hand before the map gets presented. A trustworthy claim names its evidence and its limits.

Slide 8From journey to blueprint: front-stage, back-stage, systems, policies

Once the journey exists, extend it downward into a service blueprint. The journey shows what users experience; the blueprint shows what the organisation does to produce it — customer actions, front-stage, back-stage, support processes, and the evidence customers actually see. The raw material already exists: research repositories, ticket samples, ops runbooks, process documents, stakeholder notes. Lane agents draft each layer from the extracted evidence, citing sources for every step, and back-stage steps have to name the front-stage moment they support and any handoff across the line — because that is where services break. One rule protects everything: never invent a step to make the grid look complete. Empty cells are workshop input, not failures.

Slide 9Validating with the people who do the work

The run ends with a draft and a gaps list — not a finished map. The validation session is part of the workflow, and it belongs to the people who actually run the service: frontline staff, operations, the teams behind the line of visibility. Read the gaps list before the map, spot-check a handful of citations by hand, and then structure the session around the gaps and conflicts rather than a polite read-through. For every unverified cell, the team makes a call: gather evidence, schedule research, or accept the gap in writing. The top pain points get owners. And the map gets a date. Documentation tells you what is supposed to happen. Only the people doing the work can tell you whether it does.

Slide 10Keeping it current: regeneration as a scheduled run

Here is what makes this durable. The workshop poster decays because updating it means redrawing it, so nobody does. A generated map gets updated by re-running the generation. Save the run as a workflow command, keep the evidence schema and the agents in version control, and keep the ledger next to the map — so when someone challenges a claim months later, the answer is a lookup, not an argument. Re-run it each quarter or before each planning cycle against fresh exports. The re-run does double duty: it refreshes the map, and it tells you whether the pain point you funded actually moved. The map stays alive because regenerating it costs an afternoon, not another workshop.

Slide 11Worked example: a returns journey rebuilt from twelve months of tickets

Let's trace one real run. An ecommerce retailer mapped its returns journey with almost no analytics, because part of the flow ran through a logistics portal they could not instrument. The evidence was twelve months of data: forty-eight hundred support tickets, nineteen hundred reviews mentioning returns, and the carrier's status export. Ticket extraction fanned out into twenty-four batches. The synthesis found the worst moment was not requesting the return — it was the silence afterwards. Forty-four percent of return tickets were 'where is my refund', sent while the parcel was still in transit, across a median eleven-day gap. The challenge agent confirmed the claim was strong, and flagged that smooth returns appear nowhere in this data. The team shipped proactive status emails, and the next quarterly re-run measured a thirty-one percent drop in those tickets.

Slide 12Exercise: inventory the data behind one journey you own

Time to apply this to your own work. Pick one journey your team argues about — onboarding, renewal, returns, support — and spend twenty minutes on an inventory, not a run. Define the edges in one paragraph: where the journey starts, where it ends, what success means. List every data source that touches it, and for each one note what it can support, what period it covers, and its blind spot. Then write the known-gaps line: what would a map built only from these sources be unable to see? Finally, name the people who would need to be in the validation session. Keep the page — it becomes the manifest when you run this for real, and the next module will ask what you would do with the findings.

Slide 13Summary, and what comes next

Let's close the module. Most journey maps are fiction with good typography, and the fix is provenance — every claim traced to something a user did or said. The sources answer different questions, so the workflow keeps them separate during extraction, then synthesises with citations and challenges its own draft before any human sees it. The blueprint extends the journey into the organisation's own machinery — front-stage, back-stage, systems, policies — and gets validated by the people who actually run the service. And the map stays alive because regeneration is a scheduled run, not another workshop. You now have ranked pain points with evidence and owners. Module 6 is about the step that makes all of it matter: turning insight into briefs that change what gets built. See you there.