Slide 1 — Research Synthesis with Agents
Welcome to Module 3. This one is about synthesis — the step where research most often goes to die. Twelve interviews, nine hours of recordings, a synthesis session that gets postponed twice and then squeezed into an afternoon of sticky notes. Here is the framing for the whole module: synthesis is two jobs. Sorting — coding every transcript and ticket and keeping the evidence attached. And meaning — deciding what the patterns imply and what to do about them. Agents are very good at the first job and have no business doing the second. By the end of this module you will know how to delegate the coding without delegating the conclusion.
Slide 2 — Preparing the corpus: privacy comes first
Before any technique, the part the researcher owns outright: privacy. Nothing from a study reaches an agent until it is anonymised. Names become participant IDs. Emails, employers, and incidental personal details are stripped. The file that maps IDs back to real people stays outside the agent's folder, full stop. Then consent. If participants agreed to their recordings being analysed by your team, you decide — explicitly, on the record — whether automated analysis sits inside that consent, and whether your tooling keeps the data out of model training. If the answer is no, this workflow is simply not available for that study. That is not a limitation to work around. That is the system working.
Slide 3 — The codebook is the contract
The codebook is where your thinking enters the run. It is a short document: each code gets a name, a one-sentence definition, and an example quote. The codes come from your research questions, and you leave explicit room for new codes the agent proposes when nothing fits. Keep it to somewhere between eight and fifteen — thirty codes means everything matches something and you learn nothing. And here is the step you cannot delegate: read at least three transcripts yourself before you write it. The codebook captures what you noticed in the data. If you have not read the data, the agent will quietly supply its own framing — and its framing is the average of everyone else's research.
Slide 4 — The synthesis funnel
Here is the whole module in one picture. Five stages. You prepare: anonymise, settle consent, read a sample, write the codebook. Agents code: one per transcript or batch of tickets, same codebook, verbatim quotes only, every quote carrying a participant ID and a line reference. An agent clusters the codes into themes — each theme gets a claim, its supporting participants, its quotes, and an honest count. A separate challenge agent then plays sceptical second researcher: thin evidence, invented quotes, contradictions the synthesis ignored. And then it comes back to you for interpretation. Notice the green band underneath. The evidence trail runs the full length of the funnel. That trail is the difference between synthesis and a fast, confident opinion.
Slide 5 — Theming with evidence: every claim points at quotes
Here is the standard that everything else hangs off. Every theme is a one-sentence claim with its evidence attached: the codes it draws on, the participant IDs that support it, and two to four quotes — verbatim, with a line reference. Verbatim means copied word for word from the source. If the agent cannot find the exact words, it does not report a quote, because the failure mode here is specific: an agent that wants to be helpful will smooth three similar comments into one tidy quote nobody ever said. So you check. Pick five quotes at random and find them in the sources. If even one is fabricated, the whole evidence table is suspect and the coding reruns. That sounds severe. It is the price of being able to trust the rest.
Slide 6 — Counting honestly: prevalence vs loudness
Manual synthesis has a loudness problem. The most quotable participant and the angriest ticket end up carrying the conclusion, because vivid things are what we remember in the synthesis room. Counts fix that — and agents make counting cheap. Every theme says how many participants support it, or what share of the coded tickets it covers. But counts come with their own discipline. Eight of twelve is a description of your sample, not a statement about your user base. Tickets only represent people who bothered to write in. So the honest move is to treat the counted theme as a strong, evidenced hypothesis — and pair it with analytics before you size the fix. We will spend all of Module 4 on that quantitative side.
Slide 7 — Hunting contradictions on purpose
Synthesis wants to tell a tidy story — and so do agents. Similar codes get merged, the theme that matches what the team already believes gets stated a touch too confidently, and the two participants who contradict the pattern just disappear from the summary. So we hunt contradictions on purpose. A separate challenge agent — not the one that wrote the themes — reviews everything against the coded excerpts. It flags themes resting on fewer than three sources, quotes that are not actually verbatim, claims that say users want when only this sample said anything, and the disconfirming evidence the synthesis skipped. In practice the challenge pass demotes a meaningful share of themes, and the demotions are often the most valuable part. One habit makes it work: read the challenge file before the findings. Once the story sets, caveats become footnotes.
Slide 8 — Good vs bad synthesis output
Let's make the standard concrete with a comparison. On the left, claims that sound right: users are frustrated with onboarding, customers churn because of pricing, users want a Salesforce integration. On the right, the same territory stated so it can be checked: eight of twelve participants, named by ID, with quotes and line references. Price named in seven interviews — but in five of them only after a value gap was described, which changes the conclusion entirely. The CRM integration? Two mentions, one of them speculative, flagged as thin. Notice something uncomfortable: the left column reads better in a deck. The right column is what survives the first hard question from a stakeholder. That is the trade, and it is worth it every time.
Slide 9 — The interpretation pass: what stays human
Now the second job — the one that stays yours. The agents hand you themes that are coded, counted, and challenged. They cannot tell you which themes matter for the decision in front of the team, what the pattern means given everything you know that never appears in a transcript, or what should change because of it. So the interpretation pass is explicit work, not a formality. Give every theme a confidence level — strong, suggestive, or thin — and say why. Park the thin ones as open questions for the next study, and write down that you parked them. And own how the findings are represented to stakeholders, including which caveats travel with the headline. The agent shows you that a pattern exists. You decide what it means. You sign it.
Slide 10 — Reporting formats: packets, not decks of adjectives
Where do the findings live? Not primarily in a deck. Decks are where research goes to decay — a finding becomes a bullet, the bullet becomes an adjective, and six weeks later nobody can trace it. The output of synthesis is a packet: the themes file with claims and confidence levels, the evidence table, the challenge file with its demoted themes left visible, an opportunity map where every entry links to a supported theme, and the open questions that seed the next study. The codebook and the prompt get archived with it, so the study can be re-run or compared later. You still make the deck for the readout — but the deck is generated from the packet and points back into it. The packet is the deliverable.
Slide 11 — Worked example: forty support conversations
Let's trace one run end to end. The corpus: forty support conversations tagged onboarding. The researcher read ten of them, drafted a nine-code codebook, and anonymised the export. Four coding agents took ten conversations each and returned a hundred and sixty-three coded excerpts, every quote verbatim with a ticket reference. Clustering proposed six themes — the biggest was confusion between inviting a teammate and transferring ownership, in fourteen of the forty conversations. The challenge pass demoted two themes: one was thin, and one quietly generalised from people who wrote in to users in general. The researcher spot-checked quotes, set confidence levels, and paired the lead theme with analytics before anyone sized a fix. Total researcher time, about ninety minutes — spent on reading and interpretation, which is exactly where it belongs.
Slide 12 — Exercise: theme a small corpus and audit the trail
Time to run this on your own material. Pick a small corpus you already have — fifteen to thirty support tickets or reviews, or three to five old transcripts — and start by anonymising it and checking that consent actually covers automated analysis. Read a third of it yourself, then write a codebook of eight to twelve codes. Run the synthesis: coding with verbatim quotes, themes with coverage counts, a challenge pass. Then the part that matters most: audit the trail. Pick five quotes at random and go find them in the sources. Finally, write the interpretation by hand — a confidence level for each theme, what it means, and one open question you are parking. If all five quotes check out, you have earned some trust in the pipeline. If one fails, you have learned, cheaply, exactly why the rules exist.
Slide 13 — Summary, and what comes next
Let's close. Synthesis is two jobs, and this module split them deliberately. Agents do the sorting: coding against your codebook, clustering into themes, counting coverage, and challenging their own output for thin evidence and contradictions. You do the meaning: consent before the run, the codebook, the spot-check, the confidence levels, and the call about what changes because of it. The output is a packet with the evidence trail intact — not a deck of adjectives. And the counts, remember, describe your sample, not your user base. That is exactly where Module 4 picks up: surveys, experiments, and funnel diagnosis — the quantitative work where agents speed things up but the rigour still has to come from you. See you there.