Slide 1 — Deep Design Research and Competitive Teardowns
Welcome to Module 2. In the last module we built the research packet — the question, the method, the findings, and the evidence trail, all in one auditable structure. Now we fill that structure with the two kinds of research agents do at a scale a single researcher cannot: deep desk research and competitive teardowns. The headline is on the slide. Coverage is cheap now — an agent can read forty sources or walk eight competitor products in an afternoon. Judgment about that coverage is not cheap, and it does not transfer to the agent. This module is about scoping the work, capturing real evidence, and keeping what you observed visibly separate from what you are inferring.
Slide 2 — Scope the question: what would change a decision
Everything starts with the question, and the test for a good one is simple: what decision would change because of the answer? Name that decision. Name the products, the moment in the experience, and the user. Then set the evidence standard up front — for example, marketing copy alone does not count as evidence of what a product does. And write down what would change your mind, because if nothing could, you are not researching, you are collecting support for a decision already made. Finally, choose the research angles yourself — usually one per competitor, one for the standard, one for criticism, one for recency. Five to eight angles. The agent supplies the breadth. You supply the aim.
Slide 3 — Two kinds of competitive research
There are two ways to research competitors, and they produce different artifacts. Deep desk research reads what has been written — articles, documentation, reviews, standards — and synthesises it into a cited report. A teardown walks the products themselves: it signs up, clicks through onboarding, screenshots the pricing page, and counts the steps. Here is why the difference matters. A literature search will tell you a competitor is loved for its onboarding. Only a teardown tells you it is a three-step flow that defers account creation until after the first project exists. You need both, and the most common mistake is substituting one for the other — usually letting published opinion stand in for what the product actually does.
Slide 4 — Decide what evidence counts before collecting any
Before anyone collects anything, fix what evidence counts. This is the dimensions file, and it is the rubric every competitor gets measured against. For onboarding: ordered screenshots from signup to the first meaningful action, with step counts. For pricing: the full page, the tiers, what is gated where. For positioning: the headline and the three claims the marketing leans on hardest, verbatim, with URLs. For core flows and empty states: what the product shows before any data exists. And for every marketing claim, the cross-check — did the walkthrough support it, contradict it, or fail to verify it? One more rule: anything that could not be collected gets recorded with the reason. Never guessed. Not collected is a finding.
Slide 5 — The teardown pipeline
Here is the whole teardown as a pipeline. You define the set: the decision, five to eight competitors, and the evidence rubric. Then agents take over — one collection agent per competitor, each filling an evidence folder with screenshots, pricing pages, and verbatim claims, against the same rubric. The findings get normalised: every cell cites a file, marketing claims get checked against what the walkthrough actually showed. Then synthesis: the comparison matrix, the named patterns, the opportunity gaps. And then it comes back to you — sample the evidence, separate what was seen from what is being inferred, and decide which gaps are worth pursuing. The dashed line is the re-run: next quarter, the same pipeline becomes a diff against the archive.
Slide 6 — Screenshot evidence and the rights questions
Screenshots are the evidence that settles arguments — but capturing competitor products comes with rules worth setting up front. Use trial accounts and public pages, stay inside the terms of service, and never misrepresent who you are to get access. Keep what you capture as internal evidence: reference it, link matrix cells to it, but do not republish competitor UI in decks or articles — use your own concept work for anything external. Date every capture, because pricing pages and onboarding flows change without notice, and undated evidence cannot be diffed later. And if a product is hidden behind a sales call, write that down as a finding about how they sell. It is not a gap you should fill some other way.
Slide 7 — Pattern surveys: table stakes vs differentiating
With evidence in the folders, the cross-competitor read begins, and the useful frame is table stakes versus differentiators. Table stakes are the patterns nearly everyone ships — setup checklists, template galleries, three pricing tiers with the middle one anchored. Nobody praises them; their absence is what gets noticed. Differentiators are the patterns one or two products own, and the evidence needs to show how they execute, not just that the feature exists. Count honestly: four of five products show a checklist on first run, with citations — not, most products do this. And remember the limit. A pattern the whole category shares might be best practice, or it might be a herd error. Structure is not performance, and the matrix cannot tell you which is which.
Slide 8 — Observation, inference, recommendation
Here is the discipline that keeps the whole module honest. Three kinds of statement, kept visibly separate. Observations are what the evidence shows — competitor A takes four steps from signup to a populated template, and here are the screenshots. Anyone can verify them. Inferences are readings of what the observations mean — short paths to first value seem to go with deferring team invites. Reasonable, but a step away from the evidence, so label it. Recommendations are what we should do about it — and they belong to you, the designer, because you carry the accountability. When these layers blur, marketing claims become facts and hunches become findings. Keep the labels on, and make the agent keep them on too.
Slide 9 — Source quality: every source type has a bias
Every source type has a bias, and the cross-checking only works on the sources you gave it. Marketing pages are claims to verify, never evidence — record them verbatim, then check them against the walkthrough. Docs and changelogs are solid for what a product does, silent on how well it works. Reviews surface real friction, but they skew towards the angry and the recent. Published teardowns are useful and often cite each other in a circle, so check the dates. And for standards, fetch the primary text and cite criteria by number. Two limits to remember: cross-checking cannot catch a claim that is wrong everywhere it appears, and the public web underrepresents anything paywalled, internal, or not in English.
Slide 10 — Good and bad cells in the same matrix
Here is what quality looks like at the level of a single matrix cell. On the left, cells that read fine and cannot be checked: best onboarding in the category, AI-powered setup, transparent pricing, an estimated price. On the right, the same cells done properly: a step count with dated screenshots, a verbatim claim marked contradicted because the walkthrough showed a template picker, three of eight competitors routing larger teams to sales despite a published price, and — the uncomfortable one — not collected, with the reason recorded. Two failures cause almost all of the bad cells: marketing claims laundered into observed facts, and empty cells filled in because empty looks unfinished. An honest empty cell beats a confident guess every time.
Slide 11 — Worked example: three competitors, one job-to-be-done
Let's trace a real run. The job-to-be-done: get a new team from signup to its first working project. Five project-management tools were torn down; here are three of them on four dimensions. Competitor A: four steps, opens into a populated template, invites deferred. Competitor B: seven steps and an AI-powered setup claim that the walkthrough contradicted — it was a relabelled template picker. Competitor C: eleven steps through a setup wizard. Then the row that mattered: what the second teammate sees. Across the whole set, invited teammates landed on a bare board with no orientation. That gap became the focus of the redesign. Notice the division of labour — the matrix found the gap, and the team decided it was an opportunity rather than a graveyard.
Slide 12 — Exercise: a single-flow teardown of one competitor
Time to run this at small scale. Pick one competitor and one flow — onboarding, pricing, or a core task — tied to a decision you actually face. Write a five-line dimensions note first: what evidence this flow requires, and what counts as not collected. Then capture it: ordered screenshots, the step count, and the positioning claim for that flow, verbatim, with the URL. Write your findings in three labelled layers — observations that point at files, inferences marked as inferences, and one recommendation. Finally, mark the claim: supported, contradicted, or unverified, and note one thing you could not see from the outside. Keep the folder. The same evidence-first habit carries straight into the next module.
Slide 13 — Summary, and the bridge to synthesis
Let's close the module. Scope before you run: name the decision, the evidence standard, what would change your mind, and choose the angles yourself. Know which kind of research you are doing — deep research reads what has been written, a teardown walks the products — and do not let one stand in for the other. Fix the rubric before collecting, make every cell trace to dated evidence, and let not collected stay empty. Keep observation, inference, and recommendation visibly separate. And remember what the matrix cannot do: it maps what exists, but deciding which gaps are opportunities is your judgment. In Module 3 we turn from the public web to your own data — synthesising interviews, tickets, and feedback into themes that trace back to quotes. See you there.