Methodology

UserTold.ai captures behavioral evidence from real interviews (what users actually did, said, and struggled with) and structures it into machine-readable evidence cards your agent can reason over.

This methodology is qualitative, source-backed, and delivery-oriented. It is designed to help builders find concrete product friction and verify the source context before creating work. It is not a statistical benchmark, survey replacement, or automatic product decision engine.

For the authoritative entity definitions and interview-to-work lifecycle, read Core Concepts. This page focuses on how to conduct and review the research well.

Methodological Basis

UserTold combines established user-research practices with an agent-readable evidence model:

Observed task behavior: Nielsen Norman Group (NN/g) describes usability testing as task-based observation that uncovers problems, opportunities, and user behavior in an interface. See Usability Testing 101. UserTold applies this through observe segments that record screen, speech, navigation, clicks, and product context while the participant completes a realistic task.

Think-aloud evidence: NN/g frames think-aloud testing as asking participants to use a system while verbalizing their thoughts. See Thinking Aloud: The #1 Usability Tool. UserTold preserves participant language from talk and observe segments as quotes, so evidence remains inspectable instead of being reduced to a summary.

Qualitative limits: NN/g distinguishes qualitative usability studies, which identify issues, from quantitative studies, which estimate population-level metrics. See Why 5 Participants Are Okay in a Qualitative Study, but Not in a Quantitative One. UserTold treats evidence cards as reviewable observations. Counts and confidence help triage; they do not claim statistical prevalence.

Jobs and decision context: The Christensen Institute describes Jobs to Be Done as a lens for the circumstances and functional, social, and emotional forces behind decisions. See Jobs to Be Done Theory. UserTold uses desired_outcome, hiring_criteria, firing_moment, and decision_point to preserve what progress the user was trying to make and why alternatives mattered.

Pattern grouping: Braun and Clarke describe thematic analysis as methods for interpreting patterned meaning across datasets. See Thematic Analysis. UserTold can cluster related evidence into review packets, but a human or project-aware agent still verifies source fit before delivery work is pushed.

Research Scaffold

Use this scaffold when designing a study or reviewing its output:

Define the decision: Write the product question this study should inform. Prefer questions that can change a product, onboarding, pricing, activation, or support decision.
Choose the evidence mode: Use talk for context and decision history, observe for real product behavior, and scripted speak for neutral instructions or transitions.
Capture source moments: Preserve verbatim quotes, timestamps, page context, and recent actions. The source moment is the unit of evidence; the summary is secondary.
Classify conservatively: Assign a signal_type only when the quote or behavior supports it. Use no_issue_found or smooth_completion when the observed path does not show actionable friction.
Group by pattern, not volume alone: Cluster evidence when multiple source moments point to the same product problem, user goal, or decision force. Do not promote a packet only because it has the most cards.
Verify before delivery: Inspect the linked transcript or recording, confirm the product area, check whether the issue is still relevant, and only then create or push work.
Close the loop after shipping: Resolve linked evidence when the tracker issue is completed, then watch future sessions for recurrence. Treat the absence of recurrence as a signal to keep watching, not as causal proof that the fix worked.
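
The "Verify before delivery" step can be sketched as a small gate that blocks work until each check is done. This is a minimal illustration only: DeliveryReview, unmet_checks, and ready_for_delivery are hypothetical names invented for this example, not UserTold types or API calls.

```python
from __future__ import annotations

from dataclasses import dataclass


@dataclass
class DeliveryReview:
    """Reviewer's answers for one packet (hypothetical structure, not a UserTold type)."""

    source_inspected: bool = False        # linked transcript or recording was opened
    product_area_confirmed: bool = False  # the friction maps to a known product area
    still_relevant: bool = False          # the issue has not already been fixed or removed


def unmet_checks(review: DeliveryReview) -> list[str]:
    """Return the names of the checks that still block creating or pushing work."""
    checks = {
        "source_inspected": review.source_inspected,
        "product_area_confirmed": review.product_area_confirmed,
        "still_relevant": review.still_relevant,
    }
    return [name for name, passed in checks.items() if not passed]


def ready_for_delivery(review: DeliveryReview) -> bool:
    """Work may be created or pushed only when no check is outstanding."""
    return not unmet_checks(review)


if __name__ == "__main__":
    review = DeliveryReview(source_inspected=True, product_area_confirmed=True)
    print(unmet_checks(review))        # ['still_relevant']
    print(ready_for_delivery(review))  # False
```

Returning the list of unmet checks, rather than a bare boolean, tells the reviewer or agent what to inspect next.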

What Counts as Source-Backed

A finding is source-backed when a reviewer can answer all of these from the evidence card or packet:

What did the participant say or do?
Where in the product did it happen?
What task, goal, or decision was in progress?
What source moment can be replayed or inspected?
Why does the proposed work follow from that evidence?

Source-backed does not mean the system knows prevalence, priority, or root cause by itself. Those require product judgment, additional data, or follow-up research.
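
One way to apply the five questions above is a completeness check over a card. The sketch below assumes hypothetical field names: only quote appears in the sample card on this page, while page_context, task, source_ref, and rationale are placeholders chosen for illustration and may not match the real schema in Core Concepts.

```python
from __future__ import annotations

# Hypothetical field names, one per reviewer question.
# Only "quote" is taken from the sample card; the others are assumptions.
REQUIRED_ANSWERS = {
    "quote": "What did the participant say or do?",
    "page_context": "Where in the product did it happen?",
    "task": "What task, goal, or decision was in progress?",
    "source_ref": "What source moment can be replayed or inspected?",
    "rationale": "Why does the proposed work follow from that evidence?",
}


def unanswered_questions(card: dict) -> list[str]:
    """Return the reviewer questions that the card leaves missing or blank."""
    missing = []
    for field, question in REQUIRED_ANSWERS.items():
        value = card.get(field)
        if value is None or not str(value).strip():
            missing.append(question)
    return missing


def is_source_backed(card: dict) -> bool:
    """A card passes only when every question has a non-empty answer."""
    return not unanswered_questions(card)


if __name__ == "__main__":
    card = {
        "quote": "I tried this three times and still can't find billing settings.",
        "page_context": "/settings",
        "task": "Update the payment method",
        "source_ref": "",  # no replayable moment linked yet
        "rationale": "Billing settings are hard to find from the settings page.",
    }
    print(is_source_backed(card))      # False
    print(unanswered_questions(card))  # ['What source moment can be replayed or inspected?']
```

Passing this check shows only that the card is inspectable. Prevalence, priority, and root cause still need product judgment.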

Classification Examples

The complete signal_type vocabulary and evidence-card anatomy live in Core Concepts. Apply those types conservatively. For example:

struggling_moment: "I tried this three times and still can't find billing settings."
desired_outcome: "I just want to export this to CSV without all these extra steps."
workaround: "I usually copy it into a spreadsheet and filter it there."
firing_moment: "If this keeps happening I'll go back to the old tool."
decision_point: "I almost picked the annual plan but wasn't sure about the refund policy."

Evidence JSON

{
  "id": "sig_abc123",
  "signal_type": "struggling_moment",
  "quote": "I tried this flow three times...",
  "confidence": 0.91,
  "intensity": 0.8,
  "session_id": "ses_xyz789",
  "timestamp_ms": 142300
}

Evidence cards are typed JSON with confidence scores. Agents can inspect the same source-linked evidence through the dashboard, CLI, MCP, or REST API. The classification supports review; it does not replace inspection of the linked source moment.
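
As a rough sketch of how an agent might consume a card, the code below parses the sample JSON into a typed record and orders cards for review. The field types are inferred from the sample alone, and EvidenceCard, parse_card, review_order, and source_moment are hypothetical helper names, not part of the UserTold CLI, MCP, or REST API. Real responses may carry more fields, in which case the strict constructor here would reject them.

```python
from __future__ import annotations

import json
from dataclasses import dataclass


@dataclass(frozen=True)
class EvidenceCard:
    """Fields and types inferred from the sample card (an assumption, not the full schema)."""

    id: str
    signal_type: str
    quote: str
    confidence: float
    intensity: float
    session_id: str
    timestamp_ms: int


def parse_card(raw: str) -> EvidenceCard:
    """Parse one JSON card; raises TypeError on missing or unexpected fields."""
    return EvidenceCard(**json.loads(raw))


def review_order(cards: list[EvidenceCard]) -> list[EvidenceCard]:
    """Order cards for triage, highest confidence first. This is not a prevalence estimate."""
    return sorted(cards, key=lambda card: card.confidence, reverse=True)


def source_moment(card: EvidenceCard) -> str:
    """Format the session and mm:ss offset a reviewer should replay."""
    minutes, seconds = divmod(card.timestamp_ms // 1000, 60)
    return f"{card.session_id} @ {minutes:02d}:{seconds:02d}"


if __name__ == "__main__":
    raw = """{
      "id": "sig_abc123",
      "signal_type": "struggling_moment",
      "quote": "I tried this flow three times...",
      "confidence": 0.91,
      "intensity": 0.8,
      "session_id": "ses_xyz789",
      "timestamp_ms": 142300
    }"""
    card = parse_card(raw)
    print(source_moment(card))  # ses_xyz789 @ 02:22
```

The source_moment helper keeps the pointer back to the recording next to the classification, so the sort order only decides what to inspect first.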

Evidence vs. Surveys