Qualitative coding: Thematic analysis tutorial for user research

Qualitative analysis is a foundational research methodology that enables researchers to interpret and make sense of non-numerical data, such as interview transcripts, open-ended survey responses, and observational notes. Unlike quantitative methods that focus on numbers and statistics, qualitative analysis delves into the richness of human experience, exploring the “why” and “how” behind behaviors, attitudes, and perceptions. By systematically coding qualitative data, researchers can transform raw data into meaningful insights that reveal patterns, relationships, and underlying themes. Thematic analysis, a widely used approach in qualitative data analysis, helps researchers identify and interpret recurring themes within qualitative data, making it possible to answer complex research questions and inform decision-making. Whether you’re conducting qualitative research in business, social sciences, or UX, the process of analyzing qualitative data is essential for uncovering deep, actionable understanding from your data set.

Data collection methods

Effective qualitative research begins with robust data collection methods designed to capture the depth and complexity of human experiences. Common approaches include in-depth user interviews, focus groups, direct observations, and open-ended survey questions. Each method offers unique advantages: user interviews provide detailed, personal accounts; focus groups encourage dynamic discussions and reveal group norms; observations capture real-world behaviors in context; and open-ended surveys allow participants to express themselves freely. The richness of qualitative data collected through these methods forms the foundation for thorough qualitative data analysis. Leveraging qualitative data analysis software can streamline the organization and analysis of this data, making it easier to identify patterns and generate insights. By carefully selecting and executing data collection methods, researchers ensure they gather the nuanced information needed for meaningful qualitative analysis and data-driven decision-making.

Focus groups

Focus groups are a powerful qualitative research method for data collection, bringing together a small group of participants to discuss a specific topic or product. This group setting encourages participants to share their attitudes, opinions, and experiences, often sparking new ideas and revealing consensus or disagreement within the group. Focus groups can be conducted in person or online, making them a flexible option for reaching diverse participants. In qualitative research, focus groups are especially valuable for identifying themes and patterns that might not emerge in individual interviews. In contrast to quantitative research, which seeks objective, numerical data, focus groups help uncover shared frustrations, unexpected use cases, or cultural differences in perception. By facilitating open discussion, researchers can collect rich qualitative data that informs product development, marketing strategies, and user experience improvements.

Understanding qualitative coding basics

Qualitative coding involves labeling text segments from user interviews, usability tests, and feedback to organize unstructured data and identify patterns and themes. Codes capture topics, actions, problems, or concepts, creating structure from narrative data.

For example, when a participant says, “I waste time searching through versions,” researchers might code this as “version control problems” and “team coordination friction,” capturing multiple ideas.

Coding reduces data volume, reveals patterns, and enables comparison across user groups. Effective coding balances detail and abstraction—too specific creates many codes, too broad loses meaning. The approach depends on research objectives.

This tutorial covers code types, initial coding, refinement, theme development, and examples from product research. Coding is iterative, involving multiple rounds of reading, coding, refining, and grouping based on study goals.

Types of codes in user research

Different code types serve different analytical purposes in product research. Understanding types helps researchers choose appropriate approaches for specific research objectives and questions.

Descriptive codes summarize content

Descriptive codes label what’s being discussed without interpretation simply summarizing topic or subject matter. These codes organize data topically enabling researchers to retrieve all mentions of specific subjects.

Notion interview segment: “I use databases for tracking projects, documenting processes, and managing team tasks.” Descriptive codes: “database usage,” “project tracking,” “process documentation,” “task management”

Descriptive coding provides starting point for analysis identifying what participants discussed. However, stopping at description misses deeper insights requiring interpretive analysis.

Process codes capture actions and behaviors

Process codes use gerunds (verbs ending in -ing) describing what users do, how they work, or actions they take. These codes reveal workflows, behaviors, and user strategies.

Calendly interview segment: “I check my calendar, copy available times into email, send to the person, then wait for their response.” Process codes: “checking calendar availability,” “copying time slots manually,” “sending availability via email,” “waiting for response”

Process codes reveal actual user behaviors versus stated preferences helping identify workflow friction and automation opportunities.

Emotion codes capture feelings and reactions

Emotion codes label feelings, frustrations, satisfactions, or emotional responses users express about products, features, or experiences revealing psychological dimensions beyond functional issues. Emotion coding is a recognized technique for identifying and tagging emotions in qualitative data to gain deeper insights into customer feelings and attitudes.

Superhuman interview segment: “I feel overwhelmed every morning seeing 200 unread emails. It’s stressful not knowing what needs immediate attention.” Emotion codes: “morning inbox anxiety,” “information overload stress,” “prioritization uncertainty”

Emotion codes identify pain points creating negative experiences and delight factors generating positive reactions informing prioritization and positioning.

Values codes identify beliefs and motivations

Values codes capture underlying beliefs, priorities, motivations, or principles driving user behaviors and decisions revealing why users care about specific problems or solutions.

Linear interview segment: “Speed matters because context switching kills productivity. I need to capture thoughts immediately before losing them.” Values codes: “values productivity,” “prioritizes speed,” “fears context loss,” “prefers immediate capture”

Values codes explain why features matter to users informing positioning, messaging, and value proposition development.

In vivo codes use participant language

In vivo codes, also known as vivo coding, is the practice of using participants’ own words or phrases as codes, especially when they use distinctive, memorable, or culturally significant expressions. This approach preserves the authentic user voice.

Slack interview where participant repeatedly says “information black holes” referring to lost messages. In vivo code: “information black holes” captures user’s vivid metaphor better than generic “message loss”

In vivo codes preserve user language useful for marketing copy, feature naming, or communicating findings to stakeholders using authentic voice.

Structural coding organizes data by predefined structures

Structural coding is used to categorize and organize interview responses or discourse data based on predefined structures, such as interview questions or topic areas. This technique helps researchers quickly segment and analyze large datasets by grouping responses according to the structure of the research instrument, facilitating deeper analysis.

After identifying code types, the qualitative coding process typically begins with open coding, which involves breaking down qualitative data into discrete excerpts and labeling them with codes. Next, focused coding is used to create a finalized set of codes and categories from the initial coding passes. Finally, axial coding relates codes or categories to one another to find relationships and links between them, supporting theme development and deeper insights.

When naming codes, it’s important to identify relevant codes that accurately reflect the data and research objectives. Using keywords in code names helps transform raw data into insightful, manageable units for further examination and effective analysis.

Initial coding process

Initial coding systematically applies codes to transcripts, creating foundational data organization supporting subsequent theme development and insight generation. This systematic approach is known as the qualitative coding process, which involves several structured steps to ensure thorough and reliable analysis.

Prepare transcripts for coding

Format transcripts consistently including speaker labels, timestamps, and paragraph breaks. Clean obvious transcription errors. Remove or anonymize identifying information protecting participant privacy.

Miro formats transcripts: “[00:05:23] Participant 3: When I facilitate workshops remotely, the biggest challenge is…” with consistent timestamp format and speaker identification.

Read transcript completely first

Read entire transcript without coding understanding overall content, participant perspective, and interview flow. Initial reading builds familiarity before systematic coding preventing premature conclusions.

After reading Webflow designer interview, researcher notes overall impression: “Experienced designer, frustrated with responsive design complexity, values visual development, skeptical about code export quality.”

Choose appropriate coding unit

Decide whether to code sentences, paragraphs, thought units, or exchanges. Smaller units enable precise coding but create more codes. Larger units provide context but less precision.

Airtable researchers code thought units which might be one sentence or several completing coherent idea: “I tried creating a view showing only high-priority tasks but the filtering interface was confusing so I gave up and just use manual sorting instead.”

Apply codes systematically line by line

Work through transcript methodically assigning one or more codes to each relevant text segment. Don’t skip sections assuming nothing important; insights often emerge from unexpected places.

Notion transcript segment: “Our onboarding takes forever because new people don’t understand databases. They need examples before concepts make sense.” Applied codes: “onboarding duration concerns,” “database conceptual complexity,” “learning through examples preference”

Code inductively from data

Let codes emerge from data rather than forcing content into predetermined categories. Stay open to unexpected patterns rather than only seeing what you expect.

Linear researcher starts expecting to code engineering workflow but discovers participants frequently mention stakeholder communication challenges. Rather than ignoring, researcher creates new codes capturing this unexpected but important theme.

Use consistent code naming conventions

Develop clear code naming patterns making codes easy to understand and apply consistently. Use active verbs for process codes, feeling words for emotion codes, and descriptive phrases for content codes.

Good code names: “struggles with permission settings,” “feels overwhelmed by options,” “values keyboard shortcuts,” “searches for lost files” Poor code names: “problem,” “user issue,” “feature,” “thing”

Create code definitions immediately

When creating new code, write brief definition explaining what it means and when to apply it. Definitions prevent code drift where same label gets used inconsistently.

Code: “export workflow friction” Definition: “User experiences difficulty, confusion, or extra steps when trying to export content from product in desired format. Includes format limitations, unclear options, failed exports, or multi-step workarounds.”

To organize and code qualitative data efficiently, many researchers use spreadsheet software like Microsoft Excel or Google Sheets, as well as specialized software programs for qualitative data analysis. These tools help structure data in rows and columns, making the data coding process more manageable and systematic.

Track code frequency informally

Note when codes appear frequently suggesting important patterns worth exploring. While qualitative analysis isn’t primarily about counting, frequency provides useful context about prevalence. Data coding can be facilitated by software programs and spreadsheet software, which help track code frequency and organize coded segments efficiently.

After coding 8 interviews, Figma researcher notices “component update breaking designs” appears 12 times across 6 participants suggesting widespread problem requiring investigation.

After your first round pass at coding qualitative data, you can begin to group your codes into categories and subcodes based on similarities. Further rounds of coding involve re-examining the codes and categories you’ve created, allowing for renaming, merging, and re-categorizing as necessary. Intercoder reliability (ICR), using metrics like Cohen’s Kappa, is used to measure consistency among multiple coders and ensure the reliability of your qualitative coding process.

When multiple researchers are involved in coding the same data, it is essential to use a detailed codebook to keep track of your codes and maintain organization throughout the data analysis process. Conducting intercoder reliability tests helps ensure that all researchers interpret and tag the data consistently, maintaining data integrity across the same data set. Collaborative refinement of the codebook and regular discussions among researchers further enhance the reliability and validity of the qualitative coding process.

Process Coding

Process coding is a specialized technique in qualitative research that focuses on capturing actions, routines, and sequences described by participants. By assigning codes to specific processes—such as “decision-making,” “problem-solving,” or “collaborating”—researchers can systematically analyze qualitative data to uncover how people behave and interact in various contexts. Process coding is particularly useful in thematic analysis, as it helps identify patterns in workflows, strategies, and user journeys across different data sources, including user interviews, observations, and open-ended surveys. For instance, analyzing user interviews with process coding might reveal common steps users take when troubleshooting a product, highlighting opportunities for streamlining or automation. Integrating process coding into your qualitative data analysis enables a deeper understanding of the dynamics at play, supporting the identification of actionable themes and the development of user-centered solutions.

Narrative Analysis

Narrative analysis is a qualitative data analysis method that centers on the stories and personal accounts shared by participants. Rather than breaking data into isolated codes, narrative analysis examines the structure, content, and meaning of entire stories to understand how individuals make sense of their experiences. This approach is particularly effective for exploring complex phenomena, as it uncovers the motivations, emotions, and cultural contexts that shape people’s actions and beliefs. In qualitative research, narrative analysis can be applied to data from user interviews, focus groups, and open-ended surveys to identify themes and patterns within participants’ narratives. For example, analyzing user stories about adopting a new technology can reveal common barriers, turning points, and sources of satisfaction or frustration. By using narrative analysis, researchers gain a holistic view of participants’ experiences, enabling them to generate deeper insights and develop more empathetic, effective solutions.

Refining and developing codes

Initial coding produces many provisional codes requiring refinement through merging similar codes, splitting overly broad codes, and developing code hierarchy organizing relationships.

Review codes for redundancy

Identify codes capturing same concept with different wording. Merge redundant codes choosing most descriptive label and updating all instances.

Initial codes: “can’t find feature,” “feature discovery problem,” “unclear feature location” Merged code: “feature findability issues” combining redundant concepts

Split overly broad codes

Identify codes applied to too many different situations lacking specific meaning. Split into more precise codes distinguishing important variations.

Broad code: “collaboration challenges” applied to real-time editing, permissions, feedback, and notifications Split codes: “real-time editing conflicts,” “permission confusion,” “feedback coordination problems,” “notification overload”

Develop code hierarchy

Organize codes into parent-child relationships with higher-level categories grouping related detailed codes creating structure without losing specificity.

Calendly code hierarchy:

Scheduling friction (parent)- Timezone coordination problems (child)
Availability communication challenges (child)
Rescheduling complexity (child)
Calendar sync issues (child)

After developing code hierarchies, it is important to consider the approaches used in qualitative coding. Inductive and deductive approaches are commonly applied—inductive coding generates codes and themes directly from the data, while deductive coding uses pre-existing theories or frameworks to guide the process. Deductive thematic analysis, for example, is a method where coding is structured by a theoretical framework, allowing researchers to test or extend existing theories. Many studies blend both approaches to ensure rigor and connect empirical findings to established concepts.

Create pattern codes

Identify patterns appearing across multiple descriptive codes worthy of their own pattern-level codes connecting related concepts.

Descriptive codes about Notion: “finds features accidentally,” “doesn’t understand databases initially,” “watches YouTube tutorials,” “learns from templates” Pattern code: “self-directed discovery learning” capturing common learning approach across specific behaviors

Apply codes to additional transcripts

After developing initial codebook with first 3-5 transcripts, apply codes to remaining transcripts testing whether codes work across all data and refining as needed.

Superhuman researchers code first 5 interviews developing codebook with 35 codes. While coding interview 6, they realize “email triage strategy” code needs splitting into “inbox zero approach” and “inbox processing approach” reflecting two distinct strategies.

Achieve code saturation

Continue coding until no new codes emerge with additional data suggesting coding scheme captures data comprehensively. Saturation typically occurs after coding 10-15 interviews in focused study.

Webflow researcher notes after coding 12 interviews that interviews 11 and 12 generated no new codes only additional examples of existing codes indicating saturation achieved.

Document code evolution

Track how codes changed through analysis process including merges, splits, and refinements creating audit trail showing analytical decisions and supporting transparency. Theory development and the creation of a conceptual model are influenced by the chosen methodology and research questions, shaping how codes evolve and are interpreted.

Linear maintains coding log: “Week 1: Created 47 initial codes. Week 2: Merged 8 redundant codes, split 3 overly broad codes, resulting in 42 codes organized into 8 categories.”

A well-developed conceptual model, grounded in a clear theoretical framework, provides a guide for the investigation and helps identify significant factors and relationships in the data. This process supports rigorous theory development and ensures that qualitative coding is both systematic and meaningful.

Developing themes from codes

Thematic analysis moves beyond descriptive coding to interpretive analysis identifying broader patterns, meanings, and insights across coded data. Thematic analysis coding is used to systematically identify key themes and significant themes in qualitative data, helping researchers organize and interpret open-ended responses or textual data.

Group codes into potential themes

Review all codes identifying which ones relate to similar concepts, problems, or patterns. Use visual methods like affinity mapping physically or digitally grouping related codes.

Notion researchers print all 38 codes on sticky notes arranging on wall into groups: “Learning and onboarding” (8 codes), “Collaboration and sharing” (7 codes), “Information organization” (12 codes), “Workflow integration” (6 codes), “Performance and reliability” (5 codes).

After grouping codes, conducting thematic analysis involves several systematic steps: transcription of qualitative data, familiarization with the data set, coding for recurring ideas, and theme development. This process allows researchers to identify patterns of meaning and organize qualitative insights into coherent themes.

Identify theme**-level patterns**

Look for overarching patterns saying something significant about user experiences, needs, or behaviors beyond what individual codes capture. Themes answer “What does this mean?” and “Why does this matter?”

Figma codes related to components, variants, updates, and naming cluster into theme: “Design system maintenance burden.” This theme reveals tension between standardization benefits and maintenance costs not obvious from individual codes alone.

Write theme descriptions

Develop clear narrative descriptions explaining each theme including what it means, how it manifests, why it matters, and boundaries distinguishing it from other themes.

Theme: “Onboarding cliff” Description: “New users experience steep learning curve in first week due to conceptual complexity, unfamiliar mental models, and lack of guided learning paths. Characterized by confusion, frustration, and abandonment threats before users achieve basic competency. Particularly acute for users without prior experience in similar tools.”

In reflexive thematic analysis, the researcher’s active role in interpreting data is central to developing qualitative insights, as their subjective experience and reflexivity shape the meaning and depth of each theme.

Validate themes against data

Return to original transcripts checking whether themes accurately represent participant experiences. Strong themes should have clear supporting evidence across multiple participants without cherry-picking quotes.

Slack researcher tests theme “communication overload from channel proliferation” reviewing all 18 transcripts confirming 14 explicitly discuss too many channels with specific examples validating theme strength.

Thematic analysis is a popular and rigorous method among qualitative researchers, applicable across various methodologies such as constructivist, positivist, grounded theory, and interpretive phenomenology. It is crucial for forming conceptual models in grounded theory, ethnographic, and narrative research, and enhances the trustworthiness and validity of qualitative findings.

Refine theme scope and boundaries

Ensure themes are distinct from each other with clear boundaries. If themes overlap significantly, consider merging or redefining boundaries creating clearer analytical categories.

Initial themes: “Real-time editing conflicts” and “Version control confusion” overlap substantially. Refined into single theme: “Multi-user editing challenges” with two subthemes distinguishing simultaneous editing from version history aspects.

Develop theme hierarchy

Organize themes into main themes and subthemes creating structure showing relationships between broader patterns and specific manifestations.

Calendly theme hierarchy: Main theme: “Scheduling coordination friction”

Subtheme 1: “Availability communication complexity”
Subtheme 2: “Timezone coordination burden”
Subtheme 3: “Rescheduling administrative overhead”

Name themes meaningfully

Create descriptive theme names clearly communicating meaning to stakeholders. Good names are specific, memorable, and grounded in data using user language when possible.

Weak theme names: “Problem 1,” “User issue,” “Feature request” Strong theme names: “Permission confusion blocking collaboration,” “Export workflow requires workarounds,” “Onboarding overwhelm threatening retention”

The systematic thematic analysis process—including transcription, familiarization, coding, and theme development—enhances the rigor and replicability of qualitative research findings. By applying thematic analysis coding, researchers can transform qualitative data into significant and key themes, providing actionable qualitative insights that deepen understanding of user experiences and behaviors.

Practical coding examples

Real examples demonstrate coding and theme development in action, showing how researchers move from raw transcript to actionable insights. Qualitative data coding enables researchers to systematically analyze actual data and transform qualitative data into meaningful themes.

Example 1: Notion database complexity

Raw transcript segment: “I wanted to create a simple task list but got confused by all the database options. Do I need a relation? What’s a rollup? I spent an hour reading docs and watching YouTube before giving up and just using a bullet list. Databases seem powerful but too complicated for basic task tracking.”

Applied codes:

“database feature complexity” (descriptive)
“conceptual confusion about features” (descriptive)
“seeking external learning resources” (process)
“feeling overwhelmed by options” (emotion)
“abandoning advanced features” (process)
“defaults to simple alternatives” (process)
“values simplicity over power” (values)

In this example, coding in qualitative research helps make sense of complex data by breaking down the actual data into specific codes, supporting the discovery of underlying ideas and theory building.

Theme connection: This segment supports theme “Power-simplicity tension” where users perceive database power but abandon due to complexity preferring simple alternatives for basic needs.

Example 2: Linear keyboard shortcuts

Raw transcript segment: “Once I learned keyboard shortcuts, everything changed. I can create issues, assign them, set priorities, and add to projects without touching my mouse. It makes me feel like a power user. Now when I use Jira, I’m frustrated by how slow it feels clicking through everything.”

Applied codes:

“keyboard shortcut mastery” (process)
“mouse-free workflow preference” (process)
“experiencing speed gains” (descriptive)
“feeling accomplished and expert” (emotion)
“comparing favorably to competitors” (descriptive)
“values efficiency and speed” (values)

Here, qualitative data coding allows researchers to identify and quantify common themes in customer language, supporting data-driven decisions and theory development.

Theme connection: Supports theme “Keyboard-first workflow as differentiator” where power users achieve efficiency through shortcuts creating competitive advantage and strong product loyalty.

Example 3: Miro remote workshop challenges

Raw transcript segment: “Remote workshops are harder than in-person. People’s attention wanders, they multitask, you can’t read body language. The tools help but don’t replace being in the same room. I spend extra time on engagement activities keeping people involved and still worry people tune out.”

Applied codes:

“remote versus in-person comparison” (descriptive)
“attention and engagement concerns” (emotion)
“missing non-verbal communication” (descriptive)
“compensating with extra engagement activities” (process)
“worrying about participant experience” (emotion)
“acknowledging tool limitations” (descriptive)

This example demonstrates how coding in qualitative research supports the systematic interpretation of actual data, helping researchers discover patterns and build theories from complex feedback.

Theme connection: Supports theme “Remote facilitation requires different skills” where facilitators adapt techniques for remote context with tools enabling but not fully replacing in-person dynamics.

Coding enables systematic, rigorous, and transparent analysis of qualitative data, making the research process more structured and reflexive. By transforming qualitative data into organized codes and themes, researchers ensure the validity and reliability of their findings. Qualitative coding is essential for making sense of complex data, supporting theory building, and enabling data-driven decisions based on customer feedback.

Tools for coding and thematic analysis

Various tools support coding and theme development from simple to sophisticated. Choose based on project size, budget, team collaboration needs, and technical comfort.

**Spreadsheet coding (Excel, Google Sheets)**Spreadsheet software such as Microsoft Excel or Google Sheets can be used to organize and code all your data by creating columns for participant ID, quote, codes applied, and notes. This approach is good for small projects (under 10 interviews) and researchers comfortable with spreadsheets.

Strengths: Free, familiar, flexible, portable
Limitations: Manual code management, no automation, limited collaboration

**Document annotation (Google Docs, Word)**Use comments or highlights applying codes directly to transcript. Good for solo analysis and maintaining context.

Strengths: Easy to use, codes appear with original text, accessible
Limitations: Difficult tracking codes across documents, no aggregation features

**Specialized qualitative software (Dovetail, NVivo, Atlas.ti, MAXQDA)**Specialized software programs and qualitative data coding tools like Delve (a Computer-Assisted Qualitative Data Analysis Software, or CAQDAS) are commonly used in modern qualitative research. These purpose-built tools offer systematic coding, code management, theme development, and reporting features. Automated coding of qualitative data can also be achieved using thematic analysis software that employs AI and natural language processing.

Strengths: Comprehensive features, team collaboration, pattern visualization, audit trails
Limitations: Learning curve, subscription costs ($100-$1,000+ monthly)

**Flexible databases (Airtable, Notion)**Create custom structures with tables for transcripts, codes, themes, and insights with linking between elements.

Strengths: Customizable, good collaboration, filtering and views, affordable
Limitations: Requires setup, not specialized for coding, manual processes

**Visual collaboration tools (Miro, Figma)**Visual tools such as Miro and Trello can be used for affinity mapping, mapping codes and categories, theme development, and collaborative synthesis workshops, though they are not ideal for transcript coding.

Strengths: Visual thinking, team workshops, spatial organization
Limitations: Not designed for coding, informal structure

Most researchers combine tools: Google Docs for transcript annotation, Sheets for codebook maintenance, and Miro for theme development workshops creating effective workflow with free or low-cost tools. Organizing and coding all your data digitally is essential for easier management, searchability, and analysis.

Common coding mistakes

Avoiding common pitfalls improves coding quality, analysis rigor, and insight usefulness for product decisions. Content analysis and data coding are essential parts of the research process, enabling systematic and rigorous analysis of qualitative data.

Creating too many codes

Researchers sometimes create hundreds of overly specific codes making pattern recognition impossible. Coding enables the identification of meaningful patterns and themes within complex data. Aim for 30-70 codes at appropriate abstraction level.

Solution: Review codes regularly merging similar ones and maintaining appropriate specificity-generality balance.

Stopping at descriptive coding

Coding what participants discussed without interpreting meaning or developing themes wastes qualitative data’s analytical potential.

Solution: Push beyond description asking “What does this mean?” and “Why does this matter?” developing interpretive themes.

Inconsistent code application

Applying same code differently across analysis or using different codes for same concept reduces reliability and makes pattern identification difficult.

Solution: Write code definitions immediately, review coding regularly checking consistency, and have second coder review subset checking reliability.

Forcing codes onto data

Starting with predetermined codes and forcing data into categories misses unexpected patterns and user perspectives contradicting assumptions.

Solution: Use inductive coding letting codes emerge from data. Start open-ended building codebook from actual content rather than predetermined categories.

Ignoring contradictions

Overlooking data contradicting emerging patterns or themes creates misleading findings missing important nuance or user diversity.

Solution: Actively search for divergent cases exploring why some users differ from dominant patterns revealing segments or contextual factors.

Weak connection to themes

Creating many codes without developing meaningful themes produces lists rather than insights failing to answer research questions.

Solution: Regularly step back from detailed coding asking what patterns mean and organizing codes into coherent themes answering “So what?”

Overall, coding qualitative data through content analysis and thematic analysis makes the research process more systematic and rigorous, providing transparency and enhancing the rigor and replicability of qualitative research findings.

Frequently asked questions

How many codes should I create?
Most user research projects generate 30-70 codes providing sufficient specificity without overwhelming complexity. Fewer codes (under 20) may be too general while many codes (over 100) suggest need for consolidation.

Should I use predetermined codes or create codes from data?
For exploratory research, use inductive coding, which is often applied when no prior expectations exist and lets codes emerge from data. For focused or evaluative research, deductive coding is a top-down approach where you start by developing a codebook with your initial set of codes based on your research questions or existing frameworks. Most projects benefit from a hybrid approach, combining both inductive and deductive strategies throughout the coding process.

How do I know when I’ve found themes?
Themes emerge when patterns appear across multiple participants, codes cluster around similar concepts, and patterns say something meaningful about research questions beyond simple description. Grounded theory is a qualitative research methodology that develops themes and theories directly from data through an iterative, inductive process.

Can one quote have multiple codes**?**
Yes, rich quotes often warrant multiple codes capturing different concepts, emotions, or processes within single segment. However, limit to 3-5 codes per segment avoiding over-coding.

How is coding different from tagging?
Coding is systematic analytical process interpreting data and developing themes. Tagging is organizational labeling categorizing content. Coding is more rigorous and interpretive than simple tagging.

What if different coders disagree on codes?
Disagreement is normal and valuable. Discuss differences reaching consensus on code definitions and application. Calculate inter-rater reliability for subset (aim for 80%+ agreement) demonstrating systematic coding.

‍