Classical epidemiology is the study of the distribution and determinants of disease in populations. Clinical epidemiology applies the principles of classical epidemiology to the prevention, detection, and treatment of disease in a clinical setting. The two main types of epidemiological studies are observational and experimental. Descriptive observational studies (e.g. case series, ecological studies) characterize factors related to individuals with a particular outcome. Analytical studies (e.g., cohort studies, case-control studies) seek to assess the association between exposures and outcomes. In experimental studies (e.g., randomized controlled trials), an intervention is performed to study its impact on a particular outcome. Measures such as proportions, rates, and ratios can be calculated for the data collected. An association between two variables (i.e., exposure and outcome) does not necessarily imply a causal relationship. Other reasons for observed associations in epidemiological studies may include errors (e.g., random error, systematic errors), confounding, and reverse causality.
The following concepts are discussed separately: incidence, prevalence, mortality rates), (e.g., relative risk, absolute risk reduction), measures used in the (e.g., sensitivity, specificity), and , , the practice of , and foundational statistical concepts (e.g., , , , ).(e.g.,
Fundamental concepts in epidemiology
- Classical epidemiology: the study of the determinants and distribution of disease in populations 
- Clinical epidemiology: the application of the principles of classical epidemiology to patients in a clinical setting
Goals of research 
- Descriptive research: to summarize characteristics of a group
- Predictive research: to forecast outcomes (e.g., for prevention, diagnosis, or management)
- Explanatory research or causal inference: to establish causal mechanisms
Elements of epidemiological studies
- Population (epidemiology): the total number of people in the group being studied 
- Sample (epidemiology): a group of people selected from a larger population; meant to be representative of the larger population
- Data (epidemiology): information collected during observation and/or experimentation that is used as a basis for analysis and discussion 
- Exposure: a factor that is potentially associated with a particular outcome
- Intervention: a treatment, drug, or management step that is being studied in an experimental study
- Outcome: an endpoint (e.g., a disease or health-related event) that may occur after exposure to a risk factor or intervention.
- A period of apparent inactivity between stimulation (e.g., infection with a pathogen, exposure to a toxin, onset of a disease process, administration of a drug) and response (e.g., signs and symptoms, pharmacological effects). 
- In clinical studies, the latency period must be considered to ensure adequately long follow-up duration. 
Research questions can be formulated using the Population, Intervention, Comparison (or Control group), and Outcome:
Study group: a group of participants with a common feature that distinguishes them from other participants in the study.
- In a clinical trial, participants are divided into a group that receives a particular intervention and a group that does not receive the intervention.
- In a cohort study, participants are divided into a group that has had a particular exposure and a group that has not had the exposure.
- In a case-control study, participants are divided into a group that has a particular outcome and a group that does not have the outcome.
Control group (definition varies by study type)
- In a clinical trial, the control group is the people in the sample that do not receive the intervention (e.g., a drug), while the treatment group is the people in the sample that do receive the intervention.
- In a case-control study, the control group is the people who do not have the outcome being studied (e.g., a disease); the control group is compared to the cases, i.e. the people who have the outcome.
Basic data parameters 
These basic measures are often used to describe or compare findings from epidemiological studies. See also “Measures of disease frequency”, “Measures of association”, and “Evaluation of diagnostic research studies.”
- Comparison of one part of the population to the whole
- Proportions are usually expressed as percentages.
- A measure of the frequency of an event in a population over a specific period of time
- Rates are usually reported as numbers of cases per 1,000 or 100,000 in a given time unit.
- Crude rate: applies to the entire population (specific characteristics are not taken into account)
- Specific rate: applied to a population group with specific characteristics (e.g., sex-specific, age-specific)
- Standardized rate (adjusted rate): crude rates that have been adjusted for potential confounding characteristics to allow for comparison between different populations (e.g., age-standardized rates are commonly used for death rates)
Elements describing population health
See “Population health” for the following:
Principles of study design
- Study designs may be:
- Observational: Participants are not assigned to any intervention.
- OR experimental: Some participants are assigned to receive an intervention.
- The choice of study design should be tailored to the research question.
- Certain study designs typically produce results with a higher than others.
Choose study designs and methods that maximize the strength of study results while minimizing , , and , within the limits of available resources and ethical standards.
Overview of types of epidemiological studies 
|Observational studies||Experimental studies|
|Descriptive studies||Analytical studies|
|Intervention|| || |
|Purpose|| || || |
|Description|| || || |
- No intervention involved
- Patients are observed and the clinical course of the disease is studied.
- The observations are used to form a hypothesis.
|Overview of descriptive studies|
|Case report||Case series report ||Ecological study (correlation study)||Cross-sectional study (prevalence study) |
|Description|| || || || |
|Study method|| || || |
|Disadvantages|| || |
|Example|| || |
- Can be analytical, e.g., when used to assess the association between an exposure and an outcome
- See “ .”
Case-control study 
- Aim: : to study if an exposure (i.e., a risk factor) is associated with an outcome (i.e., disease)
- Researchers begin by selecting patients with the disease (cases) and individuals without the disease (controls).
- Controls are selected from the same source population and ideally have similar characteristics (e.g., gender, age) to the cases to reduce potential confounding.
- The is then determined between these groups.
- Helps determine whether individuals with a disease are more likely to have been exposed to a risk factor than patients without that disease.
- Cost-efficient (exposure and outcome are measured only once)
- Can be used to study rare diseases
- Can be used to study diseases with long
- Example: A group of patients with histologically confirmed cervical cancer (cases) is compared to otherwise similar patients without histologically confirmed cervical cancer (controls) for the presence of human papillomavirus (exposure).
A case-control study generally examines a small population group over a short period of time (less cost-intensive) and evaluates the association between multiple exposures and one outcome. A cohort study generally examines a large population over a long period of time (more cost-intensive) and determines how one exposure is associated with multiple outcomes.
Cohort study 
- Aim: to study the incidence rate and whether a given exposure is associated with the outcome of interest
- The researchers gather a group of study participants who have common characteristics.
- Participants are then classified into two groups: exposed and unexposed.
- The incidence of the outcome of interest is compared between the two groups.
|Types of cohort studies|
|Prospective cohort study||Retrospective cohort study|
|Description||Study begins before the groups develop an outcome of interest||Study begins after the exposure and outcome of interest have already occurred|
|Exposure||Study participants are categorized into an exposed group and an unexposed group.||Study participants are categorized into a group that was previously exposed to a given risk factor (exposed; e.g., smoking) and a group that was not (unexposed).|
|Exposure of interest has to be present prior to outcome of interest.|
The participants are followed prospectively for a period of time to see whether there is a difference in the rate at which the exposed and unexposed groups develop the outcome of interest.
|Data previously collected about the participants is compared to see whether there was a difference in the rate at which the exposed and unexposed groups developed the outcome of interest (e.g., lung cancer) over a period of time.|
Individuals with a smoking history of ≥ 1 pack of cigarettes a day (exposed group) are compared to individuals who are nonsmokers to see if there is a difference in the proportion of patients in each group that develop lung cancer (e.g., the outcome) within a specific follow-up period.
|Individuals with a smoking history of ≥ 1 pack of cigarettes a day (exposed group) 5 years ago are compared to individuals who were nonsmokers 5 years ago to see if there is a difference in the proportion of patients in each group that eventually developed lung cancer (outcome) within a specific follow-up period.|
|Measurements must be taken at a minimum of two points in time.|
In cohort studies, the study sample is selected based on exposure to a risk factor. In case-control studies, the study sample is selected according to having a disease or not, and then it is determined which participants were exposed to a risk factor.
Community survey 
- Description: a study design in which self-reported data is collected from a large cohort
- Researchers identify a study question and compose a study form to gather relevant data
- A sample of study participants is selected from the general population (can be random or nonrandom).
- Data are gathered from study participants either via interviewing or filling in a pre-designed form.
- The data are then analyzed with the appropriate statistical methods.
- Requires fewer resources than other study designs
- Requires a relatively small amount of time to complete
- Can cover a large number of study participants (easy to get a representative sample)
- Response rate is hard to control.
- All data are subjectively reported by study participants and therefore may be imprecise.
Twin concordance study
- Aim: to determine the influence of genetic and environmental risk factors on the development of a disease
- Study method: comparing the frequency of a disease in twins (monozygotic or dizygotic)
- Example: The probability of twins both being diagnosed with Hodgkin disease is compared among monozygotic and dizygotic twin pairs. If the probability is higher among monozygotic twin pairs than dizygotic twin pairs, genetic factors are likely involved.
- Aim: to determine the influence of genetic and environmental risk factors on the development of a disease
- The frequency of disease is compared between adopted children whose biological parents have the disease and adopted children whose biological parents do not have the disease 
- Among adoptees with a disease, the frequency of the disease is compared among birth and adoptive parents.
- Example: Two groups of adults that were adopted are studied. Individuals in the first group have a biological parent with schizophrenia (exposure), and individuals in the second group have biological parents without schizophrenia. The prevalence of schizophrenia (outcome) is compared between the two groups; if the prevalence is higher in children of parents with schizophrenia, genetic factors are likely involved.
- Aim: a retrospective study to analyze data obtained from patient registries
- Study method
An experimental study is one in which the researcher manipulates one or more independent variables to see how the changes affect one or more outcomes (dependent variables). Experimental studies can be divided into those that occur in clinical settings (e.g., randomized controlled trials) and those that occur in community settings (e.g., field trials and community trials).
Randomized controlled trials (RCTs) 
- Aim: to determine the possible effect of a specific intervention on a given population
- Minimizes bias
- Can demonstrate causality
- Cannot be used to evaluate rare diseases
- Cannot be used when treatments have well-known adverse side effects
- Expensive and time-consuming
- Results may not be applicable to “real world” settings, and may not be generalizable to other populations.
Randomized controlled trials are considered the gold standard for testing interventions.
RCT design elements
- Enrollment: Study participants should be selected according to predefined inclusion and exclusion criteria.
- Intervention group: a group of study participants that receive the studied intervention.
Control group: a group of study participants that are compared to the intervention group and do not receive the studied intervention. 
- Placebo-controlled trial: Control group participants receive a .
- Study designs in which control subjects receive a treatment used in standard practice:
- Superiority trials: aim to prove that the studied intervention is better than an existing intervention
- Equivalence trials: aim to prove that the studied intervention is as effective or has nearly the same effectiveness as an existing intervention
- Noninferiority trials: aim to prove that the studied intervention is not inferior to an existing intervention 
- The act of randomly assigning study participants into either the intervention group or the control group to reduce selection bias and confounding bias in clinical trials.
- The absence of a significant difference between the intervention group and the control group in certain baseline characteristics (e.g., age, gender, disease severity) suggests effective randomization.
- A procedure that ensures that the person who performs randomization remains unaware of which participants are allocated to which groups.
- Examples of allocation concealment methods include the following sequentially numbered sealed envelopes and central randomization (e.g., telephone or internet-based).
Blinding: the practice of not informing an individual or group about which study participants are part of the control group and which are part of the treatment group (used to reduce bias)
- Single-blind study: Study participants do not know whether they are part of the control or treatment group.
- Double-blind study: Neither the researchers nor the study participants know which study participants are part of the control group and which are part of the treatment group. Double blinding is the gold standard when studying treatment outcomes. 
- Triple-blind study: Neither the researchers, the study participants, nor the data analysts know which study participants are part of the control group and which are part of the treatment group.
- In blinded clinical trials, unblinding is the intentional revelation to investigators and/or research subjects of participant allocation into either the intervention or control group.
- May take place after completion of the trial or because of specific circumstances (e.g., pregnancy, adverse events that affect safety)
RCT variants based on the study design 
Parallel group design
- Definition: an experimental study design in which participants remain within their assigned groups for the entire duration of the study
- Parallel group design is the most commonly used RCT variant.
- Relatively simple and easy to implement
- Suitable for studying irreversible or long-lasting effects
- Definition: an experimental study design in which each participant switches from the intervention group to the control group or vice versa with a washout period in between
- The order in which the participant is assigned to the intervention group or control group is typically randomized.
- Increased dropout rates because of the long duration of the study
- Cannot be performed if the intervention has a long-lasting effect because the effects of one phase can persist into the next (carryover effect)
- Definition: an experimental study design in which multiple interventions are studied simultaneously by assigning participants to various combinations of interventions and placebo
- Can investigate a high number of interventions simultaneously
- Can identify potential interactions between interventions (e.g., synergistic effects of drug combinations)
- Interim data analysis and safety monitoring can be complicated.
- Participant recruitment and protocol adherence are more demanding.
- Definition: an experimental study design in which the unit of randomization is a group rather than an individual participant
- Lower implementation cost and easier enrollment
- Decreases risk of contamination of intervention
RCT variants based on the type of intervention
Clinical drug trials 
- Aim: to assess the safety and efficacy of new medications in human subjects
- Study methods
Surgical and medical device trials 
- Aim: to assess the safety and efficacy of surgical intervention (surgical trial) or a device used for treatment or diagnostics in human subjects (medical device trial)
- Study methods
- Implementing a placebo (e.g., sham surgery, sham device) and double-blinding may not be feasible or ethical in many circumstances.
- Standardization of a surgical intervention is difficult to achieve because of individual variations in surgical technique.
- Participant recruitment is more difficult than with clinical drug trials.
Interim analysis and safety monitoring 
- Every experimental study involving human subjects requires a monitoring plan to ensure participant safety and scientific merit.
Data and safety monitoring board
- An independent panel of experts who periodically review the conduct and data of ongoing trials to ensure participant safety and scientific merit.
- A data and safety monitoring board is required for nearly all phase 3 trials and for phase 1 and phase 2 trials that are multicenter, blinded, or involve a high-risk intervention.
- Interim analysis
- Serious adverse event (clinical trial): an adverse event during a clinical trial that is life-threatening, causes or prolongs hospitalization, or results in congenital anomalies, significant morbidity, or death
Unexpected adverse event (clinical trial)
- An adverse event during a clinical trial whose nature, severity, or frequency is unknown at the start of the trial (e.g., not described in protocol-related documentation of the intervention or product labeling by the manufacturer) or unexpected given the participant's underlying diseases or risk factors
- Can also occur after market approval (i.e., during phase 4 trials)
- An unexpected adverse event, especially if it is serious, should result in temporary discontinuation of the clinical trial until the reasons for the event are determined.
- If the unexpected adverse event is determined to be related to the trial, it may be necessary to modify the research protocol to minimize the risk to participants (e.g., by altering the exclusion or inclusion criteria, changing the dosing, increasing the monitoring frequency).
- If it is not feasible to minimize risk to an acceptable level, the trial should be terminated.
Discontinuing experimental studies
Ongoing trials are discontinued if any of the following become evident on interim analysis:
- Evidence of operational futility (e.g., poor enrollment rate, higher than expected dropout rate, clearly inferior treatment efficacy)
- Safety concerns
- Early evidence of clear benefit
- A set of statistical criteria established before a clinical trial that specifies when the intervention should be stopped because of safety concerns from expected adverse events, operational futility, or early evidence of clear benefit.
- Should be specified in the study protocol
- Ongoing trials are discontinued if any of the following become evident on interim analysis:
Analysis upon RCT completion
Intention-to-treat and per-protocol analyses
|Methods of analysis for randomized controlled trials|
|Intention-to-treat analysis ||Per-protocol analysis |
|Description|| || |
Subgroup analysis 
- Aim: to investigate the heterogeneity of results in a study or recognize significant discrepancies or similarities in the treatment outcome among different subgroups of patients
- All participants are stratified into subgroups, according to shared characteristics (e.g., sex), in order to compare them.
- Participants can be stratified before the study (prespecified analysis) or after the study (post-hoc analysis).
- To account for the multiple comparisons problem that arises in subgroup analyses, the significance level should be adjusted for multiplicity/multiple testing (see “ ”).
- Advantages: may show stable treatment effects over similar subgroups of patients
Other types of experimental studies
- Aim: to determine the effect of disease prevention interventions in individuals who do not already have a disease
- Example: observation of children who did and did not receive the Salk vaccine for prevention of poliomyelitis to see whether they developed paralysis or death
Other types of studies
Systematic reviews and meta-analyses are considered secondary research study designs as they analyze data from multiple primary research studies. Survival analyses can be performed on data from prospective cohort studies or RCTs.
Systematic review 
- Aim: to answer a defined research question
- Researchers collect and summarize evidence from the existing literature that fits established criteria.
- Quality assessment of the study is achieved by methods such as the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) statement (see “Tips and Links”).
- A systematic review may include a statistical analysis of the data (i.e. metaanalysis).
- Advantages: can improve evidence-based clinical decision-making
- Cannot correct issues with the quality of the individual studies included in the review
- Susceptible to publication bias
- Aim: : to increase and achieve more precise results
- Data from multiple studies are systematically assessed, combined, and processed with statistical methods.
Risk-of-bias assessment (See also “Systematic errors”).
- Bias in metaanalysis can stem from the individual studies included in the analysis or from the methodological flaws of the metaanalysis itself.
- Risk of bias for each study included in the meta-analysis is assessed semiquantitatively with risk-of-bias scales or checklists.
- Funnel plots can be used to visually represent bias and
- The results are often reported graphically using forest plots.
- Advantages: can identify similarities and/or differences between individual studies
- Definition: a graph that combines the estimated results from multiple studies investigating the same question and visualizes each study's effect size, confidence interval, and calculated overall effect
- X-axis: primary outcome measure (e.g., relative risk)
- Central vertical line on x-axis: null value (e.g., RR=1, ARR=0)
- First column: a list of the individual studies
- Squares: the results (i.e., point estimate) of each study
- Vertical lines: the exact point estimate
- Horizontal lines: the confidence interval of the result
- Area of the squares: proportional to the weight that the individual study is given in the overall metaanalysis (e.g., larger square due to larger sample size and/or statistical power)
Diamond at the bottom of the plot: summary combining the results of all studies
- Midpoint: the point estimate of the combined result
- Width: the confidence interval of the combined result
Survival analysis (prognosis study) 
- To determine the average time to a given outcome identified on follow-up
- Often used to measure disease prognosis
Always prospective in nature (i.e. using data from cohort studies or RCTs)
- Time-to-event analysis: Individual follow‑ups are performed from the onset of a disease to a chosen endpoint (e.g., death, development of a particular complication), or after exposure to a risk factor until the onset of a disease.
- Five-year survival rate: the percentage of patients with a particular disease who have survived for 5 years after the initial diagnosis
- In survival analysis, a hazard ratio is often calculated to compare outcomes among two groups.
- Always prospective in nature (i.e. using data from cohort studies or RCTs)
- Censored cases: Not all participants will have the study endpoint during the period of observation; no prediction can be made for these participants.
- Censoring can be caused by the following:
- The study period ending before participants experience the outcome of interest
- Participant drop out
- Competing events (e.g., death from a different cause)
Kaplan-Meier analysis 
- Used to analyze incomplete time-to-event data and to estimate the survival of subjects over a set period of time following a certain treatment
- Ideal for estimating the survival of a cohort composed of a small number of cases
- Events of interest include death, treatment effectiveness, and recovery.
- Allows for an estimation of survival over time even if participants are studied over different time intervals (e.g., some participants drop out or are lost to follow-up)
- Data on survival analysis can be graphically displayed as a Kaplan-Meier curve.
Kaplan-Meier curve: graphical representation of Kaplan-Meier analysis in a step-shaped diagram
- Allows for an estimation of the probability that the subject will survive up to a point in time
- The horizontal axis represents time.
- The vertical axis represents the estimated probability of survival.
- Time intervals are defined by specific events: a time interval ends when an outcome of interest occurs.
- Probability of survival for each time interval = number of patients for whom the event has not occurred/number of patients who are at risk
Qualitative studies 
- To understand the subjective perspective of individuals on their experience of reality and the implications of these perspectives and experiences through the analysis of qualitative data
- Hypothesis generation
- Structural or thematic coding of the gathered data is used to group data and draw insights.
- No statistical analysis is performed.
Data collection methods
- Focus groups
- Surveys with open-ended questions
- Archive work and review of recorded sources (e.g., text, audio, video)
Qualitative research methodologies 
- Grounded theory: the induction of hypotheses from systematically collected and analyzed data
- Ethnography: the study of culture and social interactions of a particular group of people
- Narrative inquiry: the study of narratives as the basic unit of describing and recording human experience
- Phenomenological hermeneutics: a study of the lived experiences and perceptions of individuals within specifc contexts (e.g., cultural context, historical context)
- Action research: a research method that involves collaboration between researchers and individuals in a particular setting to solve problems
- Case study: the systematic investigation of a complex phenomenon within a specific context to gather insight into the phenomenon of interest
Random errors, bias, and confounding
Random error 
- Definition: an error that occurs due to chance and/or limitations of precision
- Solutions: can be reduced by repeated measurements and averaging over a large number of observations
Systematic error (bias)
- Definition: An error in the study design or the way in which the study is conducted that causes systematic deviation of findings from the true value
- Risk of bias
- Description: The individuals in the sample group are not representative of the population from which the sample is drawn because the sampling or the treatment allocation is not random.
Types of selection bias include:
Sampling bias (ascertainment bias)
- Occurs when certain individuals are more likely to be selected for a study group, resulting in a nonrandom sample
- This can lead to incorrect conclusions being drawn about the relationship between exposures and outcomes.
- Limits generalizability
Types of sampling bias
- Nonresponse bias: Participants who do not return information during a study (i.e., nonresponders) have systematically different characteristics from those who do (e.g., some study participants do not return a call because they are feeling unwell).
- Healthy worker effect: The working population is healthier on average than the general population.
- Volunteer bias: Individuals who volunteer to participate in a study have different characteristics than the general population.
- A type of nonresponse bias
- Selective loss of participants to follow up
- Most commonly seen in prospective studies
- Risk that the remaining participants differ systematically from those lost to follow up
- Berkson bias: Individuals in sample groups drawn from a hospital population are more likely to be ill than individuals in the general population.
- Susceptibility bias: One disease predisposes affected individuals to another disease, and the treatment for the first disease is mistakenly interpreted as a predisposing factor for the second disease.
- Also known as prevalence-incidence bias and
- When observed subjects have more or less severe manifestations than the standard exposed individual
- If individuals with severe disease die before the moment of observation, those with less severe disease are more likely to be observed.
- If individuals with less severe disease have a resolution of their disease before the moment of observation, those with more severe disease are more likely to be observed.
- Most commonly occurs in case-control and cross-sectional studies.
- Sampling bias (ascertainment bias)
- In clinical trials, randomize to control for known and unknown .
- In , ensure the sample is representative of the population of interest.
- Ensure the correct reference group is chosen for comparison.
- Collect as much data on the characteristics of the participants as possible.
- Nonresponder characteristics should not be assumed. Instead, undisclosed characteristics of nonresponders should be recorded as unknown.
Description: incorrect data collection, measurement, or interpretation that leads to misclassification of groups or exposure
- Information is gathered differently between the treatment and control groups.
- Insufficient information about exposure and disease frequency among subjects
Types of information bias
- Measurement bias: any systematic error that occurs when measuring the exposure or outcome
Reporting bias: a distortion of the information from research due to the selective disclosure or suppression of information by the individuals involved in the study
- Can involve the study, design, analysis, and/or findings
- Results in underreporting or overreporting of exposure or outcome
- Interviewer bias: The interview approach distorts the responses provided by study participants, which results in researchers finding differences between groups when there are none.
Surveillance bias: An outcome is diagnosed more frequently in a sample group than in the general population because of increased testing and monitoring.
- Can result in misleadingly high incidence and prevalence rates
- Example: Endometrial cancer is more frequently detected in postmenopausal patients exposed to estrogen therapy than in those not exposed to estrogen therapy. Estrogen therapy increases the risk of bleeding, leading to more frequent screening.
- Can be reduced by comparing the treatment group to an unexposed control group with a similar likelihood of screening
Performance bias: differences between study groups that are related to group assignment
- Subjects change their behavior once they are aware that they are being observed.
- Especially relevant in psychiatric research
- This type of bias is difficult to eliminate.
- Procedure bias: This bias occurs when patients or investigators decide on the assignment of treatment and this affects the findings. The investigator may consciously or subconsciously assign particular treatments to specific types of patients (e.g., one group receives a higher quality of treatment).
- May be reduced by blinding , or use of a cluster randomization design
- Hawthorne effect
Recall bias: awareness of a condition by subjects changes their recall of related risk factors (recall a certain exposure)
- Common in retrospective studies
- Example: After claims that the MMR vaccine caused autism became public, parents of children diagnosed with autism were more likely to recall the start of autism being soon after their child was vaccinated, as compared with parents of children who were diagnosed with autism prior to these claims becoming public.
- Can be reduced by decreasing time to follow-up in retrospective studies (e.g., retrospective cohort studies or case-control studies), or using data on risk factors that was collected prior to the occurrence of the outcome (if available)
Misclassification bias: an error in which research subjects are classified into the wrong exposure or outcome groups, thereby distorting the observed association
Nondifferential misclassification bias: a misclassification bias in which the frequency of classification errors is similar in the groups under comparison
- Occurs when misclassification is unrelated to other variables in the study
- If the misclassified variable is dichotomous, the odds ratio is skewed toward the null value.
- If the misclassified variable is not dichotomous (i.e., more than two exposure or outcome categories), the odds ratio can be skewed either towards or away from the null value.
- Example: In a study on risk factors for lifetime cancer diagnosis, participants are asked if they have ever been exposed to radiation but are not asked for further clarification (e.g., what type, how recently, for how long). This leads to an incorrect conclusion about the association between radiation exposure and lifetime cancer diagnosis. Because all participants were asked the same vague question, the frequency of classification errors is similar across exposed and unexposed groups.
Differential misclassification bias: a misclassification bias in which the frequency of classification errors differs between the groups under comparison
- Occurs when misclassification is related to other variables in the study
- Skews the odds ratio either toward or away from the null value
- Example: Subjects who develop a rash after being exposed to a chemical are more likely to remember the exposure than those who do not develop a rash (recall bias). This results in unequal misclassification of study participants into exposed and unexposed groups.
- Nondifferential misclassification bias: a misclassification bias in which the frequency of classification errors is similar in the groups under comparison
- Definition: a systematic difference in the way that participants are assigned to treatment and control groups
- Example: assigning patients with better baseline blood pressure to the treatment group for a new antihypertensive treatment
- Description: : The personal beliefs of the study participants and/or investigators influence the results of the study.
Types of cognitive bias
- Response bias: Study participants do not respond truthfully or accurately because of the manner in which questions are phrased (e.g., leading questions) and/or because subjects interpret certain answer options to be more socially acceptable than others.
- Observer bias (experimenter-expectancy effect or Pygmalion effect): The measurement of a variable or classification of subjects is influenced by the researcher's knowledge or expectations.
- Confirmation bias: The researcher includes only those results that support their hypothesis and ignores other results.
- Placebo and nocebo effects: A placebo or nocebo affects study participants' preconceptions/beliefs about the outcome.
- Use of a placebo
- Prolong the time of observation to monitor long-term effects.
- Definition: type of bias that occurs when the findings of a study influence the decision to publish it
- Registration of clinical trials before any participants are enrolled
- Publishing study protocol before commencing the study
Biases specific to screening tests
See “Evaluation of diagnostic research studies” for details.
- Example: : Exposure to coal can cause lung cancer in mine workers. Many miners also smoke cigarettes, which can lead to lung cancer as well.
Minimizing confounding during study design
Methods to reduce potential confounding during the study design phase include:
- Crossover study design: Each study participant serves as his or her own control, thereby reducing the influence of confounding.
- Definition: a study design in which only individuals who meet certain criteria are included in the study sample (e.g., only male individuals with a particular disease are included in a study to avoid the influence of biological sex on the exposure and outcome)
- Limits generalizability
- Makes obtaining a large sample group difficult
- Definition: selection of study participants so that the distribution of variables is similar between study groups
- Can be done in two ways:
- Study participants are matched individually to participants with similar attributes (pairwise or individual matching).
- Study participants are matched in groups such that the groups have similar frequency of variables (frequency matching).
- Commonly used in to minimize confounding
- The matching variable should meet the criteria for a confounder.
- Example: In a study on the association between hypertension and end-stage renal disease, obesity is a potential confounder because it is associated with both diseases. By matching each participant with hypertension to a participant with a similar BMI who does not have hypertension, the potential for confounding by obesity is reduced.
Minimizing confounding during data analysis
Methods to reduce potential confounding during the data analysis phase include:
Stratified analysis: can help identify the presence of confounding and/or effect modification.
- Calculate the crude (or unadjusted) measure of association for the population (e.g., crude OR. )
- Stratify participants into subgroups according to a third variable considered to be a potential confounder (e.g., age, gender, race) to control for confounding effects and evaluate for effect modification.
- After stratification, new measures of association may be calculated:
- The results of a stratified analysis can help to distinguish whether a variable is an effect modifier or confounder.
- Standardization of data: See “” in “ .”
- Multivariable regression analysis
Once other reasons for an association between two variables have been excluded, it can be evaluated whether the association is likely to be causal. The Bradford-Hill criteria are a list of criteria that, if met, help to establish causality in epidemiological studies.
Association is not the same as causation. Remember to exclude , , and , and evaluate for before evaluating .
|Causal criteria (Bradford-Hill criteria)|
|Strength of association (effect size)|| || |
|Dose-response relationship (biological gradient)|| |
|Reproducibility (consistency)|| |
|Biologic plausibility|| |
|Coherence (epidemiology)|| |
|Experimental evidence|| |
|Analogy (epidemiology)|| || |
To establish causation, the cause must precede the effect. A temporal relationship may be difficult to establish using case-control studies or cross-sectional studies because the possible effect (i.e., outcome) and cause (i.e., exposure) are measured at the same time.
- Definition: an association between exposure and outcome that is different than common presumption
- Example: people assume that low socioeconomic status causes schizophrenia, but in fact, schizophrenia causes a decline in socioeconomic status over time
Complex multicausal relationships 
Occur when a third variable influences the effect of an exposure on an outcome, leading to a different effect on the control and treatment groups. These are not considered types of bias, but rather biological phenomena.
Effect modification: occurs when the level of the effect of an exposure on an outcome is different across different strata of a third variable (i.e., the exposure has a different impact in different circumstances)
- Example: A certain drug works in children but does not have any effect on adults.
- can allow better identification and understanding of effect modification.
- Effect modification can be inferred if stratifying participants into subgroups according to the third variable results in a stronger relationship in one subgroup.
- Interaction: occurs when two or more interdependent exposures influence the outcome measured in a variety of ways
Effect modification is not a type of error. Causal relationships can exist even if the effect of the exposure on the outcome changes across strata for another variable (e.g., age groups).
Preparation phase 
- Perform a literature review to establish what is already known on the topic.
- Select a mentor or mentorship team.
- Develop the research question using and .
- Choose an appropriate study design.
- Choose the study population.
- Inclusion and exclusion criteria
- Sites of recruitment
- Select relevant variables on which to collect data.
- Establish an analysis plan.
- Develop a hypothesis.
- Meet with a statistician to discuss study methods and planned analyses.
- Perform a power/sample size calculation to identify the number of participants required to see a difference in effect between groups.
- Obtain approval to do research on human subjects through an institutional review board (IRB).
- Determine the process for obtaining informed consent from study participants.
- Explore potential funding opportunities, if necessary.
Meeting with a statistician before starting a research project can help in determining the best study design and analysis plan.
The following criteria can help formulate an appropriate research question:
PICO criteria 
- Intervention (or exposure)
- Comparison group
FINER criteria 
Choosing a study design
|Considerations in choosing a study design|
|Study feature||Appropriate study design(s)|
|Assessment of causality|
Multiple study designs may be appropriate for answering the same clinical question.
Implementation phase 
Collection of primary data
- Subjective, e.g., surveys (paper or online), interviews
- Objective, e.g., electronic health records, laboratory testing, imaging studies or other diagnostic testing
- Analysis: Select appropriate analytic techniques in consultation with mentors and/or statisticians.
- Report results according to guidelines for main study types.
- See “Equator network reporting guidelines” in “Tips and Links.”
Dissemination of results
- Submission of manuscripts to scientific journals.
- Submission of abstracts to local, national, or international meetings.
Participant data should be entered and stored in a secure database.