Number needed to harm
The number needed to harm (NNH) is an epidemiological measure in medicine that quantifies the potential for adverse effects from a treatment or exposure, defined as the average number of individuals who must be exposed to a risk factor over a specified period to cause one additional adverse outcome compared to those not exposed.[1] It serves as the counterpart to the number needed to treat (NNT), which measures benefits, by focusing on harms to facilitate balanced risk-benefit assessments in clinical decision-making.[2] NNH is calculated as the reciprocal of the absolute risk increase (ARI), or the difference in adverse event rates between the exposed and control groups: NNH = 1 / (adverse event rate in exposed group – adverse event rate in control group).[1] This formula allows for straightforward interpretation; for instance, an NNH of 10 means that for every 10 patients treated, one additional adverse event is expected due to the intervention.[2] A lower NNH indicates a higher likelihood of harm, making it a practical tool for clinicians to communicate risks to patients and compare interventions.[1] In clinical trials and evidence-based practice, NNH is often reported alongside NNT to provide a comprehensive view of therapy outcomes, though its use remains less common than relative risk measures, with only about 0.9% of controlled trials from 2001 to 2019 explicitly reporting it.[3] Variations like unqualified NNH (NNH_UF) adjust for scenarios where harm occurs without any benefit, enhancing its utility in weighing net effects.[2] Despite its value, NNH should be interpreted cautiously, considering confidence intervals, baseline risks, and time horizons, as it assumes constant event rates across populations.[1]Definition and History
Definition
The number needed to harm (NNH) is defined as the average number of individuals who need to be exposed to a specific risk factor, treatment, or intervention for one additional person to experience a specified harm or adverse event compared to those not exposed. This metric quantifies the potential for harm in a straightforward manner, focusing on the excess risk attributable to the exposure rather than baseline event rates. NNH is primarily employed in epidemiology and clinical research to evaluate adverse effects associated with treatments, such as medications or procedures, as well as broader exposures like environmental hazards. For instance, it helps assess risks from pharmacological agents (e.g., adverse drug reactions) or non-therapeutic factors (e.g., biomass smoke exposure leading to respiratory harm).[4][5][6] Unlike measures of benefit, NNH specifically highlights the occurrence of adverse events, using rates of harm in exposed versus unexposed groups to emphasize risks over advantages. It is founded on the concept of absolute risk increase (ARI), which represents the difference in harm event probabilities between groups, providing a basis for estimating the scale of potential detriment without delving into complex relative measures. As the counterpart to the number needed to treat (NNT), which gauges benefits, NNH aids in balancing therapeutic trade-offs.[4]Historical Development
The concept of the number needed to treat (NNT), a precursor to the number needed to harm (NNH), was introduced in 1988 by Andreas Laupacis, David Sackett, and Robin Roberts in their assessment of clinically useful measures for evaluating treatment outcomes, emphasizing its role in quantifying benefits from interventions in randomized trials. This measure, defined as the reciprocal of the absolute risk reduction, provided a practical way to communicate the effort required to achieve one additional beneficial outcome, influencing subsequent developments in medical statistics. The NNH was proposed by Sackett and colleagues in 1996 as an analogous metric to the NNT, specifically for assessing harms or adverse events in evidence-based medicine, where it represents the reciprocal of the absolute risk increase for a harmful outcome.[7] This extension addressed the need to balance treatment benefits against potential risks in clinical decision-making, emerging amid growing emphasis on integrating rigorous evidence from trials into practice. During the 1990s, the NNH gained prominence with the expansion of meta-analyses and randomized controlled trials, which highlighted the importance of quantifying both therapeutic effects and side effects; notable discussions appeared in high-impact journals, solidifying its place in evidence synthesis. By the early 2000s, the metric was incorporated into reporting guidelines by organizations like the Cochrane Collaboration, which recommended its use alongside confidence intervals for adverse events in systematic reviews to enhance transparency in harm assessment.Calculation
Formula
The number needed to harm (NNH) is calculated using the formula \text{NNH} = \frac{1}{\text{[ARI](/page/Ari)}}, where ARI denotes the absolute risk increase.[8] The ARI represents the difference in harm event rates between the experimental (or exposed) group and the control group, specifically ARI = EER - CER, with EER as the experimental event rate (incidence of harm in the exposed group) and CER as the control event rate (incidence of harm in the control group).[9][10] This yields the explicit expression \text{NNH} = \frac{1}{\text{EER} - \text{CER}}. [10] The NNH is conventionally expressed as a whole number, rounded up to the nearest integer to ensure a conservative estimate.[11][12] If the ARI is negative—indicating a lower harm rate in the experimental group (a benefit rather than harm)—the NNH does not apply, as the measure pertains only to adverse outcomes.[13] In cases where ARI = 0 (no difference in harm rates between groups), the NNH is undefined or infinite.[14]Derivation from Risk Differences
The number needed to harm (NNH) is derived by inverting the absolute risk increase (ARI), transforming a measure of probabilistic risk into a count-based estimate that emphasizes patient-level implications, similar to how the number needed to treat (NNT) is obtained from the absolute risk reduction (ARR).[10] This approach provides a practical, intuitive metric for clinicians assessing the potential for adverse events in treatment decisions.[15] The derivation begins with the definition of ARI as the difference in the probability of harm between the exposed (treatment) group and the control group: \text{[ARI](/page/Ari)} = P(\text{harm} \mid \text{exposed}) - P(\text{harm} \mid \text{[control](/page/Control)}) Here, P(\text{harm} \mid \text{exposed}) represents the event rate for the adverse outcome in the treatment arm, and P(\text{harm} \mid \text{[control](/page/Control)}) is the corresponding rate in the control arm.[15] The NNH is then calculated as the reciprocal of this ARI: \text{NNH} = \frac{1}{\text{[ARI](/page/Ari)}} This inversion yields the average number of patients who must be exposed to the treatment for one additional harm to occur compared to the control, directly linking the group-level risk difference to an expected individual-level outcome.[16] Absolute risk differences, as used in this derivation, are preferred over relative risk measures for NNH because relative risks can exaggerate effects when baseline risks vary across populations, potentially leading to biased interpretations of harm magnitude.[15] For instance, the same relative increase might appear dramatically different in absolute terms depending on the underlying event rate, making ARI a more stable and contextually relevant basis for the metric.[17] The derivation assumes binary outcomes, where the event is categorized strictly as harm or no harm, and independence among events, ensuring that the probabilities reflect non-overlapping individual risks without clustering effects.[16]Interpretation
Clinical Meaning
The number needed to harm (NNH) provides an intuitive measure of the additional risk of an adverse event attributable to an intervention, calculated as the reciprocal of the absolute risk increase (ARI).[18] A lower NNH indicates a higher likelihood of harm; for instance, an NNH of 5 means that for every five patients exposed to the intervention, one additional patient will experience the harm compared to those not exposed, whereas an NNH of 100 suggests that harm occurs in only one additional patient per 100 exposed.[19] This interpretation helps clinicians grasp the practical implications of treatment risks in everyday decision-making.[19] Low NNH values often signal significant safety concerns, prompting heightened caution, though thresholds are inherently context-dependent and influenced by the severity of the potential harm—such as mild gastrointestinal upset versus life-threatening events.[18][20] For severe harms, even moderately low NNH values may outweigh benefits, while higher thresholds may be tolerable for minor side effects.[18] NNH plays a central role in risk-benefit analysis by enabling direct comparisons with the number needed to treat (NNT) for efficacy, often through the likelihood to be helped or harmed (LHH = NNH / NNT) ratio, where an LHH greater than 1 favors benefits over harms.[19] This framework aids in weighing whether the effort to achieve a positive outcome justifies the potential for adverse effects.[2] NNH estimates are typically framed over a specific time period defined by the study, such as one year of treatment, underscoring the importance of considering the temporal context to avoid misinterpreting short-term versus long-term risks.[2] Shorter durations generally yield higher NNH values, reflecting lower cumulative harm, while longer periods may reveal greater risks.[19]Confidence Intervals
The confidence interval (CI) for the number needed to harm (NNH) is derived from the CI of the absolute risk increase (ARI), reflecting the reciprocal relationship in the NNH formula, which often results in asymmetric intervals due to the nonlinear inversion.[21] When the ARI is positive (indicating harm in the experimental group), the lower and upper bounds of the NNH CI are obtained by taking the reciprocals of the upper and lower bounds of the ARI CI, respectively, and swapping their order to maintain logical consistency.[21] This approach ensures the interval captures the uncertainty inherent in the reciprocal transformation, avoiding symmetric approximations that may underestimate variability in small samples.[22] Common methods for computing the NNH CI begin with estimating the standard error of the ARI, calculated as the square root of the sum of the variances in each group: \text{SE(ARI)} = \sqrt{\frac{\text{EER}(1 - \text{EER})}{n_{\text{exp}}} + \frac{\text{CER}(1 - \text{CER})}{n_{\text{ctrl}}}} where EER is the event rate in the experimental group, CER is the event rate in the control group, n_{\text{exp}} is the sample size in the experimental group, and n_{\text{ctrl}} is the sample size in the control group.[21] The 95% CI for ARI is then ARI ± 1.96 × SE(ARI), assuming a normal approximation; the NNH CI follows via the reciprocal inversion as described by Altman.[21] Alternative approaches include the delta method, which approximates the standard error of NNH as \text{NNH}^2 \times \text{SE(ARI)} for a symmetric CI around the point estimate, suitable for larger samples but less accurate for asymmetry.[22] Bootstrapping provides a nonparametric option by resampling the original data with replacement (e.g., 1,000–10,000 iterations), recomputing ARI and NNH for each sample, and taking the 2.5th and 97.5th percentiles as the CI bounds, which performs well even with small or skewed data.[22] Wide CIs for NNH typically arise from small sample sizes or low event rates, signaling imprecise estimates and greater uncertainty in the harm assessment.[21] If the ARI CI includes zero, the corresponding NNH CI will extend to infinity (or negative infinity on one side), indicating that the observed harm may not be statistically significant and could plausibly represent no effect or even benefit.[21] Reporting guidelines emphasize including the 95% CI alongside the point estimate of NNH in publications to convey this uncertainty and prevent overinterpretation of potentially misleading single values.[21]Comparisons
With Number Needed to Treat
The number needed to harm (NNH) functions as a direct counterpart to the number needed to treat (NNT), providing a parallel framework for evaluating both risks and benefits in clinical interventions. Whereas NNT is derived as the reciprocal of the absolute risk reduction (ARR) to quantify the number of patients required for one additional beneficial outcome, NNH is the reciprocal of the absolute risk increase (ARI) to indicate the number of patients needed for one additional adverse event. This structural similarity allows both metrics to convert relative risk differences into intuitive, patient-centered estimates, facilitating comparisons across studies and treatments.[23][24] A key distinction in their interpretation arises from their opposing implications for treatment value: a lower NNT signifies greater efficacy, as fewer patients need treatment to achieve one benefit, while a higher NNH denotes improved safety, as harms occur less frequently and require more patients to manifest one adverse effect. This inverted scaling underscores their complementary nature—NNT rewards stronger positive effects, whereas NNH penalizes more common harms—enabling clinicians to balance efficacy against tolerability without relying solely on relative measures. For instance, the NNT briefly represents the patients needed to prevent one adverse outcome, mirroring NNH's focus on induced harms.[23][24][25] In practice, NNT and NNH are often used together to compute a benefit-harm ratio, such as NNT divided by NNH, which assesses the relative scale of advantages versus disadvantages; for example, an NNT of 10 paired with an NNH of 50 yields a 1:5 ratio, suggesting benefits outweigh harms for every five patients treated. This ratio aids decision-making in evidence-based medicine by providing a holistic view of a therapy's net impact, though it requires contextual judgment regarding the severity of outcomes.[24][23] Historically, NNH emerged as the harm-oriented extension of NNT, developed by the same pioneers of evidence-based medicine, including Laupacis, Sackett, and Roberts, who introduced NNT in 1988 to simplify clinical effect measures. Their work emphasized absolute rather than relative risks to better inform patient care, with NNH formalized shortly thereafter to address adverse events symmetrically.[25]With Other Risk Measures
The number needed to harm (NNH) is the reciprocal of the absolute risk increase (ARI), transforming the ARI—a proportion representing the additional risk of harm in the treatment group compared to the control group—into a countable metric that clinicians find more intuitive for estimating the scale of potential adverse effects on patients.[26][10] Unlike the relative risk increase (RRI), which measures the proportional increase in harm risk and can exaggerate effects when baseline risks are low—for instance, an RRI of 100% with an ARI of 1% results in an NNH of 100, highlighting that 99 patients would need treatment without additional harm—NNH emphasizes absolute differences to provide a balanced perspective on clinical significance.[26][27][28] In randomized controlled trials (RCTs) involving rare adverse events, NNH is often preferred over the odds ratio (OR), as the OR approximates relative risk but does not directly convey absolute harm probabilities, potentially misleading interpretations of treatment risks in low-incidence scenarios.[27][29] NNH offers advantages as a patient-oriented measure that is accessible to non-statisticians, facilitating communication of harm risks in clinical decision-making, though it is limited to binary outcomes and may not apply well to continuous or time-to-event data.[26][10] Similar to the number needed to treat (NNT) for benefits, NNH focuses on absolute rather than relative measures to aid practical application.[26]Applications and Examples
In Clinical Trials
In randomized controlled trials (RCTs), the number needed to harm (NNH) is calculated post-hoc using adverse event data from the treatment arm compared to the placebo or control arm, quantifying the number of patients who would need to receive the intervention for one additional harmful outcome to occur. This involves determining the absolute risk increase (ARI), which is the difference in event rates between groups, and then taking its reciprocal. The CONSORT extension for better reporting of harms, introduced in 2004 as an update to the 2001 CONSORT statement and further updated in 2022, recommends presenting absolute measures like risk differences for harms alongside relative measures to improve transparency and avoid overemphasizing effects.[30][31] In meta-analyses of multiple RCTs, NNH is derived by pooling the ARI across studies using fixed-effects or random-effects models, depending on the heterogeneity of trial results, before computing the reciprocal of the pooled ARI to obtain a summary NNH. Fixed-effects models assume a common true effect size across studies, while random-effects models account for between-study variation, making them suitable when trial populations or methods differ. This approach allows for a synthesized estimate of harm risk, though caution is needed as direct pooling of NNH values can be misleading due to varying baseline risks; instead, pooling at the ARI level is preferred.[32][33] Regulatory bodies such as the U.S. Food and Drug Administration (FDA) and the European Medicines Agency (EMA) mandate the inclusion of adverse event data in drug labeling, often requiring quantitative metrics akin to NNH to describe risks for common harms, including gastrointestinal adverse events with nonsteroidal anti-inflammatory drugs (NSAIDs). For instance, EMA guidance encourages expressing risks using NNH for clarity in benefit-risk assessments, while FDA labels emphasize absolute risks to inform prescribers about event probabilities.[34][35] When selecting events for NNH calculation in clinical trials, emphasis is placed on specific, clinically significant harms—such as myocardial infarction or major bleeding—rather than encompassing all possible side effects, to enable focused evaluation of intervention safety. This targeted approach helps prioritize harms with substantial patient impact, as seen in cardiovascular trials where NNH for myocardial infarction informs the balance against benefits.[36][2]Numerical Examples
To illustrate the computation of the number needed to harm (NNH), consider a hypothetical randomized controlled trial evaluating a new medication for hypertension. In the treatment group of 100 patients, 10 individuals (10%, or EER = 0.10) experience a serious adverse event, such as acute kidney injury, over the study period. In the control group of 100 patients receiving placebo, 5 individuals (5%, or CER = 0.05) experience the same event. The absolute risk increase (ARI) is calculated as EER minus CER, yielding 0.10 - 0.05 = 0.05. The NNH is then the reciprocal of the ARI: 1 / 0.05 = 20. This means that for every 20 patients treated with the medication, one additional case of acute kidney injury occurs compared to placebo. A real-world application appears in a population-based cohort study of antidepressant use and weight gain. Among participants initiating antidepressants, the adjusted rate of ≥5% weight gain in the second year of treatment was approximately 11.8 per 100 person-years, compared to 8.1 per 100 person-years without antidepressants (rate ratio = 1.46). This corresponds to an ARI of approximately 0.037, yielding an NNH of 27 patient-years—meaning one additional episode of clinically significant weight gain occurs for every 27 patient-years of antidepressant exposure in the second year.[37] The NNH varies substantially depending on the severity of the harm, reflecting differences in event rates and clinical importance. For instance, in trials of antiemetics for postoperative nausea and vomiting prevention, the NNH for mild adverse effects like extrapyramidal symptoms with metoclopramide (50 mg) was 140, indicating a less common minor harm.[38] In contrast, for severe rare events such as stroke associated with hormone replacement therapy in postmenopausal women for primary cardiovascular prevention, the ARI was 0.006 (based on a relative risk of 1.32), resulting in an NNH of 165—highlighting how one additional stroke occurs after treating approximately 165 women for one year.[39] The following table summarizes these examples, showing event rates, ARI, and NNH for clarity:| Example | Group | Event Rate | ARI | NNH |
|---|---|---|---|---|
| Hypothetical hypertension trial (acute kidney injury) | Treatment | 0.10 | 0.05 | 20 |
| Control | 0.05 | |||
| Antidepressant cohort (≥5% weight gain, second year) | Treatment | 0.118 | 0.037 | 27 patient-years |
| Control | 0.081 | |||
| Antiemetic trial (mild extrapyramidal symptoms) | Treatment (metoclopramide) | 0.007 | 0.007 | 140 |
| Control (dexamethasone) | 0.000 | |||
| Hormone therapy (stroke) | Treatment | 0.0061 | 0.006 | 165 |
| Control | 0.0055 |