Fact-checked by Grok 2 weeks ago

Local average treatment effect

The local average treatment effect (LATE) is a causal in and statistics that quantifies the average effect of a on a specific subpopulation—known as compliers—whose treatment status is altered by an exogenous instrumental variable (). Compliers are individuals who receive the treatment when the IV is present (e.g., Z=1) but would not receive it in its absence (Z=0), distinguishing them from always-takers (who receive treatment regardless of the IV), never-takers (who never receive it), and potential defiers (ruled out by key assumptions). Formally, LATE is defined as the expected in potential outcomes for compliers: \mathbb{E}[Y_1 - Y_0 \mid D_1 > D_0], where Y_1 and Y_0 are potential outcomes with and without treatment, and D_1 and D_0 are potential treatment receipts under the IV states. Introduced by and in their seminal 1994 paper, for their methodological contributions to the analysis of causal relationships, including the LATE framework, Angrist and Imbens shared the 2021 Nobel Memorial Prize in Economic Sciences with . LATE provides a rigorous interpretation for estimators in settings with heterogeneous treatment effects and imperfect , such as randomized experiments with non-adherence or observational studies with . Under the core assumptions of independence (the instrument is uncorrelated with potential outcomes and treatment potentials) and monotonicity (the instrument affects treatment status in one direction only, excluding defiers), LATE is nonparametrically identified by the Wald estimator: the ratio of the IV's effect on the outcome to its effect on treatment receipt, \frac{\mathbb{E}[Y \mid Z=1] - \mathbb{E}[Y \mid Z=0]}{\mathbb{E}[D \mid Z=1] - \mathbb{E}[D \mid Z=0]}. This estimand equals the intention-to-treat () effect on outcomes divided by the effect on treatment, offering a local rather than global measure of that applies only to the complier subgroup. LATE has become foundational in applied , enabling researchers to draw valid conclusions from strategies in fields like labor economics (e.g., estimating returns to schooling using quarter-of-birth instruments) and (e.g., assessing drug effects via prescribing as an ). Extensions include bounding LATE under partial violations of monotonicity and nonparametric methods to relax functional form assumptions, though challenges persist in extrapolating LATE to broader populations or verifying the exclusion restriction (where the affects outcomes only through ). Its emphasis on subpopulation-specific effects underscores the limitations of assuming homogeneous impacts, promoting more nuanced evaluations.

Fundamentals

Definition

The local average treatment effect (LATE) is a causal that measures the effect of a on a specific of the known as "compliers," who change their treatment status in response to an exogenous . Formally, it is defined as \mathbb{E}[Y(1) - Y(0) \mid D(1) > D(0)], where Y(1) and Y(0) are the potential outcomes under treatment and no treatment, and D(1) and D(0) are the potential treatment receipts under the instrument values Z=1 and Z=0. In empirical settings where treatment assignment is not perfectly randomized or compliant, LATE isolates the impact on those individuals whose participation in the treatment is directly influenced by the instrument, providing a targeted estimate of causal effects rather than an overall . Consider a with an outcome Y (such as ), a indicator D (1 if treated, 0 otherwise), and a Z (1 if the instrument is applied, 0 otherwise). The LATE then represents the mean difference in Y between treated and untreated states for compliers—those with D=1 when Z=1 and D=0 when Z=0. This contrasts with the (ATE), which estimates the across the entire and can be identified as LATE under full compliance. LATE was formally introduced by Angrist and Imbens in their seminal 1994 paper, which built on earlier (IV) methods to address challenges in non-experimental data. This framework has become foundational in for evaluating policies, such as the effects of education programs or draft lotteries, where instruments like random assignments influence treatment uptake imperfectly.

Potential Outcomes Framework

The potential outcomes framework, also known as the , provides a foundational approach to defining causal effects in observational and experimental settings. In this model, for each unit i, the individual treatment effect is given by the difference between the potential outcome under treatment, Y_i(1), and the potential outcome under no treatment, Y_i(0), where Y_i(d) denotes the value of the outcome Y_i that would be observed if unit i received treatment status d \in \{0, 1\}. However, only one potential outcome is observed for each unit—the one corresponding to the actual treatment received—leading to the fundamental problem of , as the counterfactual outcome remains unobservable. To address settings with imperfect compliance, where treatment assignment does not perfectly determine receipt, the framework incorporates an instrumental variable Z_i, typically , that influences the received D_i but operates through potential treatment statuses D_i(z) for instrument values z \in \{0, 1\}. Here, D_i(z) represents the treatment status unit i would take if assigned to instrument level z, allowing the model to capture how the instrument affects treatment uptake without directly altering the outcome. Units can be classified into principal strata based on their potential treatment behaviors: always-takers, for whom D_i(0) = 1 and D_i(1) = 1; compliers, for whom D_i(0) = 0 and D_i(1) = 1; never-takers, for whom D_i(0) = 0 and D_i(1) = 0; and defiers, for whom D_i(0) = 1 and D_i(1) = 0. The local average treatment effect targets the average causal effect specifically for the complier subgroup, where treatment receipt changes with the instrument, i.e., units satisfying D_i(1) > D_i(0). Non-compliance in this framework can be one-sided, such as scenarios involving only compliers and never-takers (where no unit takes without the instrument), or two-sided, which includes always-takers who receive regardless of the instrument.

Assumptions

Non-Compliance Framework

In the potential outcomes framework, non-compliance arises when the instrument Z, which randomly assigns individuals to treatment or control, does not perfectly determine actual treatment receipt D. Specifically, potential treatment variables D(1) and D(0) denote whether an individual would receive if assigned to Z=1 or Z=0, respectively. This setup partitions the population into four principal strata based on their compliance behavior: compliers, for whom D(1)=1 and D(0)=0; always-takers, for whom D(1)=1 and D(0)=1; never-takers, for whom D(1)=0 and D(0)=0; and defiers, for whom D(1)=0 and D(0)=1. Non-compliance introduces bias in simple intent-to-treat (ITT) estimates, which compare average outcomes between assignment groups but dilute the treatment effect by including individuals who do not respond to the assignment. The local average treatment effect addresses this by focusing exclusively on compliers, whose treatment status changes with the instrument, thereby isolating the causal effect for this subgroup and correcting for the attenuation bias in ITT. Non-compliance can be one-sided or two-sided. In one-sided non-compliance, defiers are absent—typically in encouragement designs where group members cannot access , but some assignees may not comply—simplifying identification to compliers, always-takers, and never-takers. Two-sided non-compliance permits all four types, including defiers who take when assigned to but not when assigned to , which complicates interpretation as the may reverse for this group. The proportion of compliers, given by P(D(1) > D(0)), represents the size of the subpopulation on which the has a causal effect on receipt and thus defines population relevant for the . This proportion is empirically estimable as the difference in treatment probabilities between assignment groups and scales the effect to recover the complier-specific impact.

Identification Assumptions

The identification of the local average treatment effect (LATE) relies on a set of assumptions within the instrumental variables () framework to ensure that the instrument validly isolates the causal effect for compliers—the subgroup whose treatment status changes with the instrument. These assumptions address potential biases from non-compliance and , enabling the LATE to be recovered from observable data such as intention-to-treat effects. As outlined in the non-compliance framework, individuals are categorized into compliers, always-takers, never-takers, and potential defiers based on their treatment responses to the instrument. The independence assumption, also known as , posits that the Z is of the potential outcomes and potential , i.e., Z \perp (Y_i(0), Y_i(1), D_i(0), D_i(1)) for all individuals i. This ensures that the does not correlate with unobserved factors affecting the outcomes or treatment decisions, mimicking a . In practice, this holds when the is exogenously assigned, such as in a lottery system, but it is untestable and requires domain-specific justification. The exclusion restriction requires that the instrument affects the observed outcome Y solely through its impact on the treatment D, with no direct effect: Y_i(z, d) = Y_i(d) for all z and d. For compliers, this implies that their potential outcomes Y_i(1) and Y_i(0) are independent of Z. This assumption rules out direct channels from the instrument to the outcome, such as side effects unrelated to treatment, and is crucial for attributing all instrument-induced variation in Y to changes in D. It is untestable but can be motivated by the instrument's design, like a policy change that only alters treatment access. The monotonicity assumption eliminates defiers—individuals for whom the reverses their choice—by requiring that receipt does not decrease with the : D_i(1) \geq D_i(0) for all i. This ensures the shifts in one direction only, so that the first stage variation reflects only compliers (and possibly always-takers or never-takers) without opposing effects that could bias the LATE toward a weighted average across subgroups. Monotonicity is untestable but often plausible under linear response models or when the encourages uptake without discouraging it. Finally, the relevance assumption mandates that the instrument strongly predicts treatment, with the proportion of compliers positive: \Pr(D_i(1) > D_i(0)) > 0, or equivalently, E[D_i(1) - D_i(0)] \neq 0. This prevents weak instruments that fail to generate sufficient variation in D, ensuring the denominator in the LATE formula is non-zero and estimable. Relevance is testable via the first-stage F-statistic and is essential for precise inference. In settings with one-sided non-compliance—where, for example, the control group cannot access (no always-takers)—the assumptions simplify: monotonicity holds automatically as D_i(0) = 0 for all i, and the focus shifts to excluding defiers alongside , exclusion, and . In contrast, two-sided non-compliance requires the full set, including monotonicity to rule out defiers, while always-takers and never-takers do not bias the complier effect due to their unchanged across Z, to isolate the complier effect without contamination. These distinctions allow LATE to adapt to experimental designs like encouragement trials.

Identification

Core Identification Formula

The local average treatment effect (LATE) is identified through the ratio of reduced-form intent-to-treat (ITT) effects under the instrumental variables framework. For a binary Z \in \{0, 1\}, the core is given by \text{LATE} = \frac{E[Y \mid Z=1] - E[Y \mid Z=0]}{E[D \mid Z=1] - E[D \mid Z=0]}, where Y is the observed outcome, D is the observed treatment receipt, and the expectations denote averages conditional on the value. This expression equals the ratio of the effect on the outcome, \text{ITT}_Y = E[Y \mid Z=1] - E[Y \mid Z=0], to the effect on treatment receipt, \text{ITT}_D = E[D \mid Z=1] - E[D \mid Z=0]. The numerator \text{ITT}_Y measures the average causal effect of the instrument on the outcome, capturing overall shifts attributable to Z under the exclusion restriction. The denominator \text{ITT}_D quantifies the average change in treatment compliance induced by Z, reflecting the instrument's influence on who receives the . Their ratio isolates the treatment effect for compliers—the whose treatment status switches from D=0 to D=1 when Z changes from 0 to 1—yielding a weighted average causal effect specific to this subpopulation. Under the standard assumptions of monotonicity (no defiers) and exclusion ( Z affects Y only through D), this formula precisely equals E[Y(1) - Y(0) \mid D(1) > D(0)], the conditional for compliers in the potential outcomes framework. While the formula assumes a binary instrument for simplicity, it generalizes to continuous instruments by replacing the conditional differences with covariances, such as \text{LATE} = \frac{\text{Cov}(Y, Z)}{\text{Cov}(D, Z)}, maintaining the focus on reduced-form contrasts.

Proof of Identification

The proof of identification for the (LATE) relies on the potential outcomes framework and key assumptions, including independence of the , the exclusion restriction, and monotonicity. Under these conditions, the instrumental variables () estimand equals the for compliers—the subpopulation whose treatment status changes with the . This derivation, originally established by Imbens and Angrist (1994), proceeds step by step, expressing conditional expectations and showing how non-compliers' contributions cancel out. Consider the potential outcomes notation: for each unit i, let Y_i(1) be the outcome under treatment and Y_i(0) under no treatment, with observed outcome Y_i = D_i Y_i(1) + (1 - D_i) Y_i(0), where D_i \in \{0,1\} indicates treatment receipt. The instrument Z_i \in \{0,1\} affects treatment via potential treatment statuses D_i(1) and D_i(0). The independence assumption states that (Y_i(0), Y_i(1), D_i(1), D_i(0)) is independent of Z_i, ensuring the instrument is exogenous and does not directly affect outcomes except through treatment (incorporating the exclusion restriction). Let P(z) = E[D_i | Z_i = z] = \Pr(D_i(z) = 1). The monotonicity assumption further posits that D_i(1) \geq D_i(0) for all i, ruling out defiers (units who take when Z=0 but not when Z=1) and partitioning the population into always-takers (D_i(1) = D_i(0) = 1), never-takers (D_i(1) = D_i(0) = 0), and compliers (D_i(1) = 1, D_i(0) = 0). This implies \Pr(D_i(1) - D_i(0) = -1) = 0. Now, express the conditional expectation of the outcome: E[Y_i | Z_i = z] = E[Y_i(1) | D_i(z) = 1, Z_i = z] \Pr(D_i(z) = 1 | Z_i = z) + E[Y_i(0) | D_i(z) = 0, Z_i = z] \Pr(D_i(z) = 0 | Z_i = z). By independence, the conditional expectations E[Y_i(1) | D_i(z) = 1, Z_i = z] and E[Y_i(0) | D_i(z) = 0, Z_i = z] equal the unconditional E[Y_i(1) | D_i(z) = 1] and E[Y_i(0) | D_i(z) = 0], which do not depend on z. Substituting for z = 1 and z = 0, the difference becomes: E[Y_i | Z_i = 1] - E[Y_i | Z_i = 0] = E[Y_i(1) - Y_i(0) | D_i(1) = 1, D_i(0) = 0] \cdot [P(1) - P(0)], as contributions from always-takers and never-takers cancel under the exclusion restriction (their outcomes are unaffected by Z) and (no defiers). The term E[Y_i(1) - Y_i(0) | D_i(1) = 1, D_i(0) = 0] is precisely the for . Similarly, for the first stage: E[D_i | Z_i = 1] - E[D_i | Z_i = 0] = \Pr(D_i(1) = 1, D_i(0) = 0) = P(1) - P(0), the proportion of compliers, again by and monotonicity, with always-takers and never-takers canceling. The estimand, or Wald ratio, \frac{E[Y_i | Z_i = 1] - E[Y_i | Z_i = 0]}{E[D_i | Z_i = 1] - E[D_i | Z_i = 0]}, thus simplifies to \frac{[E[Y_i(1) - Y_i(0) | D_i(1) = 1, D_i(0) = 0] \cdot [P(1) - P(0)]]}{P(1) - P(0)} = E[Y_i(1) - Y_i(0) | D_i(1) = 1, D_i(0) = 0], identifying the LATE as the complier average causal effect. Each step invokes the assumptions: independence for unconditional expectations, exclusion for non-compliers' invariance, and monotonicity for the clean partitioning and absence of defiers.

Instrumental Variables Integration

LATE in IV Estimation

In instrumental variables (IV) estimation, the local average treatment effect (LATE) is commonly estimated using the two-stage least squares (2SLS) method, which addresses endogeneity in the treatment variable D by leveraging an instrument Z. In the first stage, D is regressed on Z (and any exogenous covariates) to obtain the predicted values \hat{D}, capturing the exogenous variation induced by Z. In the second stage, the outcome Y is regressed on \hat{D} (and the same exogenous covariates), with the coefficient on \hat{D} providing a consistent estimate of the LATE for compliers—those whose treatment status changes with Z—under the standard IV assumptions of , exclusion, and monotonicity. The 2SLS estimator is consistent for the LATE as the sample size increases to infinity, provided the is relevant and the assumptions hold, meaning it converges in probability to the true LATE parameter. Asymptotic standard errors for the 2SLS estimates can be computed to account for potential weak bias, which arises when the first-stage correlation between Z and D is low, though robust standard errors are recommended in finite samples to improve reliability. Implementation of 2SLS for LATE estimation is widely available in statistical software, facilitating practical application in econometric analysis. In Stata, the ivregress 2sls command performs the two-stage procedure, allowing specification of endogenous regressors and instruments while supporting cluster-robust standard errors to handle heteroskedasticity and dependence. Similarly, in R, the ivreg function from the AER package executes 2SLS estimation with options for robust variance-covariance matrices, essential for valid inference in applied settings with non-i.i.d. data. When multiple instruments are available, leading to an overidentified system (more instruments than endogenous variables), 2SLS generalizes by projecting D onto the space spanned by all instruments in the first stage, yielding an estimate that remains a LATE but is now local to the combined variation across the instruments, weighted by their relevance. Overidentification tests, such as the Sargan-Hansen statistic, can then assess instrument validity, though the estimand stays confined to compliers responsive to the instrumental variation.

Connection to Wald Estimator

The Wald estimator provides a straightforward, non-parametric method for estimating the local average treatment effect (LATE) in settings with a binary instrument Z, treatment D, and outcome Y. It computes the ratio of the reduced-form effect of the instrument on the outcome to the first-stage effect on the treatment, using population expectations or their sample analogs. The estimator is given by \hat{\text{LATE}} = \frac{\bar{Y}_{Z=1} - \bar{Y}_{Z=0}}{\bar{D}_{Z=1} - \bar{D}_{Z=0}}, where \bar{Y}_{Z=z} and \bar{D}_{Z=z} denote the sample means of the outcome and treatment, respectively, conditional on the instrument taking value z \in \{0,1\}. Named after the statistician Abraham Wald, who introduced an early version in the context of errors-in-variables models, the estimator was originally proposed in 1940 for fitting linear relationships with measurement error in both variables. Its application to causal inference and LATE was popularized in the seminal work of Angrist, Imbens, and Rubin (1996), who formalized its interpretation within the potential outcomes framework as the average treatment effect for compliers—those whose treatment status changes with the instrument. In relation to instrumental variables (IV) estimation, the Wald estimator corresponds exactly to the two-stage least squares (2SLS) estimator when the instrument is binary and there are no covariates, reducing to a simple ratio of differences in means. This equivalence highlights its role as a foundational building block for more general IV methods, particularly in randomized experiments where the instrument is randomly assigned, ensuring efficiency under standard assumptions like monotonicity and exclusion. The Wald estimator's key advantages lie in its non-parametric nature, requiring no functional form assumptions beyond the IV conditions for LATE identification, and its computational simplicity, as it relies solely on group means without iterative optimization. For binary outcomes, it can be adapted using or models for the first stage if the treatment probability is modeled parametrically, though the basic ratio form remains applicable for linear projections.

Applications

Hypothetical Scenarios

To illustrate the local average treatment effect (LATE) in an encouragement design, consider a scenario where a random subset of high school graduates receives a offer (instrument Z) intended to encourage (treatment D), with long-term (outcome Y) as the measure of interest. The offer increases the probability of among recipients but does not it, leading to partial compliance. Under standard assumptions including instrument exogeneity, monotonicity (no defiers who attend only without the offer), and instrument , the LATE identifies the average causal effect of on specifically for compliers—those students induced to attend by the offer, or the marginal students who would not otherwise enroll. This contrasts with the (ATE), which averages the impact across the entire population, as LATE isolates the effect for this induced subgroup. In settings with two-sided non-compliance, always-takers—students who attend regardless of the offer—further complicate , as their effects are not captured by the LATE. For instance, always-takers might enroll due to strong academic motivation or family resources, and their earnings outcomes under attendance remain unaffected by the instrument . The LATE excludes these individuals, focusing solely on compliers whose attendance (and thus earnings) responds to , while never-takers—who do not attend even with the offer—are also excluded since they receive no in either case. This exclusion ensures the reflects only the policy-relevant effect for those swayed by the encouragement. The schedule of potential outcomes clarifies how LATE arises in this framework, distinguishing groups based on compliance types under Z=0 (no offer) and Z=1 (offer). Potential outcomes are denoted Y(1) for earnings with attendance and Y(0) without.
Compliance TypeD under Z=0Y under Z=0D under Z=1Y under Z=1
Always-Takers1Y(1)1Y(1)
Compliers0Y(0)1Y(1)
Never-Takers0Y(0)0Y(0)
Here, the LATE equals the mean difference E[Y(1) - Y(0) | compliers], which is the average earnings gain from college for those induced to attend by the scholarship. A key interpretation pitfall is that the LATE may not generalize to the full population if compliers differ systematically from others, such as always-takers who might benefit more (or less) from attendance due to unobserved traits like motivation. For example, if compliers are lower-ability students on the margin of attendance, their earnings response to college could underestimate the ATE for high-ability always-takers. Thus, while LATE provides a valid causal estimate for the induced subgroup, extrapolating it requires additional assumptions about effect homogeneity.

Empirical Examples

One seminal empirical application of the local average treatment effect (LATE) is found in the study of returns to by Angrist and Krueger (1991), who used quarter of birth as an instrumental variable for years of schooling among U.S. men born between 1930 and 1939. Compulsory schooling laws created variation in based on birth quarter, as children born earlier in the year were more likely to start school sooner and thus accumulate more years before the minimum leaving age. The IV estimate identified the LATE for "compellers"—those induced to attend additional school by the law—yielding a return to of approximately 7.5% per year, suggesting minimal ability in ordinary least squares estimates. In , the Student/Teacher Achievement Ratio (STAR) experiment provides another key example, analyzed by Krueger (1999) using initial to small classes (13-17 students) as an instrument for actual experienced by through third-grade students. Although the experiment was randomized, non-compliance occurred due to transitions and implementation variations, allowing an IV approach to estimate the LATE for students who actually received smaller classes. The analysis revealed that smaller classes increased scores by 0.20-0.22 standard deviations overall, with persistent effects into later grades, highlighting benefits for marginalized students such as children and those from low-income families. Health policy research has also leveraged LATE, as in Abaluck and Gruber (2011), who examined plan choices among elderly beneficiaries to assess the effects of coverage uptake. Using variation in plan availability and premiums across regions as quasi-instruments for enrollment decisions, they estimated that suboptimal choices led to 27% lower compared to rational selection, with coverage reducing out-of-pocket costs but revealing inefficiencies in uptake among those eligible for low-income subsidies. This LATE interpretation focused on compliers influenced by plan characteristics, underscoring barriers to effective drug utilization. Recent applications include studies on hesitancy, where mandates served as instruments for uptake. For instance, Karaivanov et al. (2022) used proof-of-vaccination mandates in Canadian provinces and select European countries (, , ) to estimate their impact on first-dose uptake, finding increases of over 60% in some regions (e.g., 66% in ). As of 2023, follow-up analyses of state-level mandates in the U.S. have confirmed reductions in hesitancy among mandate-responsive groups, though direct LATE estimates on health outcomes vary.

Extensions and Generalizations

Reweighting Approaches

Reweighting approaches seek to extend the (LATE), which identifies the for compliers under standard instrumental variable (IV) assumptions, to broader estimands such as the (ATE) for the full population. These methods typically require additional assumptions beyond the core LATE identification, such as of the instrument or monotonicity in treatment response, to adjust the weights applied to observed outcomes or conditional effects. One prominent reweighting strategy relies on the assumption that treatment assignment is ignorable given observed covariates, meaning the instrument is independent of potential outcomes and treatment responses conditional on covariates. Under this conditional ignorability, inverse compliance score weighting (ICSW) adjusts the LATE to the ATE by reweighting units inversely proportional to their estimated compliance score, the probability of compliance with the instrument given covariates. The compliance score P_{C}(X) = P(D = Z \mid X) is typically estimated via logistic regression or maximum likelihood using covariates X, serving as a form of propensity score for compliance. The weights are then w = 1 / \hat{P}_{C}(X), upweighting units more likely to be non-compliers to mimic the full population distribution. The resulting weighted IV estimator for the ATE is given by \hat{\tau}_{\text{ATE}} = \frac{ \frac{\sum w_i Z_i Y_i}{\sum w_i Z_i} - \frac{\sum w_i (1 - Z_i) Y_i}{\sum w_i (1 - Z_i)} }{ \frac{\sum w_i Z_i D_i}{\sum w_i Z_i} - \frac{\sum w_i (1 - Z_i) D_i}{\sum w_i (1 - Z_i)} }, where Z is the instrument, D the treatment receipt, and Y the outcome. This approach ensures that the complier subpopulation is reweighted to represent the entire population, provided the compliance score is strictly between 0 and 1 for all units. Another reweighting method operates under the monotonicity assumption, which rules out defiers (units whose treatment response reverses with the instrument), allowing the LATE to be expressed as a weighted average of conditional treatment effects with weights proportional to the complier proportion \pi(X) = P(D_1 > D_0 \mid X). To extrapolate to the ATE, which requires an unweighted average across the population, the conditional LATEs can be reweighted inversely by the complier proportions to de-emphasize groups with higher compliance rates. For instance, an adjustment weight w = \pi / P(D=1), where \pi is the overall complier proportion and P(D=1) the marginal treatment probability, can be used to scale the LATE toward population-level effects, often yielding bounds on the ATE when full homogeneity is not assumed. Implementation proceeds similarly via propensity score estimation of \pi(X), enabling extrapolation while respecting monotonicity. Sloczynski (2018) derives such weighted representations for IV estimands, showing how complier proportions determine the effective weights and providing a framework for bounds on broader effects under monotonicity. These reweighting techniques have been applied in labor economics to recover population-level causal effects from IV estimates that initially target compliers. For example, in studies estimating returns to education using instruments like quarter of birth or proximity to colleges, reweighting adjusts the LATE for marginal students—who comply with schooling incentives—to approximate the ATE across all individuals, revealing broader policy implications for wage gains from additional schooling.

Beyond Standard LATE

The local average treatment effect (LATE) framework can be extended to account for heterogeneity in treatment effects across complier subgroups by incorporating covariates that interact with the instrument or treatment. In this approach, researchers stratify the sample based on pretreatment covariates to identify LATEs for more specific subpopulations of compliers, where the first-stage effect of the instrument on treatment is symmetric or balanced across groups. For instance, by conditioning on variables such as age or education, the LATE can approximate the average treatment effect (ATE) within those subgroups under assumptions like conditional constant effects or linearity in the first stage. This method allows for variation in effects, revealing how treatment impacts differ—for example, larger effects among younger compliers compared to older ones—without assuming homogeneity across the entire complier population. When treatment effects vary across individuals, the standard LATE captures the effect only for marginal patients influenced by the , potentially differing from the overall ATE. To address this, interactions between covariates and the in the model enable the recovery of heterogeneous , such as through two-stage regressions that include covariate-treatment interactions. Empirical applications, like evaluating postacute care choices, demonstrate how such heterogeneity leads to LATE estimates showing higher readmission risks but lower costs for among compliers defined by geographic distance to providers. These extensions highlight that estimates remain valid for specific complier subgroups but require careful interpretation to avoid overgeneralization to the full . For multi-valued treatments, where the treatment variable takes on more than two levels (e.g., ordered doses or intensities), the LATE framework generalizes to marginal treatment effects (MTEs), which capture effects at specific points in the treatment distribution. Mogstad et al. (2018) develop a method to express IV estimands, including two-stage least squares, as weighted averages of MTEs, with identifiable weights that allow bounds on policy-relevant parameters even under selection on unobservables. This approach enables extrapolation from complier effects to broader populations, such as estimating the impact of varying subsidy levels on program participation, by relaxing strict binary assumptions and incorporating the full treatment heterogeneity. For ordered treatments, the MTEs provide a continuous analog to LATE, facilitating inference on how effects vary along the treatment margin induced by the instrument. Relaxing the monotonicity assumption in LATE estimation accommodates the presence of defiers—individuals who receive the opposite from what the intends—by deriving sharp nonparametric bounds on the . Fan and Park (2010) propose estimators for these bounds using marginal distributions of potential outcomes, without requiring monotone responses, and establish their asymptotic validity for inference via methods like the fewer-than-n bootstrap. In the presence of defiers, the bounds tighten with covariates under selection-on-observables, providing partial identification where point identification fails, such as in applications assessing policy with ambiguous compliance patterns. This generalization ensures LATE-like interpretations remain feasible, albeit interval-valued, when standard assumptions do not hold. Extensions to fuzzy regression discontinuity designs (RDDs) further relax monotonicity for LATE by imposing one-sided monotonicity locally around the and replacing full independence with moment continuity conditions on potential treatment and outcomes. In fuzzy RDD, the (e.g., a policy threshold) induces discontinuous jumps in treatment probability, and these relaxed assumptions allow the ratio of outcome to treatment discontinuities to identify the LATE for local compliers without global exclusion restrictions. This applies to settings like educational , where treatment uptake varies but monotonicity holds only near the threshold, yielding causal estimates robust to mild violations of classical conditions. Dynamic LATE addresses time-varying instruments or sequential treatments by identifying period-specific effects for compliers under a static binary instrument, particularly for irreversible treatments like job training programs. Ferman and Tecchio (2023) show that per-period IV estimates capture weighted sums of dynamic effects across latent groups, with recursive corrections under calendar-time homogeneity to recover the LATE for first-period compliers. Recent extensions include nonparametric methods for dynamic in time series data (Callaway et al., 2024). When late treatment switching is limited, bounds on subsequent effects tighten, enabling analysis of cumulative impacts in without additional instruments. This extension is valuable for sequential decision-making, such as multi-stage interventions, where standard LATE would conflate timing heterogeneity.

Limitations

Key Constraints

The local average treatment effect (LATE) identifies the causal effect of a treatment on outcomes solely for the subpopulation of compliers—those individuals whose treatment status changes in response to the instrument—rather than the entire population. This local nature limits the generalizability of LATE estimates, as the effect for compliers may not represent the treatment impact on always-takers, never-takers, or the broader population without further assumptions about treatment effect homogeneity. The monotonicity assumption, which requires the absence of defiers (individuals who take the treatment when the instrument is off but not when it is on, or vice versa), is central to LATE identification but fragile in practice. This assumption is difficult to test directly, as it cannot be verified from observed data alone, and its violation leads to biased estimates by incorporating the opposing effects of defiers into the IV estimand. Recent developments, such as partial identification methods, allow bounding or estimating LATE even with potential defiers. Weak instruments pose another key constraint, where a low between the instrument and the endogenous results in finite-sample and unreliable in IV estimation. Specifically, when the first-stage F-statistic is below conventional thresholds (e.g., around 10), the two-stage estimator can exhibit substantial toward the confounded OLS estimate, exacerbating problems in applied settings. Violations of the exclusion restriction, which mandates that the instrument affects the outcome only through the treatment, can severely misidentify the LATE by introducing direct channels from the instrument to the outcome. For instance, in policy evaluations of training programs for unemployed workers, an instrument like eligibility rules may violate exclusion if it directly influences job search or independently of program participation, leading to inconsistent estimates.

Comparisons to Other Causal Effects

The Local Average Treatment Effect (LATE) differs from the (ATE) primarily in its scope and identifying assumptions. LATE estimates the causal impact of specifically for the subpopulation of compliers—individuals whose status changes in response to the —under conditions of heterogeneity and non-compliance. In contrast, the ATE captures the average causal across the entire , which requires stronger assumptions such as full or conditional independence of assignment given covariates. When heterogeneity exists, the IV identifies LATE rather than ATE, as the 's influence is confined to compliers. Compared to the Average Treatment Effect on the Treated (ATT), LATE focuses on the effects for those induced into treatment by the instrument, who do not necessarily overlap with the group that receives treatment in the absence of the instrument. The ATT measures the average effect among individuals who actually take up the treatment, often under selection based on observables or unobservables, whereas LATE isolates the impact on instrument-responsive individuals, such as potential participants encouraged by policy incentives. This distinction is evident in examples like the Vietnam draft lottery, where LATE estimates the earnings reduction for those drafted who would not have otherwise served, differing from the ATT for all veterans. Overlap between LATE and ATT arises if the instrument closely replicates the natural selection mechanism into treatment. The Marginal Treatment Effect (MTE) extends beyond LATE by addressing heterogeneity at specific margins of the selection process, defined as the expected treatment effect conditional on covariates and the unobservable resistance to treatment U_D. LATE emerges as a special case of the MTE framework, equivalent to a weighted of MTEs over the of U_D values influenced by the , particularly under monotonicity and assumptions. If treatment effects are uniform across individuals, LATE coincides with the MTE at the complier margin; otherwise, MTE reveals variation along the full of gains, enabling more nuanced . LATE is preferred in instrumental variable analyses involving non-compliance, where it provides a credible estimate for the subpopulation affected by the without relying on global randomization. For settings with sharp policy discontinuities, regression discontinuity designs offer an alternative that also identifies LATE locally at the , leveraging the running variable for rather than an external .

References

  1. [1]
    None
    ### Summary of Local Average Treatment Effect (LATE)
  2. [2]
    [PDF] Identification and Estimation of Local Average Treatment Effects ...
    Mar 29, 2008 · effects (Angrist and Imbens (1991)). It defines causal effects in terms of potential. outcomes or counterfactuals rather than in terms of the ...
  3. [3]
    Identification and Estimation of Local Average Treatment Effects
    Mar 1, 1994 · Econometrica: Mar, 1994, Volume 62, Issue 2. Notes and Comments: Identification and Estimation of Local Average Treatment Effects.
  4. [4]
    Identification and Estimation of Local Average Treatment Effects
    Feb 1, 1995 · Joshua D. Angrist and Guido W. Imbens, "Identification and Estimation of Local Average Treatment Effects," NBER Working Paper t0118 (1995), ...
  5. [5]
    Identification and Estimation of Local Average Treatment Effects - jstor
    (Angrist, Imbens, and Rubin (1993)), we discuss conditions similar to this in great detail, and investigate the implications of violations of these conditions.
  6. [6]
    Estimating causal effects of treatments in randomized ... - APA PsycNet
    Estimating causal effects of treatments in randomized and nonrandomized studies. Publication Date. Oct 1974. Language. English. Author Identifier. Rubin, Donald ...
  7. [7]
    [PDF] Identification of Causal Effects Using Instrumental Variables
    Angrist, Imbens, and Rubin (AIR) apply the method of instrumental variables (IV) to estimate the local average treatment effect (LATE) of Imbens and Angrist ( ...<|control11|><|separator|>
  8. [8]
    Data analysis | The Abdul Latif Jameel Poverty Action Lab
    Non-compliance can be one- or two-sided. One-sided noncompliance is when individuals assigned to the treatment group refuse treatment OR individuals ...
  9. [9]
    Two-Stage Least Squares Estimation of Average Causal Effects in ...
    Imbens, G., and Angrist, J. (1994), "Identification and Estimation of Local. Average Treatment Effects," Econometrica, 62, 467-476. Leamer, E. E. ...
  10. [10]
    [PDF] ivregress — Single-equation instrumental-variables ... - Stata
    Angrist and Pischke (2009, chap. 4) offer a casual yet thorough introduction to instrumental-variables estimators, including their use in estimating treatment ...
  11. [11]
    [PDF] External Validity and Overidentification in the LATE Framework
    This paper develops a covariate-based approach to external validity of IV estimates, using overidentification test statistics to define a population for which ...
  12. [12]
    The Fitting of Straight Lines if Both Variables are Subject to Error
    The fitting of straight lines if both variables are subject to error. Abraham Wald. Download PDF + Save to My Library. Ann. Math. Statist. 11(3): 284-300.
  13. [13]
  14. [14]
    Does Compulsory School Attendance Affect Schooling and Earnings?
    We estimate the impact of compulsory schooling on earnings by using quarter of birth as an instrument for education. The instrumental variables estimate of the ...
  15. [15]
    Experimental Estimates of Education Production Functions
    This paper analyzes data on 11,600 students and their teachers who were randomly assigned to different size classes from kindergarten through third grade.
  16. [16]
    Evidence from Plan Choice in the Medicare Part D Program
    Citation. Abaluck, Jason, and Jonathan Gruber. 2011. "Choice Inconsistencies among the Elderly: Evidence from Plan Choice in the Medicare Part D Program.Missing: local average treatment
  17. [17]
    COVID-19 vaccination mandates and vaccine uptake - Nature
    Jun 2, 2022 · We find that the announcement of a mandate is associated with a rapid and significant surge in new vaccinations (a more than 60% increase in weekly first doses ...
  18. [18]
    None
    ### Extracted Information
  19. [19]
    [PDF] Instrumental Variables and Heterogeneous Treatment Effects
    Mar 22, 2022 · When treatment effects are heterogeneous, then the local average treatment effect might not equal the average treatment effect. Instrumental ...
  20. [20]
    Using Instrumental Variables for Inference About Policy Relevant ...
    Nov 6, 2018 · Since the weights are identified, knowledge of the IV estimand generally places some restrictions on the unknown marginal treatment effects, and ...
  21. [21]
    [PDF] SHARP BOUNDS ON THE DISTRIBUTION OF TREATMENT ...
    In this paper, we propose nonparametric estimators of sharp bounds on the distribu- tion of treatment effects of a binary treatment and establish their ...Missing: defiers | Show results with:defiers
  22. [22]
    Relaxing conditions for local average treatment effect in fuzzy ...
    In this paper, we revisited the requisite assumptions for local average treatment effect (LATE) in fuzzy regression discontinuity. The well-known LATE ...
  23. [23]
    [PDF] Identifying Dynamic LATEs with a Static Instrument
    May 30, 2023 · One advantage of focusing on irreversible treatments is that any possible sequence of treatment statuses at time t can be identified by zero if ...
  24. [24]
    the Endogenous Explanatory Variable Is Weak - jstor
    They developed an asymptotic distribution theory that does not rely on approximation or on the assumption of normality for IV estimates with weak instruments.
  25. [25]
    [PDF] An Economic Analysis of Exclusion Restrictions for Instrumental ...
    In such cases there is an incentive for the agent to acquire information on the value of the IV. This leads to violation of the exclusion restriction. We.Missing: LATE | Show results with:LATE
  26. [26]
    Understanding Treatment Effect Terminology in Pain and Symptom ...
    In our example, the local average treatment effect (LATE) is the treatment effect for individuals who could be persuaded to change membership in the medication ...
  27. [27]
    [PDF] structural equations, treatment effects, and econometric policy ...
    This paper uses the marginal treatment effect (MTE) to unify the nonparametric literature on treatment effects with the econometric literature on structural ...
  28. [28]
    Approaches to the Estimation of the Local Average Treatment Effect ...
    Typically, a local average treatment effect (LATE) estimator (Imbens & Angrist, 1994; Hahn et al., 2001) is used to provide an estimate of this effect.