Truncated normal distribution
The truncated normal distribution is a continuous probability distribution derived from the normal distribution by restricting the support to a finite interval [a, b], where the density is zero outside this range and renormalized within it to ensure the total probability integrates to one.[1] It is parameterized by the mean μ and standard deviation σ of the underlying normal distribution, along with the lower truncation point a and upper truncation point b, where typically -∞ ≤ a < b ≤ ∞.[2] This distribution arises naturally when a normally distributed random variable is conditioned to lie within specified bounds, preserving many properties of the normal while avoiding extreme values.[1] The probability density function (PDF) of the truncated normal distribution is given byf(x \mid \mu, \sigma, a, b) = \frac{\phi\left(\frac{x - \mu}{\sigma}\right)}{\sigma \left[ \Phi\left(\frac{b - \mu}{\sigma}\right) - \Phi\left(\frac{a - \mu}{\sigma}\right) \right]}
for a < x < b, and 0 otherwise, where \phi and \Phi denote the standard normal PDF and CDF, respectively.[2] The cumulative distribution function (CDF) is similarly adjusted as
F(x \mid \mu, \sigma, a, b) = \frac{\Phi\left(\frac{x - \mu}{\sigma}\right) - \Phi\left(\frac{a - \mu}{\sigma}\right)}{\Phi\left(\frac{b - \mu}{\sigma}\right) - \Phi\left(\frac{a - \mu}{\sigma}\right)}
for a < x < b.[2] The mean is \mu + \sigma \frac{\phi(\alpha) - \phi(\beta)}{Z}, where \alpha = (a - \mu)/\sigma, \beta = (b - \mu)/\sigma, and Z = \Phi(\beta) - \Phi(\alpha), while the variance is \sigma^2 \left[ 1 + \frac{\alpha \phi(\alpha) - \beta \phi(\beta)}{Z} - \left( \frac{\phi(\alpha) - \phi(\beta)}{Z} \right)^2 \right].[1] These moments differ from those of the untruncated normal, with the mean shifting toward the center of the interval and the variance typically decreasing as the truncation narrows.[2] In statistics and econometrics, the truncated normal distribution is essential for analyzing data subject to truncation or censoring, such as income levels above a reporting threshold.[3] It forms the basis for models like truncated regression, which corrects for selection bias in samples where observations below or above certain cutoffs are excluded, as seen in studies of earnings distributions.[3] Applications extend to queueing theory, where it models stationary waiting times in single-server queues with impatient customers under heavy traffic conditions, and to robust estimation in location and regression problems by simplifying asymptotic theory.[4][5] Additionally, it supports efficient computational methods, including sampling algorithms and quadrature for multidimensional stochastic modeling.[1]