Fact-checked by Grok 2 weeks ago

Gompertz distribution

The Gompertz distribution is a continuous probability distribution defined on the positive real line, characterized by a probability density function of the form f(x; \eta, \theta) = \frac{\eta}{\theta} \exp\left( \frac{\eta x}{\theta} + 1 - \exp\left(\frac{\eta x}{\theta}\right) \right) for x > 0, where \eta > 0 is a shape parameter and \theta > 0 is a scale parameter, and featuring an exponentially increasing hazard rate that makes it suitable for modeling aging-related failure processes. Named after the British actuary Benjamin Gompertz, the distribution derives from his 1825 proposal of a mathematical law describing human mortality as increasing geometrically with age, expressed through a force of mortality \mu(x) = B c^x for constants B > 0 and c > 1. This law forms the basis of the distribution's survival function S(x) = \exp\left( -\int_0^x \mu(t) \, dt \right) = \exp\left( -a (c^x - 1) \right), where a = B / \ln c > 0, leading to the cumulative distribution function F(x) = 1 - \exp\left( -a (c^x - 1) \right). Key properties include a hazard function h(x) = \frac{\eta}{\theta} \exp\left(\frac{\eta x}{\theta}\right) that starts low and rises exponentially, reflecting accelerating risk over time, and moments that involve the exponential integral function without closed-form expressions, such as the mean \mathbb{E}[X] = \frac{\theta}{\eta} \, e \, E_1(1), where E_1(z) = \int_z^\infty \frac{e^{-t}}{t} \, dt is the exponential integral. The distribution is positively skewed, unimodal for certain parameter values, and related to extreme value distributions through transformations, enabling simulation via inverse CDF methods with uniform random variables. The Gompertz distribution finds primary applications in actuarial science for pricing life insurance and annuities by modeling human mortality tables, in demography for analyzing population survival curves, and in reliability engineering for lifetimes of components subject to wear-out failures. Extensions, such as the Gompertz-Makeham distribution incorporating a constant baseline hazard, further enhance its utility in epidemiological studies of age-specific death rates.

Definition

Parameterization

The Gompertz distribution is a continuous probability distribution for a nonnegative random variable X with support on [0, \infty). It is defined using two positive parameters: the scale parameter \eta > 0, and the shape parameter b > 0, which represents the rate of exponential increase in mortality. The parameter \eta appears in the cumulative hazard function as \eta (e^{b x} - 1), influencing the overall scale of the distribution, while b controls the acceleration of the hazard over time, capturing the aging process. This parameterization establishes the foundational hazard structure h(x) = b \eta e^{b x}, from which the probability density function and other distribution functions are derived. The initial hazard rate at age zero is h(0) = b \eta. Alternative parameterizations are employed in applications such as actuarial modeling of adult mortality, where a shifted form starts at an initial age x_0 > 0 and uses an adjusted initial rate c = b \eta e^{b x_0} to reflect the effective mortality at that age, while retaining the same exponential growth rate b. The standard unshifted form, however, remains the primary reference for theoretical developments and general survival analysis.

Probability density function

The probability density function of the Gompertz distribution, parameterized by scale parameter \eta > 0 and shape parameter b > 0, is given by f(x; \eta, b) = b \eta \exp\left(\eta + b x - \eta e^{b x}\right), \quad x \geq 0. This form arises from the underlying Gompertz law of mortality, where the hazard rate h(x) = b \eta e^{b x} increases exponentially with age x. The survival function is then S(x) = \exp\left(-\int_0^x h(u) \, du\right) = \exp\left(-\eta (e^{b x} - 1)\right), and the PDF follows as f(x) = h(x) S(x), yielding the exponential structure above after simplification. To verify normalization, note that \int_0^\infty f(x; \eta, b) \, dx = -\int_0^\infty \frac{d}{dx} S(x) \, dx = -[S(\infty) - S(0)] = 1 - 0 = 1, confirming it integrates to unity over [0, \infty). The PDF shape is right-skewed, starting near zero at x = 0, rising to a single mode at x = \frac{1}{b} \ln\left(\frac{1}{\eta}\right) (provided \eta < 1; otherwise, the mode is at the boundary), and then decaying asymptotically to zero as x increases, reflecting accelerating failure rates typical in actuarial and reliability contexts.

Distribution functions

Cumulative distribution function

The cumulative distribution function (CDF) of the Gompertz distribution with parameters \eta > 0 and b > 0 is given by F(x; \eta, b) = 1 - \exp\left(-\eta (e^{b x} - 1)\right), \quad x \geq 0, and F(x; \eta, b) = 0 for x < 0. This formula arises as the integral of the probability density function from 0 to x, or equivalently as F(x) = 1 - S(x), where S(x) denotes the survival function. The CDF satisfies the boundary conditions F(0; \eta, b) = 0 and \lim_{x \to \infty} F(x; \eta, b) = 1. It is strictly increasing and continuous on [0, \infty), reflecting the positive support and non-negative density of the distribution. The quantile function, or inverse CDF, is the solution to F(q(p); \eta, b) = p for $0 < p < 1, yielding the explicit form q(p; \eta, b) = \frac{1}{b} \ln\left(1 + \frac{1}{\eta} \ln\left(\frac{1}{1 - p}\right)\right). This closed-form expression facilitates computation of quantiles, such as medians or percentiles, in applications like survival analysis.

Survival function

The survival function of the Gompertz distribution, denoted S(x; \eta, b), represents the probability that a lifetime exceeds age x \geq 0, and is given by S(x; \eta, b) = \exp\left( -\eta \left( e^{b x} - 1 \right) \right), where \eta > 0 is a scale parameter and b > 0 is a shape parameter controlling the rate of increase in mortality. This function is derived directly from the cumulative distribution function F(x; \eta, b) as S(x; \eta, b) = 1 - F(x; \eta, b), providing the complementary probability of survival beyond x after accounting for the accumulated risk up to that point. Key properties include S(0; \eta, b) = 1, indicating certain survival at age zero, and the function is strictly decreasing to \lim_{x \to \infty} S(x; \eta, b) = 0, reflecting inevitable mortality over time. Additionally, the natural logarithm of the survival function simplifies to \log S(x; \eta, b) = -\eta \left( e^{b x} - 1 \right), which highlights an exponential decay in survival probability, modulated by the age-dependent term e^{b x} that accelerates with increasing b. In reliability and survival analysis, S(x; \eta, b) quantifies the likelihood of system or organism endurance past duration x, making it valuable for modeling aging processes with exponentially rising failure rates, such as human mortality in actuarial contexts.

Hazard rate function

The hazard rate function, also known as the force of mortality in actuarial science, for the Gompertz distribution is given by h(x; \eta, b) = b \eta e^{b x} for x \geq 0, where \eta > 0 is a scale parameter and b > 0 is a shape parameter controlling the rate of increase in mortality. This explicit form is derived from the general definition of the hazard rate as the ratio of the probability density function to the survival function, h(x) = f(x)/S(x), or equivalently as h(x) = -\frac{d}{dx} \log S(x). Applying these to the Gompertz distribution's component functions results in the characteristic linear exponential expression. Key properties of the hazard function include its strict monotonicity: it is increasing for all x \geq 0 because the derivative h'(x) = b^2 \eta e^{b x} > 0. The initial value is h(0) = b \eta, and \lim_{x \to \infty} h(x) = \infty, reflecting an unbounded escalation in failure risk over time that suits modeling accelerating mortality. This functional form captures the Gompertz law of mortality, first articulated by Benjamin Gompertz in 1825, which describes how the intensity of mortality increases exponentially with age owing to a progressive, uniform weakening of the body's capacity to resist death across successive equal time periods.

Moments and generating functions

Raw moments

The raw moments of the Gompertz distribution, assuming the hazard function h(x) = a e^{b x} for x \geq 0 with parameters a > 0 and b > 0, are given by the general formula E[X^r] = \frac{r!}{b^r} \, e^{a/b} \, E_{r,1}\left( \frac{a}{b} \right), where E_{n,s}(z) = \int_1^\infty e^{-z t} t^{-n} \, dt is the generalized exponential integral function. The first raw moment, or mean, is thus E[X] = \frac{1}{b} \, e^{a/b} \, E_1\left( \frac{a}{b} \right), with E_1(z) = \int_z^\infty \frac{e^{-t}}{t} \, dt denoting the standard exponential integral; equivalently, E[X] = -\frac{1}{b} \, e^{a/b} \, \Ei\left( -\frac{a}{b} \right), where \Ei is the exponential integral function. This expression quantifies the central tendency, reflecting the distribution's heavy-tailed nature influenced by the aging rate b and initial hazard level a. The second raw moment is E[X^2] = \frac{2}{b^2} \, e^{a/b} \, E_2\left( \frac{a}{b} \right), where E_2(z) follows from the generalized form. The variance is obtained as \Var(X) = E[X^2] - (E[X])^2, yielding the explicit relation \Var(X) = \frac{2}{b^2} \, e^{a/b} \left[ -\frac{a}{b} \ {}_3F_3\left(1,1,1;2,2,2;\frac{a}{b}\right) + \frac{1}{2} \left( \frac{\pi^2}{6} + \left( \gamma + \ln \frac{a}{b} \right)^2 \right) \right] - (E[X])^2, with \gamma \approx 0.5772156649 the Euler-Mascheroni constant and {}_3F_3 the generalized hypergeometric function; this measures the spread, approaching \pi^2/(6 b^2) for small a/b. Higher-order raw moments follow the general formula involving E_{r,1}, but practical computation is challenging due to the need for numerical evaluation of the special functions, particularly for small a/b where singularities and logarithmic terms complicate stability.

Moment-generating function

The moment-generating function (MGF) of the Gompertz distribution, denoted M(t) = \mathbb{E}[e^{tX}], where X follows the distribution with parameters b > 0 and \eta > 0, is derived by integrating the probability density function: M(t) = \int_0^\infty e^{tx} \, b \eta e^{bx} \exp\left(-\eta (e^{bx} - 1)\right) \, dx. This integral can be evaluated using the substitution u = e^{bx}, which yields dx = du / (b u) and transforms the expression into a form involving the upper incomplete gamma function \Gamma(s, z) = \int_z^\infty v^{s-1} e^{-v} \, dv: M(t) = e^\eta \eta^{-t/b} \Gamma\left(1 + \frac{t}{b}, \eta\right), valid for all real t since all moments exist due to the rapid decay of the distribution's tail. The derivation proceeds by simplifying the exponent in the integrand to (t + b)x - \eta e^{bx}, applying the substitution to obtain \eta e^\eta \int_1^\infty u^{t/b} e^{-\eta u} \, du, and then rescaling the variable v = \eta u to recognize the incomplete gamma form after adjusting the power and limits. This relation to the incomplete gamma function also connects the MGF to the Laplace transform of the distribution, which is M(-t). The MGF facilitates the computation of moments and cumulants through differentiation: the k-th raw moment is M^{(k)}(0). However, explicit evaluation of M(t) or its derivatives often requires numerical methods, as the incomplete gamma function lacks a simpler closed-form expression in elementary functions for arbitrary parameters. The raw moments, obtainable via this differentiation, provide key descriptive statistics but are not expressible in elementary terms for general k.

Parameter estimation

Maximum likelihood estimation

Maximum likelihood estimation (MLE) provides optimal estimators for the parameters η and b of the Gompertz distribution under standard regularity conditions, utilizing the full information from the observed data. This approach is particularly valuable in applications involving lifetime data, where the distribution often arises in modeling increasing failure rates. For a random sample of n uncensored observations x_1, \dots, x_n from the Gompertz distribution, the likelihood function is given by L(\eta, b \mid \mathbf{x}) = \prod_{i=1}^n f(x_i \mid \eta, b), where f(x \mid \eta, b) is the probability density function. In the presence of right-censoring, the likelihood incorporates the survival function S(x \mid \eta, b) for censored observations, yielding L(\eta, b \mid \mathbf{x}, \boldsymbol{\delta}) = \prod_{i=1}^n \left[ f(x_i \mid \eta, b) \right]^{\delta_i} \left[ S(x_i \mid \eta, b) \right]^{1 - \delta_i}, with \delta_i = 1 for uncensored and \delta_i = 0 for censored cases. The log-likelihood for uncensored data simplifies to \begin{align*} l(\eta, b \mid \mathbf{x}) &= \sum_{i=1}^n \left[ \log(b \eta) + \eta + b x_i - \eta e^{b x_i} \right] \ &= n \log(b \eta) + n \eta + b \sum_{i=1}^n x_i - \eta \sum_{i=1}^n e^{b x_i}. \end{align*} Maximizing this function requires solving the score equations \partial l / \partial \eta = 0 and \partial l / \partial b = 0, which generally lack closed-form solutions due to their nonlinearity and must be addressed via numerical optimization methods such as Newton-Raphson iteration. For censored data, the log-likelihood takes a similar form but adjusts the summation over uncensored terms for the density contributions. Under standard assumptions, the MLEs \hat{\eta} and \hat{b} are consistent and asymptotically efficient, with asymptotic normality \sqrt{n} (\hat{\theta} - \theta) \to \mathcal{N}(0, \mathcal{I}(\theta)^{-1}), where \theta = (\eta, b) and \mathcal{I}(\theta) is the Fisher information matrix. Standard errors for inference can be obtained from the inverse of the observed Hessian matrix evaluated at the MLE. The nonlinear nature of the optimization can lead to convergence difficulties, especially with small sample sizes or highly skewed data, potentially requiring careful initial parameter guesses or alternative algorithms like BFGS. Implementations are available in statistical software, such as the mlgompertz function in the R package univariateML for uncensored data using Newton-Raphson. For survival analysis with censoring, R's flexsurv package supports Gompertz models via parametric fitting.

Method of moments estimation

The method of moments (MoM) estimation for the two-parameter Gompertz distribution equates the sample mean \bar{x} and sample variance s^2 to the theoretical mean E[X] and variance \mathrm{Var}(X), respectively, yielding a system of nonlinear equations solved numerically for the parameters \eta > 0 and b > 0. The theoretical mean is given by E[X] = \frac{e^{\eta} E_1(\eta)}{b}, where E_1(\cdot) denotes the exponential integral function E_1(z) = \int_z^\infty \frac{e^{-t}}{t} \, dt. The variance is \mathrm{Var}(X) = \frac{2}{b^2} e^{\eta} \left[ -\eta \, {}_3F_3\left(1,1,1; 2,2,2; \eta \right) + \frac{1}{2} \left( \frac{\pi^2}{6} + (\gamma + \ln \eta)^2 \right) \right] - (E[X])^2, with \gamma \approx 0.57721 the Euler-Mascheroni constant and {}_3F_3(\cdot) the generalized hypergeometric function; alternatively, it can be computed from the second raw moment minus the squared mean, where raw moments are derived via the Laplace transform or integration. To solve, an initial estimate of b is obtained from the variance-mean relation, often using the approximation \mathrm{Var}(X) \approx \pi^2 / (6 b^2) valid for small \eta, so b \approx \pi / \sqrt{6 s^2}, though the full relation requires iteration to account for \eta-dependence. With b fixed, \eta is then found by numerically inverting e^{\eta} E_1(\eta) = b \bar{x} via methods like Newton-Raphson, as no closed-form solution exists. This approach leverages the raw moments of the distribution for straightforward computation from sample summaries. MoM offers simplicity and serves as a quick initial estimate for more sophisticated methods, but it is generally less statistically efficient than maximum likelihood estimation and can be sensitive to deviations from the assumed distributional form, particularly in the tails.

Characteristic properties

Parameter effects on shape

In the general parameterization of the Gompertz distribution with hazard function h(x) = \eta \exp(b x) for \eta > 0 (initial hazard rate) and b > 0 (growth rate, independent of \eta), larger values of \eta elevate the baseline hazard across the support, resulting in higher initial probability density and a leftward shift in the mass of the distribution toward lower values of x, as failures become more likely at earlier stages. This effect is evident in the probability density function (PDF), where increased \eta amplifies the density near x = 0 while compressing the tail, thereby raising the early hazard rate and reducing the expected lifetime. In contrast, larger values of b accelerate the exponential growth of the hazard function, leading to a slower initial accumulation of risk followed by a more rapid escalation later on. This produces a more pronounced right skew in the PDF and cumulative distribution function (CDF), with the bulk of the probability mass concentrating at higher ages or times, reflecting delayed but intensified failure risks. The survival function decays more gradually at first but drops sharply thereafter, emphasizing the distribution's utility in modeling accelerating aging processes. Visualizations of parameter families illustrate these dynamics clearly. For fixed b, plots of the PDF with varying \eta show densities starting higher and tapering more quickly as \eta increases, often transitioning from unimodal to monotonically decreasing shapes when \eta exceeds b. Similarly, for fixed \eta, increasing b yields families of PDFs and hazard rates that exhibit delayed peaking, with the mode shifting rightward and the hazard curve steepening exponentially; the CDF rises more slowly initially before accelerating. The mode, when it exists (for b > \eta), is located at x = \frac{1}{b} \ln\left(\frac{b}{\eta}\right), though this position sensitively depends on the relative magnitudes of the parameters. Small perturbations in b are particularly influential in fitting real-world data, as they capture subtle accelerations in aging or failure rates, such as in human mortality tables where even minor increases in b align the model with observed rises in late-life hazards without overpredicting early events. This parameter sensitivity underscores b's role in tailoring the distribution to empirical trends in demography and reliability.

Information measures

The Kullback-Leibler (KL) divergence is a measure of the difference between two probability distributions, defined as D_{\text{KL}}(P \parallel Q) = \int f_P(x) \log \left( \frac{f_P(x)}{f_Q(x)} \right) dx, where f_P and f_Q are the probability density functions of distributions P and Q. For the Gompertz distribution, a closed-form expression exists for the KL divergence between two Gompertz distributions with parameters (b_1, q_1) and (b_2, q_2), where the pdf is given by f(x \mid b, q) = e^q b q e^{b x} e^{-q e^{b x}} for x \geq 0 and b, q > 0. This expression is D_{\text{KL}}(F_1 \parallel F_2) = \ln\left( \frac{e^{q_1} b_1 q_1}{e^{q_2} b_2 q_2} \right) + e^{q_1} \left[ \left( \frac{b_2}{b_1} - 1 \right) \text{Ei}(-q_1) + \frac{q_2}{q_1} q_1^{-b_2/b_1} \Gamma\left( \frac{b_2}{b_1} + 1, q_1 \right) \right] - (q_1 + 1), involving the exponential integral \text{Ei}(\cdot) and the upper incomplete gamma function \Gamma(s, x). Computing the KL divergence for a Gompertz distribution against alternatives like the Weibull distribution typically requires numerical integration due to the lack of closed forms, as the integral involves products of exponentials that do not simplify analytically. Approximations or quadrature methods, such as Gauss-Laguerre integration, are commonly employed for such cases. Exact expressions for Gompertz-Weibull comparisons remain unavailable in closed form, though general derivations for univariate continuous distributions provide frameworks using moment-generating functions for numerical evaluation. In applications, the KL divergence aids model selection by quantifying how well a Gompertz model approximates empirical data compared to alternatives, and supports goodness-of-fit testing in lifetime analysis, such as mortality modeling where distinguishing Gompertz from Weibull fits is crucial. Other information measures for the Gompertz distribution include the Shannon entropy, defined as H = -\int f(x) \log f(x) \, dx, which quantifies uncertainty in the distribution. No closed-form expression exists for the Shannon entropy of the standard Gompertz distribution, necessitating numerical estimation via methods like Monte Carlo integration or series expansions. Comparative studies of multiple entropy measures (e.g., Rényi, Tsallis) for Gompertz distributions also rely on numerical computation to assess properties like truncation effects in reliability contexts.

Gompertz–Makeham distribution

The Gompertz–Makeham distribution extends the Gompertz distribution by incorporating an additional constant term in the hazard function to account for age-independent mortality risks, such as accidents. The hazard function is given by
h(t) = a + b e^{c t},
where t \geq 0 represents age, a > 0 is the constant baseline hazard, b > 0 scales the initial level of the age-dependent component, and c > 0 governs the exponential rate of increase with age. This formulation was originally proposed by William Matthew Makeham to improve the modeling of human mortality patterns observed in actuarial data.
The probability density function (PDF) and cumulative distribution function (CDF) derive from the hazard via the survival function S(t) = \exp\left( -\int_0^t h(u) \, du \right), yielding
f(t) = (a + b e^{c t}) \exp\left( -a t - \frac{b}{c} (e^{c t} - 1) \right)
for the PDF and
F(t) = 1 - \exp\left( -a t - \frac{b}{c} (e^{c t} - 1) \right)
for the CDF. These closed-form expressions facilitate analytical computations in survival analysis without requiring numerical integration.
When the baseline parameter a = 0, the distribution reduces to the standard Gompertz distribution with hazard b e^{c t} and parameters b and c. The inclusion of the positive constant a captures mortality causes unrelated to aging, such as external hazards, which enhances the model's empirical fit to human lifetime data across various populations by addressing deviations from pure exponential growth in observed death rates at younger ages.

Comparisons with other lifetime distributions

The Gompertz distribution features a strictly increasing hazard rate that grows exponentially without an upper bound, distinguishing it from the Weibull distribution, whose hazard can be constant, monotonically increasing, or decreasing based on the shape parameter, providing greater flexibility for diverse failure mechanisms but less specificity for age-accelerated mortality in human populations. In analyses of human mortality data, the Gompertz model outperforms the Weibull for all-cause deaths and aggregated disease categories due to its better alignment with observed exponential mortality escalation in adulthood, while the Weibull excels for isolated causes of death. Compared to the exponential distribution, which assumes a constant hazard rate suitable only for scenarios without time-dependent risk variation, the Gompertz extends this by introducing exponential growth in the hazard, thereby capturing essential age effects in lifetime data that the exponential overlooks. The Gompertz shares skewness characteristics with the gamma and lognormal distributions but maintains a purely exponential increasing hazard, unlike the gamma's flexible monotonic (increasing or decreasing) but non-exponential form or the lognormal's non-monotonic hazard that rises to a peak before declining, making the Gompertz ideal for applications emphasizing consistent hazard acceleration. Selection among these distributions often relies on visual inspection of hazard plots to assess shape alignment or quantitative evaluation via the Akaike Information Criterion (AIC) from maximum likelihood fits, favoring the model with the lowest AIC for parsimonious balance of fit and complexity.

Applications

Mortality and demography

The Gompertz distribution plays a central role in actuarial science by modeling adult human mortality for the construction of life tables and the pricing of annuities and life insurance products, capturing the exponential rise in death rates that begins after infancy. Its hazard rate function, which increases exponentially with age, provides a reliable basis for projecting survival probabilities in these financial applications, enabling actuaries to assess risks and set premiums accurately for policies spanning decades. In demography, the Gompertz law—embodied by the distribution—describes observed patterns of longevity across human populations and other species, where the force of mortality rises exponentially after early adulthood. The parameters η (initial mortality level) and b (aging rate) exhibit variation by gender, species, and population; typical values for the aging rate b range from 0.085 to 0.095, with variations by population and gender; studies show slightly higher b for males in some cohorts, indicating faster aging. These differences highlight how environmental and biological factors influence longevity, with lower b values correlating to slower mortality acceleration in long-lived species. Extensions of the Gompertz model address cohort-specific effects, such as improvements in early-life conditions that alter later mortality trajectories, and incorporate period influences like medical advances, which have contributed to reductions in the parameter b over time in certain cohorts by mitigating age-related vulnerabilities. Empirical studies consistently demonstrate the Gompertz distribution's fit to human mortality data, revealing an exponential increase in death rates starting from around age 30, as evidenced by analyses of large-scale datasets like the Human Mortality Database. Post-2020 data further illustrate this, with COVID-19 causing excess mortality that follows a Gompertzian age pattern, amplifying rates particularly among older adults and temporarily elevating parameters in affected populations.

Reliability engineering

In reliability engineering, the Gompertz distribution is employed to model wear-out failures in engineered systems, where the hazard rate increases exponentially with time, reflecting progressive degradation in components such as electronics or materials under sustained stress. The hazard function h(t) = \lambda \phi^t (with \phi > 1) captures this acceleration without an initial decreasing phase, making it suitable for scenarios dominated by aging rather than infant mortality. For instance, in complex network systems exhibiting wear-out behavior, the lifetime distribution emerges as Gompertz-like when system size dominates, akin to parallel configurations with short-range interactions. Applications include predicting component lifetimes in aerospace engineering, such as CubeSats, where the distribution addresses wear-out failures under environmental stresses like radiation or thermal cycling. In accelerated life testing (ALT), the Gompertz model is adapted under constant or step stress, with the shape parameter \phi scaling according to stress levels to extrapolate normal-use reliability from compressed test data. This approach is particularly valuable for high-stakes systems requiring rapid validation, as it allows estimation of survival probabilities and mean time to failure via the survival function S(t) = \exp\left( -\frac{\lambda}{\ln \phi} (\phi^t - 1) \right). Case studies demonstrate its utility in computing and networking. For computer hardware and software reliability, the Gompertz model fits failure rates that intensify over operational time, as validated empirically on fault data from software testing phases. In network node reliability, path lengths of self-avoiding walks on Erdős–Rényi networks follow a Gompertz distribution, implying increasing termination probabilities that inform failure propagation and system resilience. Compared to the Weibull distribution, which accommodates bathtub-shaped hazards for mixed failure modes, the Gompertz excels in purely accelerating failure scenarios without early-life defects, offering a simpler parameterization for targeted wear-out analysis.

Economic modeling

The Gompertz distribution finds significant application in economic modeling of customer lifetime value, where it captures the increasing hazard rate as a defection or churn probability over time. In marketing analytics, this hazard represents the escalating likelihood of customers discontinuing purchases or subscriptions, enabling firms to forecast revenue streams and design retention interventions. A key contribution is the gamma/Gompertz (G/G) model proposed by Bemmaor and Glady, which incorporates unobserved heterogeneity via gamma mixing on Gompertz lifetimes, outperforming the Pareto distribution in flexibility for modeling sudden customer "death" events in empirical datasets from various markets. This approach has been validated on six consumer markets, demonstrating superior fit for non-monotonic interpurchase patterns and aiding in precise valuation of long-tail customer contributions. In product adoption and diffusion processes, the Gompertz distribution models the trajectory toward market saturation or obsolescence, interpreting the rising hazard as a slowdown in new adoptions as potential users diminish. This S-shaped cumulative distribution aligns with economic theories of innovation spread, where initial rapid uptake decelerates due to market exhaustion or competitive substitution. For example, Naseri and Elliott compared the Gompertz model with Bass and logistic alternatives on Australian online shopping adoption data from 1998–2009, finding it effectively captures the asymmetric deceleration phase toward an upper asymptote, with better predictive accuracy for mature markets. Similarly, in forecasting technology product lifecycles, the model has been applied to predict obsolescence quantities, such as in mobile phones, by estimating the increasing rate of market exit or replacement needs. Beyond consumer markets, the Gompertz distribution informs modeling of insurance lapse rates, treating policy duration as a lifetime with an exponentially increasing termination hazard influenced by economic factors like interest rates or policyholder finances. Valdez et al. analyzed life insurance policy survivorship using the Gompertz framework, revealing how lapses accelerate over time and impose substantial costs on insurers through unrecouped acquisition expenses. In broader economic growth contexts, the distribution—often via its cumulative form—describes decelerating expansion phases in resource-limited systems, such as GDP trajectories approaching steady states; adaptations invert the hazard to model saturation from below, as generalized by Jarne et al. for logistic-like economic dynamics. Post-2020 applications have extended Gompertz-based models to subscription services analytics, particularly in digital economies where churn prediction supports scalable retention amid volatile user behaviors. The G/G framework continues to demonstrate utility for heterogeneous subscription cohorts in e-commerce and SaaS platforms using advanced data methods. Parameter estimation in these economic settings typically relies on maximum likelihood methods adapted to transactional data for robust fitting.

History

Benjamin Gompertz's contribution

Benjamin Gompertz, a British actuary, introduced the foundational law of human mortality in his 1825 paper presented to the Royal Society on June 16, titled "On the Nature of the Function Expressive of the Law of Human Mortality, and on a New Mode of Determining the Value of Life Contingencies." Motivated by the need to refine calculations for life insurance premiums and annuities amid the expanding commercial insurance sector in early 19th-century Britain, Gompertz sought a mathematical function that could accurately model age-related increases in mortality rates. His work built on earlier actuarial efforts, addressing the limitations of existing life tables by proposing a more precise expression for survival probabilities. Gompertz formulated the law such that the force of mortality, denoted as the rate at which mortality increases with age, is proportional to an exponential function: μ(x) ∝ e^{c x}, where x represents age and c is a constant reflecting the rate of increase. This model implied that the number of survivors at age x follows a form like d · g^{q x}, with d, g, and q as adjustable parameters, where q relates to c and g to the base of the exponentiation. These multiplicative constants served as precursors to the modern parameters η (initial mortality level) and b (exponential growth rate) in the Gompertz distribution. To establish empirical validity, Gompertz fitted his model to contemporary UK life tables, including those from Carlisle, Northampton, and the Equitable Society's experience, demonstrating close alignment particularly for adult ages. For instance, using Carlisle data at 3% interest, he calculated annuity premiums such as £1 0s 4d at age 30 and £7 16s 9d at age 70, highlighting the practical actuarial utility. However, the formulation explicitly focused on adult mortality, overlooking high infant mortality rates observed in the era's data, as Gompertz noted its inapplicability to early life stages. This limitation reflected the model's intent to capture the accelerating mortality risk in later adulthood rather than comprehensive lifespan dynamics.

Modern extensions

In the early 20th century, researchers began seeking biological explanations for the Gompertz law of mortality, attributing the exponential increase in death rates to processes like the accumulation of physiological damage or wear-out of vital components. By the mid-20th century, the model was formalized as a continuous probability distribution in actuarial science and demography, enabling its use in life table construction and survival analysis through parametric frameworks. The Makeham extension, originally proposed in 1860 to incorporate an age-independent mortality component, was integrated into these probabilistic formulations during this period, enhancing the model's applicability across the full lifespan. Advancements in computing from the 1980s facilitated numerical methods for parameter estimation, as closed-form solutions for maximum likelihood estimators are unavailable, requiring iterative optimization techniques like Newton-Raphson algorithms. This enabled more precise fitting to empirical data in reliability and survival contexts. By the late 20th and early 21st centuries, the Gompertz distribution was incorporated into statistical software, such as R's flexsurv and eha packages, supporting flexible survival modeling and Bayesian inference. Post-2020 developments have seen the Gompertz distribution applied to model excess mortality during the COVID-19 pandemic, capturing the accelerated death rates in affected populations through extensions like generalized forms for dynamic forecasting. Critiques of the model's universality highlight variations in the shape parameter b across species, with lower values in long-lived mammals compared to shorter-lived ones, suggesting context-specific adaptations rather than a fixed biological constant. The original assumptions of the Gompertz law have been challenged by observed increases in human longevity, which correlate with a secular decline in the initial mortality rate rather than the aging rate b, which has remained relatively stable since the early 20th century, indicating that medical and environmental interventions primarily affect baseline hazards rather than flattening the exponential trajectory.