Probabilistic forecasting
Probabilistic forecasting is a statistical methodology that generates a predictive probability distribution over future quantities or events of interest, rather than a single point estimate, in order to account explicitly for uncertainty in predictions.[1] This approach aims to produce distributions that are both calibrated, meaning the predicted probabilities align reliably with observed frequencies, and sharp, meaning they are as concentrated as possible given the information available.[1] By providing a full spectrum of possible outcomes with their likelihoods, probabilistic forecasting supports informed decision-making under uncertainty, distinguishing it from deterministic methods that yield only a single expected value.[2]

The foundations of probabilistic forecasting trace back to early Bayesian statistical models in the 1960s, evolving through advances in time series analysis, quantile regression, and decision theory in the late 1970s.[3] Over time, it has incorporated machine learning techniques such as random forests, gradient boosting, and deep learning to estimate predictive distributions more flexibly, often using methods like conformal prediction or generative models.[3]

Key evaluation tools include proper scoring rules such as the continuous ranked probability score (CRPS), which measure overall accuracy, and probability integral transform (PIT) histograms, which assess calibration, ensuring forecasts are both reliable and informative.[1] These metrics emphasize the dual goals of sharpness and calibration, guiding the development of robust models.[1]

Probabilistic forecasting finds broad application across diverse domains, particularly where uncertainty quantification is critical for risk management.[3] In meteorology, it underpins ensemble numerical weather prediction systems such as the European Centre for Medium-Range Weather Forecasts' (ECMWF) Ensemble (ENS), which generates multiple scenarios to model weather probabilities;[1] recent machine learning systems such as GenCast (2024) and FuXi-ENS (2025) have surpassed traditional methods in skill and speed for global 15-day forecasts.[4][5] In energy systems, it enables wind and solar power predictions in the form of probability densities or intervals, aiding grid stability and resource planning.[2] Other notable uses include economic forecasting for market volatility, population projections in demographics, and supply chain optimization to mitigate demand risks.[3] These applications highlight its value in improving decision processes, from preparing for extreme events to financial modeling.[4]

Fundamentals
Definition and Principles
Probabilistic forecasting is a predictive approach that expresses uncertainty about future events or quantities through a full probability distribution, rather than a single point estimate. This method provides a comprehensive view of possible outcomes and their likelihoods, enabling users to assess risks and make informed decisions under uncertainty. Unlike deterministic forecasts, which offer only a best-guess value, probabilistic forecasts quantify the range of potential results, often in the form of density or distribution functions that capture the variability inherent in complex systems.[1]

At its core, probabilistic forecasting distinguishes between two fundamental types of uncertainty: aleatory uncertainty, which reflects the inherent randomness or variability in the process being forecast, and epistemic uncertainty, which arises from incomplete knowledge, model limitations, or insufficient data. Aleatory uncertainty is irreducible and represents the stochastic nature of the outcome, while epistemic uncertainty can potentially be reduced through additional information or improved modeling. Forecasts are typically represented using probability density functions (PDFs), which describe the likelihood of continuous outcomes, or cumulative distribution functions (CDFs), which give the probability that the outcome falls below a certain value. A basic representation of such a forecast is the conditional probability P(Y \leq y \mid X), where Y denotes the future outcome of interest and X represents the available input data or covariates.[6][1]

The principles of probabilistic forecasting emphasize calibration, ensuring that predicted probabilities align with observed frequencies, and sharpness, which seeks the most concentrated distribution consistent with the data. These principles guide the construction of forecasts to be both reliable and informative, often drawing on statistical decision theory for evaluating their utility in decision-making contexts. The early conceptual foundations of this approach emerged from statistical decision theory in the mid-20th century, which formalized methods for reasoning under uncertainty using probability as a tool for optimal choices.[1][7]
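The representation P(Y \leq y \mid X) and the calibration principle can be made concrete with a minimal sketch in Python (using NumPy and SciPy). It assumes a hypothetical Gaussian predictive distribution whose conditional mean and spread are invented for illustration; the PIT check at the end is a rough diagnostic, not a formal test:

    # Minimal sketch: a Gaussian predictive distribution as a probabilistic forecast.
    # The conditional model y ~ N(2x, 1) and the data-generating process below are
    # illustrative assumptions, not taken from any particular forecasting system.
    import numpy as np
    from scipy import stats

    rng = np.random.default_rng(0)

    def predictive_cdf(y, x):
        """P(Y <= y | X = x) under the assumed Gaussian predictive distribution."""
        mean, std = 2.0 * x, 1.0              # hypothetical conditional mean and spread
        return stats.norm.cdf(y, loc=mean, scale=std)

    # A deterministic forecast would report only the mean 2x; the probabilistic
    # forecast reports a full distribution, e.g. the chance of staying below a threshold.
    print("P(Y <= 4 | X = 1.5) =", predictive_cdf(4.0, 1.5))

    # Calibration check via the probability integral transform (PIT): if the forecasts
    # are calibrated, the values F(y_obs) should look approximately Uniform(0, 1).
    xs = rng.uniform(0.0, 3.0, size=1000)
    ys = 2.0 * xs + rng.normal(0.0, 1.0, size=1000)  # outcomes from the assumed process
    pit = predictive_cdf(ys, xs)
    counts, _ = np.histogram(pit, bins=10, range=(0.0, 1.0))
    print("PIT histogram counts (roughly flat if calibrated):", counts)

Because the outcomes here are drawn from the same model used to forecast them, the PIT histogram comes out roughly flat; a misspecified spread would instead show up as a U-shaped or hump-shaped histogram.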
Comparison to Deterministic Forecasting
Deterministic forecasting provides a single-point prediction for future outcomes, such as estimating wind speed at a specific value for energy planning, without accounting for inherent uncertainties in the underlying processes.[2] In contrast, probabilistic forecasting generates a full probability distribution over possible outcomes, such as the probability of precipitation exceeding a threshold in weather models, thereby representing uncertainty explicitly.[4] This fundamental difference allows probabilistic methods to quantify variability and risk, whereas deterministic approaches often lead to overconfidence by presenting predictions as precise and unqualified.[8]

The key advantages of probabilistic forecasting lie in its support for informed decision-making under uncertainty, particularly in domains like policy formulation and investment strategy, where understanding the potential range of outcomes is crucial for risk management.[9] For instance, in economic forecasting, deterministic models might project a single value for GDP growth, while probabilistic approaches provide confidence intervals based on historical forecast errors, as in the Federal Open Market Committee's (FOMC) Summary of Economic Projections, enabling better assessment of downside risks and scenario planning.[9][10] However, probabilistic methods typically incur higher computational costs due to the need for ensemble simulations or distribution modeling, making them more resource-intensive than the straightforward point estimates of deterministic forecasting.[8]
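As a simplified sketch of the error-based interval construction mentioned above, the example below widens a hypothetical point forecast using empirical quantiles of past forecast errors; all numbers are invented, and the method is only loosely analogous to the FOMC's published uncertainty ranges:

    # Sketch: turning a point forecast into an interval using past forecast errors.
    # The point forecast and error history are illustrative, not real FOMC data.
    import numpy as np

    point_forecast = 2.1                       # hypothetical GDP growth forecast (%)
    past_errors = np.array([-1.3, 0.4, 0.9, -0.6, 1.1, -0.2, 0.7, -0.9])  # actual - forecast

    # Empirical 10th and 90th percentiles of historical errors give an 80% interval.
    lo, hi = np.percentile(past_errors, [10, 90])
    print(f"Deterministic forecast: {point_forecast:.1f}%")
    print(f"80% interval from past errors: [{point_forecast + lo:.1f}%, {point_forecast + hi:.1f}%]")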
Methods and Techniques
Ensemble Methods
Ensemble methods in probabilistic forecasting generate probability distributions by running multiple simulations of a forecasting model, typically with variations in initial conditions or parameters to capture uncertainty. This approach samples the underlying probability distribution of future outcomes, providing a range of possible forecasts rather than a single deterministic prediction. By aggregating these simulations, known as ensemble members, forecasters can estimate statistical properties such as means, variances, and probabilities of specific events.[11]

Key techniques for creating ensembles include initial condition ensembles, where perturbations are applied to the starting states of the model to represent uncertainties in observations, and perturbed parameter ensembles, which vary uncertain model parameters across members to account for structural deficiencies in the model. Another prominent method is the breeding of growing modes, which iteratively rescales differences between forecast runs to identify and amplify the most unstable directions in the system's dynamics, a technique particularly useful in weather models where errors grow rapidly. These techniques allow ensembles to simulate the propagation of uncertainties through the model dynamics.[12][11]

The forecast probability distribution is approximated by the empirical distribution of the ensemble members \{y_1, y_2, \dots, y_N\}, where the predictive mean is given by \bar{y} = \frac{1}{N} \sum_{i=1}^N y_i and the variance by \sigma^2 = \frac{1}{N-1} \sum_{i=1}^N (y_i - \bar{y})^2. This nonparametric representation uses the member values directly to infer probabilities, such as the fraction of members exceeding a threshold as an estimate of event likelihood.

Ensemble methods originated in meteorology during the 1990s, with the European Centre for Medium-Range Weather Forecasts (ECMWF) launching its operational Ensemble Prediction System (EPS) in December 1992 to provide probabilistic medium-range forecasts. The approach quickly spread to other domains, including hydrology and economics, where multiple model runs help quantify forecast reliability. A notable example is the U.S. National Centers for Environmental Prediction's Global Ensemble Forecast System (GEFS), which generates 31 ensemble members (30 perturbed plus 1 control) for probabilistic outlooks up to 35 days ahead, with higher-resolution forecasts to 16 days, aiding decisions on severe weather and climate variability.[13][14]
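The ensemble summaries above reduce to a few lines of code. The sketch below uses synthetic members in place of real model runs (the 31-member size mirrors GEFS, but the values are invented), computing the predictive mean, the sample variance, and a threshold-exceedance probability:

    # Sketch: turning ensemble members into probabilistic summaries.
    # Synthetic wind-speed "members" stand in for runs of a real ensemble system.
    import numpy as np

    rng = np.random.default_rng(42)
    members = 15.0 + 3.0 * rng.standard_normal(31)   # 31 hypothetical forecasts (m/s)

    mean = members.mean()                            # predictive mean \bar{y}
    var = members.var(ddof=1)                        # sample variance with N - 1 divisor
    p_exceed = np.mean(members > 20.0)               # fraction of members above a threshold

    print(f"ensemble mean = {mean:.2f} m/s, variance = {var:.2f}")
    print(f"estimated P(wind speed > 20 m/s) = {p_exceed:.2f}")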
Bayesian and Parametric Approaches
Bayesian forecasting incorporates uncertainty by starting with prior distributions over model parameters, which are updated using observed data through Bayes' theorem to produce posterior distributions. This process yields posterior predictive distributions that quantify the full range of possible future outcomes, enabling probabilistic statements about forecasts rather than point estimates. The approach is particularly valuable in settings where data is limited or noisy, as it formally propagates uncertainty from priors through to predictions.[15]

The posterior distribution is formally defined as

p(\theta \mid y_{1:T}) \propto p(y_{1:T} \mid \theta) \, p(\theta),

where p(y_{1:T} \mid \theta) is the likelihood and p(\theta) is the prior, with the normalizing constant p(y_{1:T}) ensuring the posterior integrates to 1. The predictive distribution for a new observation then follows as

p(y_{T+1} \mid y_{1:T}) = \int p(y_{T+1} \mid \theta, y_{1:T}) \, p(\theta \mid y_{1:T}) \, d\theta,

which marginalizes over the posterior to provide the forecast distribution. In practice, this integral is often approximated numerically.[15]
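When the model admits a conjugate prior (see below), the predictive integral is available in closed form. The following sketch assumes a normal likelihood with known observation variance and a normal prior on its mean, with invented data and an i.i.d. structure rather than a genuine time-series model; it is meant only to illustrate how the posterior and predictive distributions are obtained:

    # Sketch: closed-form posterior and posterior predictive for a conjugate
    # normal model with known observation variance (illustrative assumptions).
    import numpy as np

    y = np.array([1.8, 2.4, 2.1, 1.6, 2.9])   # observed values y_{1:T} (made up)
    sigma2 = 0.5 ** 2                          # assumed known observation variance
    mu0, tau0_2 = 0.0, 1.0 ** 2                # prior: theta ~ N(mu0, tau0^2)

    # Posterior p(theta | y_{1:T}) is normal, with precision-weighted mean.
    n = y.size
    tau_n2 = 1.0 / (1.0 / tau0_2 + n / sigma2)
    mu_n = tau_n2 * (mu0 / tau0_2 + y.sum() / sigma2)

    # Posterior predictive p(y_{T+1} | y_{1:T}) integrates the likelihood over the
    # posterior; in this conjugate case it is again normal, with inflated variance.
    pred_mean, pred_var = mu_n, tau_n2 + sigma2

    print(f"posterior:  theta | y_1:T ~ N({mu_n:.3f}, {tau_n2:.3f})")
    print(f"predictive: y_T+1 | y_1:T ~ N({pred_mean:.3f}, {pred_var:.3f})")

The predictive variance tau_n2 + sigma2 combines epistemic uncertainty about the mean with the irreducible observation noise, mirroring the aleatory/epistemic distinction introduced earlier.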
Parametric methods within probabilistic forecasting assume that forecast outcomes follow a specific distributional family, such as the normal or log-normal distribution, characterized by a small number of parameters like mean and variance. These assumptions allow for efficient estimation of the full probability density using techniques like maximum likelihood, facilitating the generation of prediction intervals and quantiles. Quantile regression extends this by directly estimating conditional quantiles of the response variable through minimization of a quantile loss function, bypassing full distributional assumptions to produce interval forecasts that capture heteroscedasticity and asymmetry in uncertainty.[16][17]

For computational efficiency, conjugate priors are employed in simpler Bayesian models, where the prior and likelihood belong to the same exponential family, resulting in a posterior of the same form and enabling closed-form updates without numerical integration. In more complex scenarios, Markov chain Monte Carlo (MCMC) methods, such as Gibbs sampling or Metropolis-Hastings, are used to draw samples from the posterior, approximating the predictive distribution through Monte Carlo integration. Bayesian approaches, including vector autoregressions, have been widely adopted in economics since the 1980s for forecasting GDP growth, as pioneered by Litterman at the Federal Reserve to handle high-dimensional macroeconomic data.[18][19][20]
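As a minimal sketch of the quantile-loss idea described above, the example below fits a hypothetical linear model to synthetic heteroscedastic data by minimizing the pinball loss at two quantile levels, yielding an 80% prediction interval; the data, model form, and optimizer choice are all illustrative assumptions:

    # Sketch: quantile regression by minimizing the pinball (quantile) loss
    # for a linear model a + b*x on synthetic, heteroscedastic data.
    import numpy as np
    from scipy.optimize import minimize

    rng = np.random.default_rng(1)
    x = rng.uniform(0.0, 10.0, 200)
    y = 1.0 + 0.5 * x + rng.normal(0.0, 0.2 + 0.1 * x)   # noise level grows with x

    def pinball_loss(params, q):
        a, b = params
        resid = y - (a + b * x)
        return np.mean(np.maximum(q * resid, (q - 1.0) * resid))

    # Fit the 10th and 90th conditional quantiles to form an 80% prediction interval.
    for q in (0.1, 0.9):
        fit = minimize(pinball_loss, x0=[0.0, 0.0], args=(q,), method="Nelder-Mead")
        a, b = fit.x
        print(f"q = {q:.1f}: predicted quantile at x = 8 is {a + b * 8:.2f}")

Because the noise level grows with x, the fitted 0.1 and 0.9 quantile lines diverge, so the resulting interval widens where the outcome is more uncertain, which is exactly the heteroscedasticity that a single-variance parametric fit would miss.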