Mean absolute percentage error
The mean absolute percentage error (MAPE) is a widely used statistical metric for assessing the accuracy of forecasting models in fields such as time series analysis, econometrics, and operations research, representing the average magnitude of errors as a percentage of the actual values.[1] It is formally defined by the formula \text{MAPE} = \frac{100}{n} \sum_{t=1}^{n} \left| \frac{A_t - F_t}{A_t} \right| where n denotes the number of observations, A_t is the actual value at period t, and F_t is the corresponding forecasted value.[1] This formulation ensures that errors are normalized relative to the actual observations, yielding a scale-independent measure that facilitates comparisons across datasets with varying units or magnitudes.[2] MAPE gained prominence through empirical studies like the M-competitions, which evaluated forecasting methods and highlighted its interpretability as a relative error in percentage terms, often ranging from 0% (perfect accuracy) to higher values indicating poorer performance.[1] Despite its popularity in business and academic applications for its intuitive output, MAPE has notable drawbacks: it becomes undefined or infinite when actual values are zero, introduces asymmetry by penalizing over-forecasts more severely than under-forecasts of the same absolute magnitude, and can be overly sensitive to small actual values, leading to unstable results in intermittent or low-volume data scenarios.[3] These limitations have prompted the development of alternatives, such as the symmetric mean absolute percentage error (sMAPE) or mean absolute scaled error (MASE), which address bias and scale issues while retaining relative interpretability.[1]Definition and Basic Concepts
Mathematical Definition
The mean absolute percentage error (MAPE) is a measure of prediction accuracy that expresses the average absolute error as a percentage of the actual values. It is defined mathematically as \text{MAPE} = \frac{100}{n} \sum_{i=1}^{n} \left| \frac{A_i - F_i}{A_i} \right|, where A_i represents the actual value for the i-th observation, F_i is the corresponding forecasted or predicted value, and n is the total number of observations.[4] The absolute value in the formula ensures that the error is always non-negative, regardless of whether the forecast over- or underestimates the actual value, while the division by A_i normalizes the error relative to the actual value. The sum of these relative errors is then averaged across all n observations and scaled by 100 to express the result as a percentage. This formulation assumes all A_i > 0 to avoid division by zero.[4] To illustrate the computation, consider a small dataset with two observations: actual values A_1 = 100, A_2 = 200; forecasted values F_1 = 110, F_2 = 180.- For the first observation: \left| \frac{100 - 110}{100} \right| = 0.10.
- For the second observation: \left| \frac{200 - 180}{200} \right| = 0.10.
- Sum of absolute relative errors: $0.10 + 0.10 = 0.20.
- Average: \frac{0.20}{2} = 0.10.
- MAPE: $100 \times 0.10 = 10\%.