Fact-checked by Grok 2 weeks ago

Mid-range

In statistics, the , also known as the midrange, is a measure of defined as the of the minimum and maximum values in a . It provides a quick estimate of the central value by averaging the extremes, making it one of the simplest statistical measures to compute. The mid-range is calculated by adding the smallest and largest data points and dividing by 2, expressed as \frac{\min(X) + \max(X)}{2}, where X represents the . This method is particularly straightforward for small or ordered s, such as test scores or measurements, and is often used alongside other measures like the and . However, its sensitivity to outliers—where a single extreme value can skew the result significantly—limits its reliability compared to the or , rendering it prone to bias in distributions with anomalies. For instance, in a of {1, 2, 3, 4, 100}, the mid-range is 50.5, far from the more representative of 3. Despite these drawbacks, the mid-range remains useful in preliminary or when computational resources are limited, as it requires only identification of the extremes rather than all values. In certain contexts, such as survey scales, it may refer to the theoretical of a response (e.g., 4 on a 1–7 ), independent of actual responses, to assess neutrality. Overall, while not as robust as other measures, the mid-range offers a basic tool for summarizing data location, especially in educational or exploratory settings.

Definition and Basics

Definition as a Measure of Central Tendency

The mid-range, also known as the mid-extreme, is a measure of defined as the of the minimum and maximum values in a sample , providing a straightforward of the population's central . This approach leverages only the dataset's extremes to approximate , making it one of the simplest location statistics alongside the and . Originating in descriptive statistics, the mid-range emerged as a quick method to gauge central location by averaging extremes, with early references appearing in 19th-century statistical literature focused on practical data summarization. No single inventor is attributed to its formalization, as it evolved naturally from rudimentary averaging techniques in early statistical practice, predating more comprehensive measures like the full . As a location estimator, the mid-range distinctly emphasizes the dataset's boundaries, rendering it particularly sensitive to extreme values that can skew the estimate away from the true center. This sensitivity highlights its role in descriptive analysis where rapid assessment of spread-influenced centrality is prioritized over robustness.

Relation to Order Statistics and Range

In statistics, order statistics are the sorted values of a random sample X_1, X_2, \dots, X_n of size n from a , arranged in non-decreasing order as X_{(1)} \leq X_{(2)} \leq \cdots \leq X_{(n)}, where X_{(1)} denotes the sample minimum and X_{(n)} the sample maximum./06%3A_Random_Samples/6.06%3A_Order_Statistics) The sample range R is the length of the interval spanning these extremes, defined as R = X_{(n)} - X_{(1)}. The mid-range is the of this [X_{(1)}, X_{(n)}], expressed as \frac{X_{(1)} + X_{(n)}}{2}. This construction underscores the mid-range's reliance exclusively on the two extreme order statistics, effectively ignoring all intermediate sample values in its computation.

Calculation

Formula and Computation

The mid-range, denoted as M, is computed as the average of the sample minimum and maximum values, formally expressed using order statistics as
M = \frac{X_{(1)} + X_{(n)}}{2},
where X_{(1)} represents the smallest observation in the ordered sample and X_{(n)} the largest.
To compute the mid-range, identify the minimum X_{(1)} and maximum X_{(n)} in the dataset; then apply the averaging formula directly to these extremes. For edge cases, the mid-range is for an empty sample (n = 0), as no minimum or maximum exists; for a single-value sample (n = 1), it equals that value, since the minimum and maximum coincide.

Illustrative Examples

To illustrate the computation of the mid-range, consider a simple consisting of the numbers from 1 to 9: {1, 3, 5, 7, 9}. The minimum value is 1 and the maximum value is 9, so the mid-range is calculated as (1 + 9)/2 = 5. Another example involves a where an extreme value is present: {1, 2, 3, 4, 100}. Here, the minimum is 1 and the maximum is 100, yielding a mid-range of (1 + 100)/2 = 50.5. For a with an even number of , such as {2, 4, 6, 8}, the minimum is 2 and the maximum is 8, resulting in a mid-range of (2 + 8)/2 = 5.

Statistical Properties

Robustness to Outliers

The mid-range, defined as the of the sample minimum and maximum, demonstrates extreme sensitivity to outliers due to its reliance on only the two extreme order statistics. This lack of robustness is quantified by its breakdown point of 0, indicating that a single contaminated can cause the to produce arbitrarily large or small values, completely distorting the location estimate. A key aspect of this arises from the direct impact of an extreme value on the mid-range. If a consists of values clustered around a true \mu, and one deviates from \mu by a d (becoming the new minimum or maximum), the mid-range shifts by exactly d/2, as the estimator averages the unaffected extreme with the outlier. For instance, consider a sample of 10 values all equal to 5 (mid-range = 5); introducing an outlier of 15 changes the mid-range to 10 (average of 5 and 15). This linear propagation of the outlier's deviation halves the influence compared to the but still renders the mid-range unreliable for contaminated . In contrast, trimmed variants like the midhinge—the average of the 25th and 75th percentiles, equivalent to a 25% trimmed mid-range—improve robustness, achieving a point of 25%, though they sacrifice some in clean samples.

Efficiency Across Distributions

The mid-range serves as an unbiased of the for symmetric distributions, and its performance relative to the sample varies significantly depending on the underlying distribution's . For platykurtic distributions, such as the on [a, b], the mid-range is the uniformly minimum variance unbiased (UMVU) of the μ = (a + b)/2. In this case, its variance attains the Cramér-Rao lower bound among all unbiased , making it optimal and yielding an asymptotic relative (ARE) of 1 relative to the best possible unbiased ; consequently, it outperforms the sample , with relative exceeding 1 and increasing with sample size. In contrast, for mesokurtic distributions like , the sample is the efficient , achieving the Cramér-Rao bound. The mid-range converges at a slower rate of O_p(1/√(log n)) compared to the √n rate of the sample , resulting in an ARE of 0 relative to the sample . For leptokurtic distributions, which exhibit heavier tails than , the mid-range performs even more poorly due to greater influence from extreme order statistics, leading to an ARE less than that for case and approaching 0 asymptotically. The mid-range's suitability is thus highest for symmetric platykurtic cases like the [a, b], where the directly corresponds to the mid-point of the , allowing the to leverage the bounded extremes effectively. Efficiency is derived by comparing the asymptotic variances (or more generally, squared errors) of the mid-range and sample , adjusted for their respective rates; when rates differ, the relative reflects the ratio of sample sizes required to achieve equivalent , highlighting the mid-range's advantages in bounded-support scenarios and disadvantages in unbounded or heavy-tailed ones.

Sampling Properties and Variance

The mid-range M = \frac{X_{(1)} + X_{(n)}}{2}, where X_{(1)} and X_{(n)} are the sample minimum and maximum order statistics from a sample of size n, is an unbiased of the population for symmetric distributions. For distributions with finite support, such as the , the sample mid-range also unbiasedly estimates the population mid-range, which coincides with the . Under the U(0,1), the exact variance of the mid-range is given by \text{Var}(M) = \frac{1}{2(n+1)(n+2)}, derived from the known moments of the minimum and maximum order statistics, which follow distributions, and their . This variance decreases rapidly with n, reflecting the concentration of the extremes near 0 and 1. For the normal distribution N(\mu, \sigma^2), the variance of the mid-range is approximately \frac{\pi^2 \sigma^2}{24 \ln n} for large n, arising from the asymptotic of the normalized extremes, with the min and max being asymptotically independent and symmetric around \mu. A rough large-sample sometimes used is \frac{\sigma^2}{2n}, though the logarithmic term provides better accuracy as it captures the slower convergence due to the unbounded tails. In the , which has heavier tails than ( versus Gaussian), the variance of the mid-range is higher than in the normal case for comparable \sigma^2, as the extremes exhibit greater variability; exact expressions are more complex and typically obtained via of moments, but simulations confirm elevated variance relative to lighter-tailed distributions. The of the mid-range is approximately for large n, justified by the applied to the sum of the dependent extremes, whose joint distribution converges to a bivariate form that yields for their average after normalization. This asymptotic holds across common distributions, facilitating confidence intervals via M \pm z_{\alpha/2} \sqrt{\text{Var}(M)}.

Performance Characteristics

Behavior in Small Samples

In small samples, the mid-range demonstrates heightened sensitivity to the distributional shape, performing optimally as a estimator under conditions approximating uniformity. For the uniform distribution, the mid-range is more efficient than the sample , particularly for small to moderate sample sizes. The estimator's reliance on just two order statistics—the minimum and maximum—introduces substantial instability in small samples due to the high variability of these extremes, which are determined by only a few observations. This volatility is particularly pronounced as the number of data points is low, amplifying the impact of any single or random fluctuation on the result. For instance, with n=2, the mid-range simplifies to the of the two values, offering no benefit from interior points since none exist, and its variance matches that of the exactly. Monte Carlo simulations with up to 200,000 iterations reveal that in non- small samples, the mid-range exhibits greater uncertainty, with coverage factors increasing markedly (e.g., from 2.41 for at ν=16 to 3.96 for 50% Gaussian mixture), leading to overestimation of the 's spread relative to uniform conditions. These empirical findings underscore the mid-range's diminished reliability outside platykurtic settings, where deviations from uniformity inflate the standard deviation of the by factors exceeding 10 in some cases for n up to 20.

Bias and Deviation Metrics

The mid-range exhibits zero as an estimate of the population for symmetric distributions, such as the and distributions, where the expected values of the sample minimum and maximum are from the . In positively skewed distributions, the mid-range displays positive , being drawn toward the extreme in the longer right tail, while negatively skewed distributions induce negative toward the left tail extreme. A key deviation property of the mid-range is its minimax characteristic: it minimizes the maximum absolute deviation from any point in the sample, serving as the center of the smallest interval that encompasses all data points. The mean squared error of the mid-range estimator exceeds that of the sample mean for most distributions, including the normal, owing to its heightened sensitivity to outliers, which inflates its variance. For instance, with samples of size 100 from a standard normal distribution, the mid-range's variance is approximately 0.0925, compared to 0.01 for the sample mean.

Comparisons to Other Central Tendency Measures

The mid-range, calculated as the average of the minimum and maximum values in a , utilizes only two points out of n, making it computationally faster than the , which incorporates every observation by summing all values and dividing by n. However, this reliance on extremes renders the mid-range less stable, as it is highly sensitive to outliers that affect the minimum or maximum, whereas the distributes the impact of outliers across all points equally. For normally distributed , the exhibits superior , with relative efficiencies showing its variance as approximately 59% of the mid-range's in small samples from standard normal distributions. In contrast to the , which is the central and thus leverages the ranked positions of all observations to mitigate extreme values, the mid-range disregards the ordering of interior points beyond identifying the extremes. While both measures can demonstrate robustness in trimmed variants, the mid-range's dependence solely on boundary values makes it less effective against outliers compared to the , which remains stable in skewed or heavy-tailed distributions like the Cauchy. In Cauchy-distributed samples, the mid-range has variance, while the has finite variance, highlighting the median's advantage in such settings. The mid-range is preferable for rapid assessments in uniform distributions, where it achieves the lowest variance among central tendency measures, outperforming the by a factor of about 2.2 in relative efficiency for certain sample sizes. In general inferential contexts, however, the is favored for symmetric, light-tailed data like the normal, and the median for skewed or outlier-prone scenarios, as the mid-range's asymptotic lack of efficiency limits its broader applicability.

Applications and Limitations

Uses in Specific Distributions

In uniform distributions, the mid-range serves as the uniformly minimum variance unbiased estimator (UMVUE) of the population , given by μ = (a + b)/2, where a and b are the lower and upper bounds of the Uniform(a, b). This property arises because the order statistics X_{(1)} and X_{(n)} (the sample minimum and maximum) form a complete for the in this setting, and their average achieves the lowest variance among unbiased estimators. In for bounded processes, such as manufacturing tolerances where measurements are assumed to follow a due to uniform spread within specified limits, the mid-range provides a reliable estimate of the central value, aiding in process monitoring and adjustment. The mid-range contributes to descriptive summaries as a derived measure from the , which includes the minimum, first quartile (Q1), , third quartile (Q3), and maximum; specifically, it is computed as the of the minimum and maximum to offer a simple indicator. This makes it useful in for platykurtic distributions like the , where the data exhibit low and bounded support, allowing the mid-range to capture effectively without sensitivity to intermediate values. A practical real-world application involves estimating the length of physical measurements, such as component dimensions in , from a sorted sample assumed to follow a ; here, the mid-range of the extremes provides an unbiased and efficient approximation of the true length when the process operates within fixed tolerances.

Drawbacks and Alternative Approaches

The mid-range exhibits extreme sensitivity to outliers, as a single extreme value can arbitrarily distort the estimate by affecting the sample minimum or maximum, rendering it unsuitable for datasets with potential errors or . This vulnerability stems from its reliance on only two data points, ignoring the rest of the sample and leading to inefficiency as an for most real-world distributions that are not . Consequently, the mid-range lacks robustness for , with an asymptotic breakdown point of 0—the lowest possible value—making it prone to failure in the presence of even minimal . To mitigate these drawbacks, alternatives such as the trimmed mid-range (or midhinge), defined as the average of the first and third quartiles, offer improved robustness by excluding extreme values while maintaining reasonable efficiency for symmetric data. For symmetric distributions without outliers, the is generally preferred due to its optimal efficiency under normality, whereas the provides a more robust option for skewed distributions or outlier-prone data. When assessing spread rather than , the serves as a robust alternative to the full range, avoiding the influence of extremes. The mid-range should be avoided in large, outlier-prone datasets or non-platykurtic distributions, where its poor performance is exacerbated, and it has become outdated in modern statistical software that favors robust methods like the or trimmed estimators.

References

  1. [1]
    Range and mid-range (video) - Khan Academy
    Jun 10, 2012 · The midrange is the average of the largest and smallest data points. Range is an easy to calculate measure of variability, while midrange is an easy to ...
  2. [2]
    Definition and How to Find the Midrange - Statistics How To
    The midrange is a type of average, or mean. For example, “midrange” electronic gadgets are in the middle-price bracket: not cheap, but not expensive, either.
  3. [3]
    Mid Range - GeeksforGeeks
    Jul 23, 2025 · Midrange is a simple statistical measure used to identify the central tendency of a dataset. It is calculated as the average of the maximum and minimum values ...
  4. [4]
    Define mid-range in statistics and its application in data analysis.
    Mid-range is a statistical measure that represents the average of the maximum and minimum values in a data set. It is calculated by adding the highest and ...
  5. [5]
    midrange - APA Dictionary of Psychology
    Apr 19, 2018 · n. the average of the lowest and highest scores in a set of data. The midrange is a measure of central tendency more prone to bias than the mean ...Missing: definition encyclopedia
  6. [6]
    Mid-range - Statista Definition
    ### Summary of Mid-range Definition
  7. [7]
    Video: Midrange - JoVE
    Apr 30, 2023 · It is the value midway between the two extreme values and is typically defined as the arithmetic mean of the maximum and minimum data values.<|control11|><|separator|>
  8. [8]
    4.3: Measures of Central Tendency - Mathematics LibreTexts
    Aug 12, 2022 · Midrange. Midrange is the mean of the lowest value and the highest values in the sample data. Example 13.<|separator|>
  9. [9]
    [PDF] Measures of Central Tendency - MATH 130, Elements of Statistics I
    The Midrange. Definition. The midrange of a variable is the average of the lowest and highest values of the variable found in the data set. Page 29. Example.
  10. [10]
    Measures of central tendency | Australian Bureau of Statistics
    Feb 2, 2023 · A measure of central tendency (also referred to as measures of centre or central location) is a summary measure that attempts to describe a whole set of data
  11. [11]
    The Early History of Average Values and Implications for Education
    “This technique of averaging the extreme values of the range to obtain the arithmetic mean or mid-range ... In history, the midrange may be seen as a predecessor ...
  12. [12]
    [PDF] eisenhart.pdf - University of York
    Aug 24, 1971 · the midrange (i.e., the "arithmetic mean" between the extremes), or, in more recent times, the arithmetic mean of all of the observations ...
  13. [13]
    Lesson 2: Summarizing Data | Principles of Epidemiology | CSELS
    The midrange, which is based on the minimum and maximum values, is more sensitive to outliers than any other measures. The mode and median tend not to be ...Missing: estimator | Show results with:estimator
  14. [14]
    Order Statistic -- from Wolfram MathWorld
    Important functions of order statistics include the statistical range R=Y_N-Y_1, midrange MR=1/2(Y_1+Y_N), and statistical median x^~={Y_((N+1)/2) if N is odd.<|control11|><|separator|>
  15. [15]
    Order Statistics - an overview | ScienceDirect Topics
    The sample midrange, defined as (X(1) + X(n))/2, is also a measure of central tendency. Order statistics have wide applications in many areas where the use of ...
  16. [16]
    Midrange -- from Wolfram MathWorld
    Given order statistics Y_1=min_(j)X_j, Y_2, ..., Y_(N-1), Y_N=max_(j)X_j with sample size N, the midrange of the random sample is defined by MR=1/2(Y_1+Y_N) ...<|control11|><|separator|>
  17. [17]
    Stats: Measures of Central Tendency
    Midrange. The midrange is simply the midpoint between the highest and lowest values. Summary. The Mean is used in computing other statistics (such as the ...
  18. [18]
    [PDF] Maxbias Curves of Robust Location Estimators based on Subranges
    The maxbias of Tα increases only slightly with ε and stays bounded right up to the breakdown point. This doesn't hold for the α-midrange nor for the MVE ...<|control11|><|separator|>
  19. [19]
    The Midrange of a Sample as an Estimator of the Population ...
    Apr 11, 2012 · It is found that the midrange is more efficient than the mean for all of the populations considered, and that this efficiency increases with ...
  20. [20]
    [PDF] A Course in Large Sample Theory
    Asymptotic Relative Efficiency. If 0{ and 02 are two estimates of a ... ple 6 of Section 13) of the midrange such that the limit distribution exists ...
  21. [21]
    [PDF] COMPARISON OF CENTRALITY ESTIMATORS FOR SEVERAL ...
    purpose we obtain the variance of the midrange of a uniform distribution by using order statistics. Hence, using the previous notation for the uniform.Missing: range | Show results with:range
  22. [22]
    [PDF] OF DISCRETE UNIFORM DISTRIBUTIONS Elweod Lp Bcmbara ...
    x = l, 2, or 3, the mid·range is more efficient than the sample mean. The variance of the sample mean isig, while. I the variance of the mid—range isE;.*. The ...
  23. [23]
    The Future of Data Analysis - jstor
    Page 1. THE FUTURE OF DATA ANALYSIS'. BY JOHN W. TuKEY. Princeton University ... mid-range offers high-efficiency of point-estimation, while the normally ...
  24. [24]
    [PDF] Midrange as estimator of measured value for samples from ... - imeko
    Standard deviations of mean and of midrange as functions of numberν of degree of freedom. On the basis of N – 1 = ν pure samples of uniform distribution with a ...
  25. [25]
    Chapter 10 Point Estimation | Introduction to Statistical Thinking
    We get that the variance of the sample average is approximately equal to 0.02. The variance of the mid-range statistic is approximately equal to 0.185, more ...
  26. [26]
    Statistical Analysis Handbook 2024 edition - Dr M J de Smith
    The mid-range is the point for which the maximum deviation of a set of values is minimized (hence it is a form of minimax statistic).
  27. [27]
    1.3.5.1. Measures of Location - Information Technology Laboratory
    ... mean for data that are normal (or nearly so). The mid-range, since it is based on the two most extreme points, is not robust. Its use is typically ...
  28. [28]
    [PDF] Optimally estimating the sample mean from the sample size, median ...
    Sep 30, 2016 · When the sample size is small, however, a well-designed combination of the mid-range and the median may provide a better estimation of the ...
  29. [29]
    [PDF] Unbiased Estimation Lecture 15: UMVUE: functions of sufficient and ...
    Since the mse of any unbiased estimator is its variance, a UMVUE is ℑ-optimal in mse with ℑ being the class of all unbiased estimators. One can similarly define ...
  30. [30]
    New View of Statistics: Middle and Spread - Sportsci.org
    Jul 4, 2000 · The range is a bad measure of spread, for two reasons. First, it's dictated by outliers, whether they're errors in data entry or genuine values.
  31. [31]
    [PDF] Robust Estimation Techniques for Location Parameter ... - DTIC
    is Uniform the the midrange is an efficient estimator of ... Three popular statistics were also computed, the sample mean, the sample median, and the mid range.
  32. [32]
    Optimally estimating the sample mean from the sample size, median ...
    In fact, as mentioned in Triola, the mid-range is rarely used in practice as, from an asymptotic point of view, it lacks efficiency and robustness as an ...
  33. [33]
    Robust Regression and Outlier Detection - Google Books
    The aim of this book, the authors tell us, is 'to make robust regression available for everyday statistical practice.' Rousseeuw and Leroy have included all of ...Missing: mid- | Show results with:mid-