Mid-range
In statistics, the mid-range, also known as the midrange, is a measure of central tendency defined as the arithmetic mean of the minimum and maximum values in a data set.[1][2] It provides a quick estimate of the central value by averaging the extremes, making it one of the simplest statistical measures to compute.[1] The mid-range is calculated by adding the smallest and largest data points and dividing by 2, expressed as \frac{\min(X) + \max(X)}{2}, where X represents the data set.[3] This method is particularly straightforward for small or ordered data sets, such as test scores or measurements, and is often used alongside other central tendency measures like the mean and median.[4] However, its sensitivity to outliers—where a single extreme value can skew the result significantly—limits its reliability compared to the median or arithmetic mean, rendering it prone to bias in distributions with anomalies.[5] For instance, in a data set of {1, 2, 3, 4, 100}, the mid-range is 50.5, far from the more representative median of 3.[1] Despite these drawbacks, the mid-range remains useful in preliminary data analysis or when computational resources are limited, as it requires only identification of the extremes rather than all values.[2] In certain contexts, such as survey scales, it may refer to the theoretical midpoint of a response range (e.g., 4 on a 1–7 Likert scale), independent of actual responses, to assess neutrality.[6] Overall, while not as robust as other measures, the mid-range offers a basic tool for summarizing data location, especially in educational or exploratory settings.[7]Definition and Basics
Definition as a Measure of Central Tendency
The mid-range, also known as the mid-extreme, is a measure of central tendency defined as the arithmetic mean of the minimum and maximum values in a sample dataset, providing a straightforward estimator of the population's central location.[8][9] This approach leverages only the dataset's extremes to approximate the center, making it one of the simplest location statistics alongside the arithmetic mean and median.[10] Originating in descriptive statistics, the mid-range emerged as a quick method to gauge central location by averaging extremes, with early references appearing in 19th-century statistical literature focused on practical data summarization.[11][12] No single inventor is attributed to its formalization, as it evolved naturally from rudimentary averaging techniques in early statistical practice, predating more comprehensive measures like the full arithmetic mean.[11] As a location estimator, the mid-range distinctly emphasizes the dataset's boundaries, rendering it particularly sensitive to extreme values that can skew the estimate away from the true center.[5] This sensitivity highlights its role in descriptive analysis where rapid assessment of spread-influenced centrality is prioritized over robustness.[1]Relation to Order Statistics and Range
In statistics, order statistics are the sorted values of a random sample X_1, X_2, \dots, X_n of size n from a distribution, arranged in non-decreasing order as X_{(1)} \leq X_{(2)} \leq \cdots \leq X_{(n)}, where X_{(1)} denotes the sample minimum and X_{(n)} the sample maximum./06%3A_Random_Samples/6.06%3A_Order_Statistics) The sample range R is the length of the interval spanning these extremes, defined as R = X_{(n)} - X_{(1)}.[13] The mid-range is the midpoint of this interval [X_{(1)}, X_{(n)}], expressed as \frac{X_{(1)} + X_{(n)}}{2}.[14] This construction underscores the mid-range's reliance exclusively on the two extreme order statistics, effectively ignoring all intermediate sample values in its computation.[14]Calculation
Formula and Computation
The mid-range, denoted as M, is computed as the average of the sample minimum and maximum values, formally expressed using order statistics asM = \frac{X_{(1)} + X_{(n)}}{2},
where X_{(1)} represents the smallest observation in the ordered sample and X_{(n)} the largest.[15][16] To compute the mid-range, identify the minimum X_{(1)} and maximum X_{(n)} in the dataset; then apply the averaging formula directly to these extremes.[16][15] For edge cases, the mid-range is undefined for an empty sample (n = 0), as no minimum or maximum exists; for a single-value sample (n = 1), it equals that value, since the minimum and maximum coincide.[16]