Fact-checked by Grok 2 weeks ago

Stratified sampling

Stratified sampling is a probability sampling method in statistics used to select a representative sample from a population by first dividing the population into distinct, non-overlapping subgroups, or strata, based on shared characteristics such as age, gender, income, or location, and then randomly sampling from each stratum in proportion to its size or according to a specified allocation. This approach ensures that all relevant subgroups are adequately represented in the sample, thereby reducing sampling error and improving the precision of estimates compared to simple random sampling, particularly when the population exhibits high variability within subgroups. The process begins with identifying key stratifying variables that capture important heterogeneity in the , followed by partitioning the into mutually exclusive . Samples are then drawn independently from each using random selection techniques, with sample sizes determined either proportionately (reflecting the stratum's share of the ) or disproportionately (to oversample underrepresented groups or for greater in specific ), with appropriate to maintain unbiased estimates. Common allocation strategies include proportional allocation for unbiased overall estimates and optimal allocation, such as Neyman allocation, which minimizes variance by considering both stratum sizes and within-stratum variability. For instance, in a survey, the might be stratified by geographic region and age group to ensure balanced representation across diverse demographics. Stratified sampling offers several advantages, including enhanced statistical efficiency through reduced sampling variance, guaranteed inclusion of minority subgroups, and the ability to provide separate estimates for each stratum, which is valuable for subgroup analysis. However, it requires prior knowledge of the population's composition to define effective strata, can be more complex and costly to implement than simpler methods, and may not effectively reduce sampling error if strata are poorly chosen or if the stratifying variable does not correlate well with the study outcomes. Compared to cluster sampling, it typically yields more precise results for heterogeneous populations but demands more upfront planning. The method was formalized in its modern form by in 1934, who demonstrated the theoretical foundations for optimal sample allocation to minimize estimation errors in stratified designs, building on earlier ideas in representative sampling from the early . Today, stratified sampling is widely applied in fields like survey research, , quality control, and simulations, where accurate representation across diverse population segments is critical.

Fundamentals

Definition

Stratified sampling is a probability sampling in which the population of is divided into distinct, non-overlapping subgroups known as strata, based on one or more stratification variables, and then independent random samples are drawn from each stratum. This approach ensures that the sample reflects the 's diversity by capturing representation from each subgroup proportionally or according to a specified allocation. Unlike simple random sampling, which selects units directly from the entire without regard to internal structure, stratified sampling leverages prior about the to improve representativeness and . The key components of stratified sampling include the total N, which represents the number of units; the number of strata K, denoting the subgroups formed; the size of each N_h for h (where h = 1, 2, \dots, K); and the sampling fraction within each , typically denoted as n_h / N_h, where n_h is the sample size drawn from h. A fundamental prerequisite is that the strata must be mutually exclusive, meaning no unit belongs to more than one , and collectively exhaustive, ensuring all population units are included in some . This partitioning allows for targeted sampling that accounts for heterogeneity across groups while maintaining the essential to probability-based . The relationship among these components is expressed mathematically as the total population size equaling the sum of the stratum sizes: N = \sum_{h=1}^{K} N_h This formula underscores the exhaustive coverage of the population by the strata, forming the basis for subsequent sampling and estimation procedures.

Comparison to simple random sampling

Simple random sampling (SRS) treats the entire population as homogeneous, selecting a single random sample where each unit has an equal probability of inclusion without regard for subgroups, which can lead to underrepresentation of rare or small subgroups in heterogeneous populations. In contrast, stratified sampling divides the population into mutually exclusive and exhaustive strata based on relevant characteristics and then samples proportionally from each stratum, ensuring representation across all subgroups and thereby reducing sampling error in diverse populations. This approach controls variability by homogenizing groups within strata while capturing differences between them, resulting in greater precision and lower variance for estimates compared to SRS when using the same sample size. Stratified sampling was developed in the early 20th century, notably through Jerzy Neyman's 1934 work on representative methods, to address biases in agricultural experiments and census surveys where populations varied by region, soil type, or demographics.

Design and Implementation

Stratum formation

Stratum formation is the initial step in stratified sampling, where the target population is partitioned into mutually exclusive and collectively exhaustive subgroups known as strata. Effective strata are designed to enhance the of estimates by ensuring homogeneity within each stratum—meaning low variability in the key variable of interest among units—and heterogeneity between strata, which captures significant differences across groups. This approach reduces the overall sampling variance compared to simple random sampling, as units within a stratum are more similar, allowing for more efficient representation of the . To form strata, researchers typically rely on auxiliary variables that are correlated with the study variable and readily available for the entire , such as demographic factors (e.g., or levels) or spatial attributes (e.g., ). These variables enable the division of the into non-overlapping categories that cover all units without omission or duplication; for instance, a might be stratified by brackets (low, medium, high) to reflect varying economic behaviors. The choice of auxiliary variables is critical, as they must be measurable from the and relevant to the research objectives to avoid introducing bias. Forming strata presents several challenges, including the substantial cost associated with acquiring a comprehensive that includes the necessary auxiliary information for all units. Additionally, arbitrary or poorly defined boundaries can lead to misclassification errors, where units are incorrectly assigned, potentially undermining the homogeneity goal and increasing variance. Obtaining accurate frame data often requires administrative or censuses, which may not always be up-to-date or complete, further complicating the process. A practical guideline for the number of strata K is to select a modest number that provides sufficient detail without risking empty or overly small strata, which could inflate variance; for small-scale surveys, K between 4 and 6 is often recommended to balance gains in against . Cochran noted that beyond approximately six strata, additional divisions yield in efficiency for many populations.

Sampling strategies

In stratified sampling, the core procedure involves independently drawing samples from each predefined using probability-based methods, typically simple random sampling, to ensure representation proportional to the stratum's characteristics. Once the has been divided into homogeneous strata, a —a complete list of units—is obtained for each stratum h, from which n_h units are selected randomly, either with or without . This independent selection within strata allows for tailored sampling efforts that account for variability across groups, as originally outlined in the foundational framework for probability-based stratified designs. A common strategy is proportional allocation, where the sample size for each stratum h is determined such that \frac{n_h}{n} = \frac{N_h}{N}, with n as the total sample size, N_h as the , and N as the total ; this ensures the sample mirrors the population's stratum proportions, reducing when stratum sizes differ significantly. Variations on the basic random selection include equal allocation, in which n_h = \frac{n}{K} for all K strata regardless of their population sizes, which is particularly useful when the goal is to compare strata directly or when variability is similar across groups, though it may oversample small strata. Another variation employs within each stratum, suitable for ordered lists like or geographic sequences: after a random starting point, every k-th unit is selected, where k = \frac{N_h}{n_h}, offering efficiency over simple random sampling when the frame lacks inherent randomness but maintaining approximate randomness if the ordering avoids periodicity. In practice, after random selection, non-response is addressed by adjusting sampling weights within each stratum, often by inflating the weights of respondents by the inverse of the stratum-specific response rate to compensate for missing units and preserve representativeness. For instance, if the response rate in stratum h is r_h, the adjusted weight for responding units becomes w_h = \frac{1}{r_h} \times \frac{N_h}{n_h}, ensuring unbiased estimates when non-response is assumed ignorable within strata.

Sample size allocation

In stratified sampling, determining the appropriate sample size for each , denoted n_h, is crucial for achieving efficient while meeting overall survey objectives. Allocation balance the total sample size n across strata to minimize variance or incorporate practical constraints, assuming the is divided into H strata with sizes N_h and total size N = \sum N_h. Proportional allocation assigns sample sizes in proportion to the stratum's share of the , given by the n_h = n \cdot \frac{N_h}{N}. This assumes equal variability across strata and ensures the sample mirrors the , which simplifies and reduces in overall estimates. Disproportionate allocation deviates from to improve precision for specific subgroups, such as by small or rare that may have higher variability. For instance, the sample size can be set proportional to the product of stratum size and standard deviation, n_h \propto N_h S_h, where S_h is the within-stratum standard deviation; this allocates more resources to heterogeneous to enhance subgroup estimates without inflating overall variance excessively. Neyman allocation provides an optimal disproportionate strategy for a fixed total sample size n, minimizing the variance of the when costs are equal across strata. The is n_h = n \cdot \frac{N_h S_h}{\sum_{k=1}^H N_k S_k}, which prioritizes larger, more variable strata but requires prior knowledge or estimates of S_h from pilot studies or historical data. This approach, introduced by Neyman in , can substantially reduce variance compared to proportional allocation in populations with unequal stratum variances. Practical considerations often modify these allocations, such as budget constraints that limit total n or vary costs c_h per unit across strata, leading to adjusted formulas like n_h \propto N_h S_h / \sqrt{c_h} to optimize under fixed expenditure. Differential response rates, which may be lower in certain strata due to or reluctance, necessitate those groups to achieve effective sample sizes post-collection. If prior stratum sizes or variabilities are unknown at the design stage, post-stratification serves as an by applying weights after sampling via random selection, effectively mimicking stratified allocation without upfront decisions.

Statistical Analysis

Population mean estimator

In stratified sampling, the unbiased estimator for the population mean \bar{Y} is constructed by taking a weighted average of the sample means from each stratum, where the weights reflect the relative sizes of the strata in the population. Specifically, the estimator is given by \hat{\bar{Y}}_{st} = \sum_{h=1}^H W_h \bar{y}_h, where H is the number of strata, W_h = N_h / N is the weight for stratum h (with N_h denoting the of stratum h and N = \sum N_h the total ), and \bar{y}_h = \frac{1}{n_h} \sum_{i=1}^{n_h} y_{hi} is the simple random sample mean within stratum h (with n_h the sample size from that stratum and y_{hi} the observed values). This form ensures that the overall estimate respects the population structure by upweighting strata that comprise a larger proportion of the total population. The estimator \hat{\bar{Y}}_{st} is unbiased for the true population mean \bar{Y}, meaning E(\hat{\bar{Y}}_{st}) = \bar{Y}. To see this, note that within each stratum h, the sample mean \bar{y}_h is an unbiased of the stratum population \bar{Y}_h under simple random sampling, so E(\bar{y}_h) = \bar{Y}_h. Substituting into the estimator yields E(\hat{\bar{Y}}_{st}) = \sum_{h=1}^H W_h E(\bar{y}_h) = \sum_{h=1}^H W_h \bar{Y}_h = \bar{Y}, since \bar{Y} is itself the population-weighted average of the stratum . This unbiasedness holds regardless of the specific sample sizes n_h chosen for each stratum. The standard weighting scheme uses the known proportions W_h = N_h / N for all forms of stratified sampling, including cases of disproportionate allocation where the sample sizes n_h are not proportional to N_h (e.g., to account for varying stratum variability or costs). Alternative normalizations, such as adjusting weights based on sampling fractions, are not required for the unbiased of the , as the proportions directly the correct ; the choice of n_h influences but not the form of the point estimate. Compared to the simple random sampling estimator \bar{y}, which treats the entire as homogeneous, \hat{\bar{Y}}_{st} is generally more precise when the population exhibits heterogeneity across strata, as it explicitly accounts for differences to reduce error.

Variance estimation

In stratified sampling, the variance of the stratified mean \hat{\bar{Y}}_{st} quantifies the in the estimate and is derived from the within-stratum variances. The exact , accounting for the finite correction, is \operatorname{Var}(\hat{\bar{Y}}_{st}) = \sum_{h=1}^H W_h^2 \left(1 - f_h\right) \frac{S_h^2}{n_h}, where W_h = N_h / N is the weight of stratum h, f_h = n_h / N_h is the sampling fraction in stratum h, S_h^2 is the variance within stratum h, n_h is the sample size in stratum h, N_h is the in stratum h, and N is the total . This expression arises because the strata are sampled independently, allowing the total variance to be the weighted sum of individual stratum variances. Since the true stratum variances S_h^2 are unknown, an unbiased of the variance is used post-sampling: \hat{\operatorname{Var}}(\hat{\bar{Y}}_{st}) = \sum_{h=1}^H W_h^2 \left(1 - f_h\right) \frac{s_h^2}{n_h}, where s_h^2 = \frac{1}{n_h - 1} \sum_{i=1}^{n_h} (y_{hi} - \bar{y}_h)^2 is the unbiased sample variance within h, and \bar{y}_h is the sample in h. This is unbiased because the of each s_h^2 equals S_h^2, and the finite correction $1 - f_h is known from the . At least two observations per are required for s_h^2 to be defined. When the population sizes N_h are large relative to the sample sizes (i.e., n_h \ll N_h, so f_h \approx 0), the finite population correction can be ignored, simplifying the formulas to \operatorname{Var}(\hat{\bar{Y}}_{st}) \approx \sum_{h=1}^H W_h^2 \frac{S_h^2}{n_h}, \quad \hat{\operatorname{Var}}(\hat{\bar{Y}}_{st}) \approx \sum_{h=1}^H W_h^2 \frac{s_h^2}{n_h}. This approximation is common in survey practice where exhaustive population listing is impractical. Confidence intervals for the population mean are typically constructed using the normal approximation, especially when sample sizes are large: \hat{\bar{Y}}_{st} \pm z_{\alpha/2} \sqrt{\hat{\operatorname{Var}}(\hat{\bar{Y}}_{st})}, where z_{\alpha/2} is the from the standard (e.g., 1.96 for a 95% interval). For smaller samples, a t-distribution with approximated may be used, but the normal approximation suffices when n_h \geq 30 per .

Optimal allocation methods

Optimal allocation methods in stratified sampling seek to distribute the total sample size across strata to minimize the variance of the stratified population mean estimator \hat{\bar{Y}}_{st} for a fixed total sample size n, or to minimize variance subject to a fixed budget when sampling costs vary by stratum. These methods build on knowledge of stratum sizes N_h and standard deviations S_h, prioritizing allocation to strata with larger contributions to overall variability. Neyman allocation, originally derived by , achieves the minimum possible variance of \hat{\bar{Y}}_{st} under a fixed n by setting the sample size in stratum h as n_h = n \frac{N_h S_h}{\sum_k N_k S_k}. This formula arises from minimizing the variance expression \text{Var}(\hat{\bar{Y}}_{st}) = \sum_h W_h^2 \frac{S_h^2}{n_h} (1 - f_h), where W_h = N_h / N is the stratum weight and f_h = n_h / N_h is the sampling fraction, subject to the constraint \sum_h n_h = n. The derivation employs the of Lagrange multipliers, leading to the proportionality of n_h to N_h S_h, which allocates more samples to larger and more variable strata. When sampling costs c_h differ across strata, optimal allocation minimizes \text{Var}(\hat{\bar{Y}}_{st}) subject to a total cost constraint C = \sum_h c_h n_h. Using Lagrange multipliers on the variance formula under this budget, the solution yields n_h proportional to \frac{N_h S_h}{\sqrt{c_h}}. This adjustment favors strata with high variability relative to their sampling cost, ensuring efficient resource use while controlling variance. Power allocation extends Neyman allocation by setting n_h proportional to (N_h S_h)^p, where p is a power parameter between 0 and 1 that balances optimality and robustness. For p=1, it recovers Neyman allocation; for p=0.5, it simplifies to proportionality with \sqrt{N_h S_h}, which reduces sensitivity to errors in S_h estimates while still prioritizing variable strata. This family of methods, introduced by Bankier, is particularly useful in surveys requiring reliable subnational estimates alongside national ones. These optimal methods require estimates of S_h for each , often obtained from pilot surveys or historical , which can introduce if inaccurate. Misestimation of variances particularly affects Neyman allocation, potentially increasing the actual variance beyond that of simpler proportional methods.

Practical Considerations

Advantages

Stratified sampling offers increased precision in estimates compared to simple random sampling () by dividing the into homogeneous strata, which reduces the overall sampling variance. This occurs because within-strata variability is lower than the total variability, leading to more accurate estimates, particularly in heterogeneous populations. For instance, empirical studies have demonstrated variance reductions in controlled experiments when using stratified sampling over . A key advantage is ensuring representation of all relevant subgroups, including minority or underrepresented populations, which might miss entirely due to . By guaranteeing a minimum sample size in each , stratified sampling avoids zero-sample issues for rare groups and provides proportional or targeted based on stratum sizes. This is particularly beneficial in diverse populations, such as ensuring adequate sampling of ethnic minorities in educational or surveys. Stratified sampling also enables improved estimates for subgroups without relying on pooling data from the entire sample. Researchers can compute direct inferences, such as effect sizes, for each stratum independently, allowing for stratum-specific as if separate studies were conducted. This facilitates detection of differences across groups and enhances the reliability of subgroup comparisons. In terms of cost efficiency, stratified sampling can reduce the total sample size required to achieve a desired level, especially through optimal allocation methods that concentrate sampling effort where variability is highest. This leads to lower costs while maintaining or improving estimate accuracy relative to . For example, in a stratified by major, proportional allocation ensures efficient representation without . Additionally, stratified sampling provides administrative benefits by facilitating in clustered or geographically dispersed . By defining strata based on or organizational units, it simplifies , such as targeting surveys to specific regions or clusters, thereby streamlining fieldwork and .

Disadvantages

Stratified sampling requires the development of a complete that includes variables for every unit in the , which can be costly and time-intensive to compile, particularly for large or dispersed . For instance, obtaining detailed auxiliary information on all units to define homogeneous strata often involves significant efforts, such as accessing administrative records or conducting preliminary surveys. The design of stratified sampling is inherently complex, necessitating statistical expertise to select appropriate stratification variables and allocate sample sizes effectively across strata. Poor choices, such as using irrelevant variables for stratification, can result in strata that are not sufficiently homogeneous, thereby increasing the overall sampling variance compared to simpler methods. This complexity arises during stratum formation, where defining non-overlapping groups demands careful consideration to avoid inefficiencies. A key in stratified sampling is the occurrence of empty strata, where no units are selected from a particular (n_h = 0), especially in small overall samples or when strata are numerous and some are rare. This leads to for those subgroups, potentially compromising the representativeness of the sample and requiring special imputation or adjustment techniques to estimate parameters. Non-response poses additional challenges in stratified sampling, as differential response rates across strata can introduce into estimates unless explicitly modeled and corrected through or other adjustments. For example, if certain strata experience higher non-response due to issues, the resulting sample may over- or under-represent those groups, distorting overall inferences. Finally, stratified sampling faces scalability limitations for very large or dynamic populations, where maintaining an up-to-date becomes impractical without ongoing resource investment. In rapidly changing environments, such as online user bases or mobile , outdated frames can lead to coverage errors, making the method less viable compared to more adaptive sampling approaches.

Illustrative example

Consider a hypothetical of 1000 students at a , where the goal is to estimate the average grade point average (GPA). The is stratified by grade level into three strata: freshmen (N_1 = 300), sophomores (N_2 = 400), and juniors/seniors (N_3 = 300). The stratum weights are thus W_1 = 0.3, W_2 = 0.4, and W_3 = 0.3. To implement proportional allocation, a total sample size of n = 100 is selected, with n_1 = 30, n_2 = 40, and n_3 = 30 drawn via simple random sampling () within each . The stratum sample means are \bar{y}_1 = 2.8, \bar{y}_2 = 3.1, and \bar{y}_3 = 3.4. The stratified of the population mean is then \hat{\bar{Y}}_{st} = \sum_{h=1}^3 W_h \bar{y}_h = 0.3 \times 2.8 + 0.4 \times 3.1 + 0.3 \times 3.4 = 3.10. For comparison, the same total sample size of 100 drawn via from the entire yields a sample of 3.05, which is slightly less precise due to not accounting for grade-level differences. To illustrate the , assume the within-stratum variances are S_h^2 = 0.22 for each (a reasonable value for GPA data on a 4.0 ). With proportional allocation and ignoring the finite correction for simplicity (valid for large N_h), the approximate variance of the stratified is V(\hat{\bar{Y}}_{st}) \approx \frac{1}{n} \sum_{h=1}^3 W_h S_h^2 = \frac{0.22}{100} = 0.0022. The variance is S^2 = \sum W_h (\mu_h - \mu)^2 + \sum W_h S_h^2, where the stratum means \mu_h match the sample means and \mu = 3.1, yielding a between-stratum component of 0.054 and S^2 = 0.274. The variance is then approximately S^2 / n = 0.00274. Thus, stratified sampling reduces the variance by about 20% relative to ($0.0022 / 0.00274 \approx 0.80).

Extensions and Applications

Disproportionate stratified sampling

Disproportionate stratified sampling refers to a variation of stratified sampling in which the sample sizes allocated to each , denoted as n_h, are intentionally set to be unequal to the strata's proportions in the . This method deliberately deviates from proportional allocation by certain strata, such as smaller or underrepresented groups, to achieve greater analytical precision for those specific subgroups. The primary rationale for disproportionate stratified sampling is to address challenges in estimating parameters for rare events or subgroups with low population representation, such as disease prevalence in low-incidence areas, where standard proportional sampling might yield insufficient observations for reliable inference. It is also beneficial when strata exhibit substantial differences in variability, enabling allocations that prioritize higher-variance strata to reduce the overall sampling error more effectively than proportional methods. To adjust for the unequal sampling fractions in estimation, sampling weights are applied based on the inclusion probabilities within each . The unbiased for the population mean \bar{Y} is given by \hat{\bar{Y}}_{st} = \sum_{h=1}^H W_h \bar{y}_h, where W_h = N_h / N represents the population proportion of stratum h (N_h is the stratum size and N is the total population size), and \bar{y}_h is the sample mean within stratum h. Equivalently, this can be expressed using per-unit weights proportional to N_h / n_h, ensuring the estimator remains unbiased despite the disproportionality. Practical examples include surveys that high-income households to gain deeper insights into premium consumer behaviors, despite their small share, and evaluation that minority groups to assess program impacts more accurately. In health , such as the on Ethics Research and Clinical Care (CERC) study, disproportionate sampling enriched the dataset with racial/ethnic minorities (e.g., Black participants, who comprised 9.5% of the ) to enable robust subgroup analyses. Despite these benefits, disproportionate stratified sampling has notable drawbacks, including the potential to inflate the overall variance of estimates if allocations are not carefully optimized for or , and the added of post-sampling , which demands accurate of sizes and can complicate analysis. Misclassification of strata, such as in electronic health record-derived categories, may further results unless addressed through design-based methods.

Applications in survey research

Stratified sampling has been instrumental in survey research since its formal introduction by Jerzy Neyman in 1934. This method addressed the inefficiencies of simple random sampling in heterogeneous populations, marking a milestone in representative sampling for empirical studies. In national censuses, stratified sampling enhances accuracy by partitioning populations into geographic and demographic strata, allowing for targeted sampling within subgroups to reflect diverse characteristics such as urban-rural divides or ethnic compositions. The U.S. Census Bureau routinely employs this approach in surveys like the Rental Housing Finance Survey, where strata are defined by census regions, states, urban-rural status, and counties to ensure representative coverage and reduce sampling error. Similarly, in market research, stratified sampling by consumer segments—such as age, income, or region—enables precise estimation of preferences and behaviors; for instance, Nielsen ratings stratify TV households using multi-stage cluster and stratified techniques to mirror national demographics, providing reliable audience metrics for over 41,000 sampled households. As of 2025, Nielsen has transitioned to a 'Big Data + Panel' approach, combining stratified panel data with big data sources to enhance accuracy in mirroring national demographics. Public health surveys leverage stratified designs, often combined with clustering, to assess coverage in varied settings; the World Health Organization's vaccination cluster surveys, for example, implicitly stratify by urban-rural status to estimate rates, revealing disparities such as lower coverage in rural areas due to barriers. In environmental monitoring, stratification by habitat types supports estimates, as seen in vegetation surveys where plots are allocated across , , and strata to capture gradients and inform priorities. Modern adaptations integrate stratified sampling with for dynamic stratification, where algorithms refine strata in based on streaming data to handle unbalanced or evolving populations, improving prediction accuracy in large-scale surveys. Implementation is facilitated by software tools like R's survey package, which supports of stratified designs by incorporating sampling weights and strata variables to compute unbiased estimates and variances for complex survey data.

References

  1. [1]
    Stratified Sampling: Definition, Advantages & Examples
    Stratified sampling is a method of obtaining a representative sample from a population that researchers divided into subpopulations.
  2. [2]
    Stratified Sampling | Definition, Guide & Examples - Scribbr
    Sep 18, 2020 · In stratified sampling, researchers divide subjects into subgroups called strata based on characteristics that they share.
  3. [3]
    Stratified Sampling - an overview | ScienceDirect Topics
    Stratified sampling is defined as a method that involves dividing a total pool of data into distinct subsets (strata) and then conducting randomized sampling ...
  4. [4]
    [PDF] neyman-1934-jrss.pdf
    On the Two Different Aspects of the Representative Method: The Method of Stratified. Sampling and the Method of Purposive Selection. Author(s): Jerzy Neyman.
  5. [5]
    [PDF] Chapter 7. Sampling Techniques - University of Central Arkansas
    For a stratified random sample, the population is divided into groups or strata. A random sample is selected from each stratum based upon the percentage that ...
  6. [6]
    [PDF] Chapter 4: Stratified Random Sampling
    Stratified random sampling divides a population into non-overlapping groups (strata), then selects a simple random sample from each stratum.
  7. [7]
    Sampling methods in Clinical Research; an Educational Review - NIH
    Stratified random sampling. This method is a modification of the simple random sampling therefore, it requires the condition of sampling frame being available, ...
  8. [8]
    [PDF] Chapter 4 outline - Stat@Duke
    Stratified random sampling. Divide population in strata and do simple random sampling within each stratum. Advantages: ▷ Have individual estimates for each ...
  9. [9]
    On the Two Different Aspects of the Representative Method : The ...
    Jerzy Neyman; On the Two Different Aspects of the Representative Method : The Method of Stratified Sampling and the Method of Purposive Selection, Journal.
  10. [10]
    Design Information for Survey Procedures
    For more detailed information, refer to Cochran ... To improve precision, units within strata should be as homogeneous as possible for the characteristics of ...
  11. [11]
    [PDF] Stratified telephone survey designs
    basic sampling frame (to avoid complications of unequal probability of ... The cost of stratification is a fixed cost and therefore will reduce the ...
  12. [12]
    Survey design and analysis considerations when utilizing ... - NIH
    Jul 11, 2021 · We recommend implementing disproportionate stratified sampling over simple random sampling even if the sampling strata are misclassified.
  13. [13]
    A note on the optimum stratification of populations for estimating the ...
    Aug 6, 2025 · ... sampling frame. The function 'optimizeStrata' available in package ... cost of stratification is high compared with that of ranking.
  14. [14]
  15. [15]
    [PDF] Chapter 4 Stratified Sampling - IIT Kanpur
    Based on different ideas, some allocation procedures are as follows: 1. Equal allocation. Choose the sample size i n to be the same for all the strata. Draw ...<|separator|>
  16. [16]
    [PDF] Using Calibration Weighting to Adjust for Nonresponse Under a ...
    Calibration weighting ensures that sample-estimated totals of certain calibra- tion or benchmark variables match previously determined population totals. Two.
  17. [17]
    6 Part A: Stratified Sampling - STAT ONLINE
    Stratified sampling partitions a population into non-overlapping groups (strata), and a sample is selected within each stratum.Missing: definition | Show results with:definition
  18. [18]
    [PDF] 3 STRATIFIED SIMPLE RANDOM SAMPLING
    Thus, every possible stratified SRS having stratum sample sizes n1,··· ,nH has the same probability of being selected. 3.1 Estimation of yU and t. • Because a ...Missing: N_h | Show results with:N_h
  19. [19]
    Section 4. Sample size determination - Statistique Canada
    Dec 15, 2020 · If the survey study has a budget constraint, the sample size allocation can be optimized by minimizing the variance of the estimator with a ...
  20. [20]
    Reducing Variance with Sample Allocation Based on Expected ...
    Aug 9, 2021 · This paper demonstrates that the sample allocation that takes the expected response rates (ERRs) into account has certain advantages over other approaches.
  21. [21]
    None
    ### Summary of Variance Formulas for Stratified Mean Estimator from Cochran's Sampling Techniques (Chapter 5: Stratified Random Sampling, Section on Variance)
  22. [22]
    Power Allocations: Determining Sample Sizes for Subnational Areas
    In this article, a simple allocation method is suggested for achieving reliable national and subnational estimates. KEY WORDS: Neyman allocation; Optimal ...
  23. [23]
    [PDF] Improving the Sensitivity of Online Controlled Experiments
    stratified sampling achieves variance reduction compared to simple random sampling because it removes the variation of the sample size from each stratum ...
  24. [24]
    Stratified Sampling - Encyclopedia of Research Design
    Proportionate stratified sampling can provide an increase in precision (meaning a reduction in error) relative to simple random sampling, if variability within ...
  25. [25]
    8 Sampling and Massive Data - The National Academies Press
    Stratified sampling, case-based sampling, adaptive sampling, and length ... sampling frame is unknown. Some populations, like that of people with ...
  26. [26]
    Sampling in Developmental Science: Situations, Shortcomings ... - NIH
    One of the benefits of clustered and stratified sampling designs is that, relative to simple random sampling, the recruitment costs and efforts are often lower ...
  27. [27]
    Sage Research Methods - Stratified Random Sampling
    Stratification of target populations is extremely common in survey sampling. Stratified sampling techniques are often used when designing ...
  28. [28]
    [PDF] Statistical inference with empty strata in judgment post stratified ...
    Jul 11, 2016 · Abstract This article develops estimators for certain population characteristics using a judgment post stratified (JPS) sample.Missing: n_h= | Show results with:n_h=
  29. [29]
    Post-stratification or non-response adjustment? - Survey Practice
    Jul 31, 2016 · Non-response adjustments are applicable when your sample comes with auxiliary variables available for both respondents and non-respondents. Then ...
  30. [30]
    Chapter 8 Stratified Sampling | STAT392
    ... Optimal allocation gives the lowest possible variance at a fixed survey cost. Note Neyman allocation is the case where the cost is equal in each stratum.<|control11|><|separator|>
  31. [31]
    How Stratified Random Sampling Works, With Examples
    Stratified random sampling is the process of creating subgroups in a dataset according to various factors such as age, gender, income level, or education.What Is Stratified Random... · Simple vs. Stratified · Proportionate vs... · Advantages
  32. [32]
    Chapter 5 Choosing the Type of Probability Sampling
    In proportionate stratified sampling, the number of elements allocated to the various strata is proportional to the representation of the strata in the target.
  33. [33]
    [PDF] Introduction to Probability Sampling - UCSF Prevention Science
    – Each population member has a known, non-zero chance of inclusion. – Sample members are drawn with a random selection mechanism. • Characteristics.
  34. [34]
    [PDF] Fundamentals of Applied Sampling - Berkeley SDA
    Section 5.4 introduced disproportionate stratified sampling, in which we divide a sampling frame into several strata and sample the strata at different rates.Missing: rationale drawbacks
  35. [35]
    Survey design and analysis considerations when utilizing ...
    Jul 11, 2021 · We recommend implementing disproportionate stratified sampling over simple random sampling even if the sampling strata are misclassified.
  36. [36]
    Randomization | The Abdul Latif Jameel Poverty Action Lab
    Controlled randomized experiments were invented by Charles Sanders Peirce and Joseph Jastrow in 1884. Jerzy Neyman introduced stratified sampling in 1934.
  37. [37]
    2021 RHFS Survey Methodology - U.S. Census Bureau
    Sep 24, 2025 · Within each stratum, eligible buildings were sorted by geographic variables, including census region, state, urban/rural status, county, and zip ...
  38. [38]
    Sampling Estimation & Survey Inference - U.S. Census Bureau
    Jul 16, 2025 · Survey sampling helps the Census Bureau provide timely and cost efficient estimates of population characteristics.
  39. [39]
    Media Ratings and Target Marketing - Sage Publishing
    Instead, Nielsen breaks down the complexity of sampling TV households by using two sampling techniques: multi-stage cluster sam- pling and stratified sampling ...
  40. [40]
    [PDF] WORLD HEALTH ORGANIZATION VACCINATION COVERAGE ...
    Implicitly stratify the sample. In this example, the survey designers have decided to stratify the sample implicitly by urban/rural status. That is, they ...
  41. [41]
    Techniques for Sampling Habitat – Monitoring Animal Populations ...
    Develop a system of habitat stratification that you expect will coincide with species habitat requirements. Use maps, aerial photographs, or satellite imagery ...
  42. [42]
    Essential environmental variables to include in a stratified sampling ...
    Sep 1, 2019 · The objective of stratification for vegetation surveys is to incorporate those habitat types that show the most meaningful association with the ...
  43. [43]
    (PDF) Stratified Sampling-Based Deep Learning Approach to ...
    Oct 21, 2023 · Stratified sampling approaches are employed to generate the necessary samples for accurate prediction, and the precise samples with computed ...
  44. [44]
    Analyze Survey Data: Stratified Random Sampling Design
    The examples below highlight how to create a complex sample survey design object and then directly query specific coefficients, error terms, and other survey ...Missing: implementation | Show results with:implementation