Fact-checked by Grok 2 weeks ago

Multistage sampling

Multistage sampling is a probability sampling in statistics that involves selecting a sample from a large, often geographically dispersed by progressively dividing it into smaller groups or clusters across multiple stages, allowing for efficient data collection without requiring a complete list of all population members. This technique, also known as multistage cluster sampling, begins with the identification of primary sampling units (such as regions or districts), from which subsets are randomly selected; subsequent stages then involve further subdivision and random selection of secondary units (like neighborhoods or schools) and ultimately individual elements, ensuring representativeness while minimizing logistical challenges. It is particularly valuable in scenarios where constructing a full is impractical due to population size or dispersion, such as surveys or educational studies. The process enhances efficiency by leveraging natural groupings in the , reducing travel and administrative costs compared to simple random sampling, though it may require larger sample sizes to achieve comparable and can introduce clustering effects that affect variance . For instance, in , multistage sampling might involve randomly selecting provinces, then hospitals within those provinces, and finally patients from selected wards, facilitating studies on widespread populations like schoolchildren in large countries. Key advantages include cost-effectiveness and flexibility in applying different probability sampling techniques across stages, while disadvantages encompass potential biases from undercoverage and the need for complex statistical adjustments to account for design effects. Overall, multistage sampling balances practicality and statistical rigor, making it a of large-scale survey in fields ranging from social sciences to .

Definition and Fundamentals

Definition

Multistage sampling is a probability-based sampling method in which the population is sampled through a series of successive stages, starting with the selection of large primary units such as geographic clusters and progressively subdividing to smaller units until reaching the ultimate elements of interest for . This approach allows for the efficient selection of samples from complex, large-scale populations by breaking down the sampling process into manageable hierarchical levels, ensuring that each selected unit has a known probability of inclusion. The hierarchical structure of multistage sampling involves dividing the target into primary sampling units (PSUs), from which a random subsample is drawn; these PSUs are then further partitioned into secondary sampling units (SSUs), and the process repeats across additional stages as needed, with probability sampling applied at every level to support valid . For instance, in a two-stage , elements within the are first grouped into PSUs, and then individual SSUs are randomly selected from the chosen PSUs. This multi-level preserves the probabilistic nature of the sample while accommodating the nested organization of real-world populations. In contrast to single-stage sampling techniques, such as simple random sampling, which demand a complete and accessible listing every element, multistage sampling facilitates the handling of vast, dispersed populations by relying on partial frames at higher stages and avoiding the need for exhaustive . This distinction is particularly valuable in scenarios where logistical or resource constraints make full population listing infeasible, enabling researchers to approximate representativeness through staged probability selections. Multistage sampling presupposes a foundational understanding of probability sampling principles, such as random selection and inclusion probabilities, but it does not require the availability of a complete list of all elements upfront. represents a special case of this method, where all stages involve the selection of clusters and the final stage encompasses all elements within those clusters.

Key Concepts and Terminology

In multistage sampling, a core framework within probability sampling, the is divided into hierarchical levels of units that are successively selected across stages to reach the ultimate elements of interest. Primary Sampling Units (PSUs) represent the largest clusters selected in the initial stage of sampling, often corresponding to broad geographic areas, organizations, or administrative divisions that serve as the foundation for further subdivision. For instance, in national surveys, PSUs might consist of counties or enumeration areas, allowing researchers to manage large efficiently by first targeting these macro-level groups. Secondary Sampling Units (SSUs) are the subdivisions sampled within the selected PSUs during the second stage, providing a finer to the selection . Examples include neighborhoods or city blocks within chosen counties, or households within selected areas, enabling the sampling to narrow from broad regions to more specific locales. sampling units and higher-order units extend this further, involving additional subdivisions—such as individuals within households—until the final stage yields the ultimate elements, like survey respondents. Sampling at each stage can occur with replacement, where selected units are returned to the pool for potential reselection, or without replacement, where they are excluded after selection to avoid duplicates. With-replacement sampling simplifies certain probability calculations by treating selections as independent, though it may introduce ; without-replacement sampling, often preferred for , ensures distinct units but requires more complex adjustments in estimating selection probabilities across stages. Designs in multistage sampling may be self-weighting, where all ultimate elements have equal probabilities of selection, resulting in uniform weights without the need for post-sampling adjustments, or unequal probability designs, where selection chances vary by unit size or importance, necessitating inverse-probability weighting to achieve unbiased estimates. Self-weighting approaches promote simplicity in analysis, particularly when population strata are balanced, while unequal probability methods, such as probability proportional to size, better accommodate heterogeneous clusters but demand careful variance estimation.

Methodology

Sampling Stages

Multistage sampling proceeds through a series of sequential stages, where the is progressively divided into successively smaller units, allowing for efficient selection from large, dispersed populations without requiring a complete for all elements. In the first stage, the target is divided into primary sampling units (PSUs), which are typically large, naturally occurring clusters such as geographic areas, schools, or hospitals, and a subset of these PSUs is randomly selected, often using simple random sampling to ensure each has an equal probability of inclusion. This initial division and selection reduces logistical costs by focusing efforts on manageable portions of the while maintaining probabilistic representation. Subsequent stages involve further subdivision of the selected units from prior stages into smaller secondary sampling units (SSUs), such as neighborhoods within selected counties or households within blocks, followed by random from these SSUs to continue narrowing down to the final elements of interest, like individuals. For instance, in a two-stage , stage 2 would select SSUs directly from the chosen PSUs using random methods, whereas additional stages might subdivide SSUs into units if needed. This hierarchical ensures that the final sample reflects the population structure while controlling variance through controlled clustering. The number of stages in multistage sampling is typically two to four, determined by factors such as population size, geographic dispersion, and data accessibility, with fewer stages preferred for simpler populations to minimize design effects and more stages used for highly fragmented ones to balance precision and feasibility. Optimal stage determination involves assessing the trade-off between sampling efficiency and increased clustering variance, often guided by pilot studies or cost-precision models to select the configuration that achieves desired accuracy at minimal expense. At each stage, a —a list or roster of units within the selected clusters from the previous stage—is essential for probabilistic selection, enabling random draws without bias. Practical considerations include constructing these frames from available records like census data or local directories, and addressing non-response or incomplete frames by applying adjustments such as imputation or post-stratification weights to maintain representativeness, particularly in later stages where unit-level data may be sparse.

Selection and Implementation Procedures

In multistage sampling, random selection at each stage typically employs methods such as , , or to ensure probabilistic representation of the . SRS involves drawing units entirely at random from a complete list at the given stage, providing equal probability to all elements, while selects units at regular intervals from an ordered list after a random start, offering efficiency when lists are available. PPS adjusts selection probabilities based on a measure of unit size, such as or revenue, to account for variability and improve precision in heterogeneous populations. These methods are applied sequentially across stages, with the choice depending on the availability of sampling frames and the need to balance cost and accuracy. To handle unequal sizes among primary or subsequent units, is particularly useful, as it assigns higher selection probabilities to larger units, thereby balancing representation and reducing bias in estimates derived from clusters of varying scales. For instance, in area-based sampling, might use land area or dwelling counts as the size measure to select primary sampling units like census tracts, ensuring that more populous areas contribute proportionally without over- or under-sampling. This approach mitigates the effects common in clustered designs by weighting selections to reflect true population distributions. Software tools facilitate the implementation of these selection procedures, enabling automated random draws and design specification for multistage surveys. The 'survey' package supports multistage designs by allowing users to define cluster identifiers, , and unequal probabilities across stages, including via functions like svydesign(). Similarly, SAS's SURVEYSELECT procedure accommodates multistage sampling with options for , systematic, and methods, integrating and ing for complex designs. Specialized software like Blaise, a (CAPI) system, is widely used for field surveys involving multistage sampling, as it handles instrument programming and while supporting quality checks during implementation. Field implementation requires rigorous for enumerators to execute selections accurately and maintain procedural across stages. Enumerators are typically trained on sampling protocols, including how to identify and list units within selected clusters, apply random selection tools like random number generators, and document deviations to prevent non-response bias. Quality control measures, such as supervisor oversight, random verification of selections, and real-time data auditing, are integrated at each stage to minimize errors and ensure adherence to the design, with protocols emphasizing consistency in handling refusals or inaccessible units.

Types and Variations

Cluster Multistage Sampling

Cluster multistage sampling is a form of multistage sampling in which each successive stage involves the random selection of —naturally occurring groups of population elements that share similar characteristics, such as geographic proximity or organizational affiliation. The process begins with the identification of primary sampling units (PSUs) as , from which a random sample is drawn; subsequent stages then subsample secondary units (subclusters) within the selected PSUs, continuing hierarchically until the desired elements are reached. For instance, in a national health survey, the first stage might select counties as , the second stage neighborhoods within those counties, the third stage households, and the final stage individuals, ensuring all levels rely on selection without imposing to balance subgroups. This structure allows for efficient sampling frames at higher levels while enabling detailed subsampling at lower levels. This method is particularly suited for populations that are geographically dispersed or logistically challenging to enumerate comprehensively, such as rural communities, nationwide educational systems, or large corporate networks spanning multiple regions. By focusing efforts within selected clusters, it minimizes costs associated with and frame construction, making it feasible to survey vast areas where a complete list of all elements would be prohibitively expensive or impossible to obtain. It is commonly applied in agricultural, demographic, and studies where clusters align with administrative or natural boundaries, facilitating practical implementation without the need for full coverage. A key statistical consideration in cluster multistage sampling is the , which quantifies the efficiency loss due to intra-cluster —the tendency for elements within the same to exhibit greater similarity than those across different s. This inflates the sampling variance relative to simple random sampling () of the same size, often requiring sample sizes to be increased by a factor equal to the (typically 1.5 to 3 or higher, depending on the strength) to achieve comparable . The effect is more pronounced in designs with fewer, larger s or high homogeneity within s, underscoring the between logistical savings and statistical efficiency. The technique traces its origins to early 20th-century agricultural surveys, where the need for cost-effective on yields and characteristics drove innovations in probability sampling. Jerzy Neyman's seminal 1934 work laid foundational principles for cluster-based allocation, integrating with selection to enable unbiased in large-scale field studies, marking a shift from purposive to representative methods in survey practice. This evolution was pivotal for institutions like the U.S. Department of , which adopted such designs in to monitor economic and production data across dispersed farmlands.

Stratified Multistage Sampling

Stratified multistage sampling combines the principles of and multistage sampling by first partitioning the into homogeneous, mutually exclusive strata, often based on geographic regions, demographic characteristics, or other key variables, and then applying independent multistage sampling procedures within each stratum. This integration ensures of diverse subgroups while leveraging the efficiency of clustering at subsequent stages, such as selecting primary units like counties, followed by secondary units like neighborhoods, and finally individual elements. Allocation of sample sizes across strata can follow proportional methods, where the number of units selected from each is directly proportional to its size in the , or optimal approaches like Neyman allocation, which adjusts allocations based on both stratum size and internal variability to achieve greater . Neyman allocation, originally proposed in 1934, prioritizes larger samples in strata with higher variability to minimize overall sampling variance under equal costs per unit. This design is particularly beneficial for heterogeneous populations in national surveys, as stratification reduces sampling error by explicitly accounting for subgroup differences, leading to more accurate estimates and improved efficiency compared to unstratified multistage methods. For instance, in diverse contexts like , it enhances the reliability of inferences about parameters by ensuring adequate coverage of varied demographic or regional groups. A prominent example is the U.S. National Health Interview Survey (NHIS), which employs stratified multistage sampling with strata defined at the initial stage by states and demographic factors such as minority population concentrations (e.g., low, medium, high for , , and Asian groups). Within these strata, primary sampling units (PSUs) like counties are selected with probability proportional to size, followed by stratified blocks as secondary units and clustered households as tertiary units, enabling of underrepresented groups to improve precision in health estimates.

Advantages and Limitations

Advantages

Multistage sampling provides substantial cost and time efficiencies, especially in surveys requiring extensive fieldwork, by breaking down large into progressively smaller, geographically clustered units that minimize interviewer travel and logistical expenses. For instance, in face-to-face across wide areas, this approach focuses resources on selected clusters rather than the entire , significantly reducing operational costs compared to methods demanding nationwide dispersion. This method enhances feasibility for studying vast or dispersed populations by eliminating the need for a complete national , instead utilizing readily available administrative divisions such as provinces, districts, or villages as initial sampling units. This practicality is evident in international health surveys, where the recommends multistage cluster for its STEPS program to monitor risk factors across countries without exhaustive lists. Multistage sampling improves accuracy relative to single-stage methods by allowing refined selection and control at each successive stage, which mitigates and reduces overall design effects on variance estimates. This stepwise refinement enables more precise inferences while maintaining representativeness, particularly in heterogeneous settings. The technique's scalability supports its application across varying sizes and complexities, adapting the number of stages to balance precision and resources, as demonstrated in large-scale demographic and studies worldwide. Unlike random sampling, which proves logistically inefficient for broadly distributed groups, multistage designs facilitate manageable without compromising coverage.

Limitations and Challenges

Multistage sampling introduces higher variance in estimates compared to simple random sampling, primarily due to intra-cluster homogeneity, where elements within selected clusters tend to be more similar than those across the as a whole. This homogeneity leads to a greater than one, necessitating larger sample sizes to achieve equivalent , as the effective sample size is reduced by the clustering structure. The design and of multistage samples are inherently complex, requiring specialized expertise to properly account for the hierarchical structure and avoid issues such as undercoverage or non-response at successive stages. Decisions on sampling methods at each stage are subjective and demand rigorous justification to prevent flawed implementations that compromise validity. Additionally, the multi-level nature of the complicates statistical , often requiring advanced techniques to adjust for clustering effects. Potential biases arise if clusters selected at initial stages are not representative of the broader population, such as in cases of urban-rural imbalances or uneven geographic coverage, which can propagate through subsequent stages and result in undercoverage or . Non-response patterns that vary across clusters further exacerbate this risk, undermining the sample's generalizability. Multistage sampling imposes significant resource demands, including extensive for fieldworkers to navigate multiple selection stages and coordinate across dispersed clusters, which can prolong fieldwork timelines and increase overall costs despite initial efficiencies in frame construction. The need for larger samples to mitigate variance also heightens manpower and expertise requirements, making it particularly challenging for resource-limited studies.

Applications and Examples

Real-World Applications

Multistage sampling has been extensively adopted by national statistical offices for large-scale population surveys, enabling efficient data collection across vast geographic areas. The U.S. Census Bureau's (ACS), an ongoing annual survey of approximately 3.5 million addresses, utilizes a multistage probability sample with two phases, each comprising two stages, to select housing units and group quarters. In the first phase, addresses are systematically assigned to sub-frames and then sampled; the second phase targets nonrespondents for follow-up, ensuring representation of the civilian noninstitutionalized population while minimizing costs compared to full censuses. This approach, refined since the ACS's inception in the 1990s, supports timely estimates of demographic, social, economic, and housing characteristics at national, state, and local levels. In health and , multistage sampling facilitates representative in resource-constrained settings, particularly in developing countries. The (WHO), in collaboration with partners, implements the Demographic and Health Surveys (DHS) program, which employs a two-stage stratified design across over 90 countries. The first stage selects enumeration areas (typically villages or urban blocks) with probability proportional to size from national frames, stratified by urban/rural residence and regions; the second stage systematically samples 25-30 households per after listing. This method, used since the 1980s, has generated comparable on , , maternal and child health, and for more than 400 surveys, informing policy in low- and middle-income nations. Market research organizations apply multistage sampling to construct panels that capture diverse behaviors and preferences with geographic breadth. Panels are often built by first sampling large clusters such as regions or , followed by sub-sampling neighborhoods and households to recruit participants, ensuring cost-effective coverage of national markets. This technique supports ongoing tracking of purchasing patterns, , and , as seen in industry practices for scalable consumer insights. The adoption of multistage sampling accelerated post-World War II, driven by the need for efficient large-scale polling and amid expanding populations and limited resources. Originating in wartime probability sampling innovations, it became standard in the 1940s and 1950s for national surveys, such as those by starting in 1947 for income and agricultural data, where primary units like municipalities were selected proportionally to size before subsampling individuals. This evolution addressed the limitations of simple random sampling in dispersed populations, enabling broader institutional use in government and commercial polling by the mid-20th century.

Illustrative Examples

To illustrate the application of multistage sampling, consider a hypothetical scenario aimed at surveying educational outcomes among students aged 13–19 across a large , where the population is too dispersed for simple random sampling. Example 1: Student Sampling in a Country In this three-stage design, provinces serve as primary sampling units (PSUs). At Stage 1, 20 provinces are selected out of 100 using simple random sampling to ensure geographic representation. At Stage 2, within each selected province, 15 schools are chosen as secondary sampling units (SSUs) via probability proportional to size (), based on enrollment figures, yielding 300 schools total. Finally, at Stage 3, 40 students are randomly selected from each school, often after first sampling classes to facilitate access, resulting in a final sample of 12,000 students. This approach progressively narrows the focus from broad administrative divisions to individual respondents, minimizing travel costs while maintaining probability-based selection. Example 2: Household Survey For a national survey on living conditions, cities act as the first-stage PSUs. Suppose there are 50 major cities; 10 are selected using stratified random sampling to balance urban and rural influences. In Stage 2, 20 residential blocks are drawn from each chosen city as SSUs, typically via from municipal maps, for a subtotal of 200 blocks. Stage 3 involves selecting 15 dwellings per block and then interviewing all eligible residents within those dwellings, producing a sample of 3,000 households. This method leverages existing administrative hierarchies to make fieldwork feasible across vast areas. A practical for determining sample sizes in multistage designs is to select 20–30 PSUs at the first stage to achieve variance estimates without excessive costs, adjusting subsequent stages based on and needs. Hierarchical diagrams, depicting nested units like provinces containing schools or cities encompassing blocks, enhance comprehension of these structures by visually mapping the sampling progression.

Mathematical and Statistical Aspects

Probability Calculations

In multistage sampling, the probability framework begins with the calculation of inclusion probabilities at each stage, which determine the likelihood that a particular unit is selected into the sample. At the first stage, primary sampling units (PSUs) are selected from a larger of PSUs, typically using simple random sampling without replacement. The inclusion probability for a specific PSU, denoted as \pi_1, is given by \pi_1 = n_1 / N_1, where n_1 is the number of PSUs selected and N_1 is the total number of PSUs in the . This probability assumes equal chance of selection for all PSUs and forms the foundation for subsequent conditional selections. For subsequent stages, the inclusion probabilities are computed conditionally on the units selected in prior stages. In a two-stage , the joint inclusion probability for a secondary sampling unit (SSU) j within a selected PSU i, denoted \pi_{ij}, is the product of the first-stage probability for PSU i and the of selecting SSU j given that PSU i is in the sample: \pi_{ij} = \pi_i \cdot (n_{2i} / N_{2i}), where n_{2i} is the number of SSUs selected from PSU i and N_{2i} is the total number of SSUs in PSU i. This structure extends to additional stages, with each conditional probability reflecting the sampling fraction at that level within the previously selected units. When units have unequal sizes, adjustments are made using probability proportional to size () sampling, particularly at the first stage, to improve efficiency by increasing the selection chance for larger units. In , the inclusion probability for unit k, \pi_k, is proportional to its size measure s_k relative to the total size across all units: \pi_k \propto s_k / \sum s_k, often normalized such that the expected number of selections equals the sample size n_1. This approach ensures that larger PSUs, which may contain more elements of interest, are more likely to be included, though exact probabilities depend on whether sampling is with or without replacement. The overall inclusion probability for an ultimate element in the sample is the product of the conditional probabilities across all stages leading to that element. For instance, in a two-stage with at the first stage and simple random sampling at the second, the probability simplifies to a constant value independent of PSU size, such as \pi = (n_1 s_i / S) \cdot (n_2 / N_{2i}) = n_1 n_2 / S, where S = \sum s_i. This multiplicative property allows for the computation of sampling weights as the of these probabilities, facilitating unbiased in complex designs.

Estimation and Variance

In multistage sampling, unbiased estimation of population totals relies on the Horvitz-Thompson , which weights observed values by the inverse of their inclusion probabilities to account for the complex design. For a sampled element k with observed value y_k and first-order inclusion probability \pi_k—computed as the product of selection probabilities across all stages—the estimator for the Y is given by \hat{Y} = \sum_{k \in s} \frac{y_k}{\pi_k}, where s denotes the sample. This approach ensures unbiasedness under the design, as the E(\hat{Y}) = Y, even in unequal probability selections at multiple stages. Estimating the variance of \hat{Y} in multistage designs is challenging due to the nested structure and potential correlations within clusters. The linearization method approximates the variance by expanding the nonlinear estimator around its , treating it as a of elementary units while incorporating the design's and clustering. For a multistage sample, this yields an approximation of the form \widehat{\mathrm{Var}}(\hat{Y}) \approx \sum \widehat{\mathrm{Var}}_{\mathrm{stage}}, where terms capture variability at each stage, often using with-replacement assumptions for primary units and adjusting for finite corrections. This method is particularly effective for complex statistics like means or ratios, providing consistent variance estimates that reflect the full . The (deff) quantifies the efficiency of a multistage design relative to simple random sampling () of the same size, defined as \mathrm{deff} = \frac{\mathrm{Var}_{\mathrm{multistage}}(\hat{\theta})}{\mathrm{Var}_{\mathrm{SRS}}(\hat{\theta})}, where \hat{\theta} is the of interest (e.g., a ). Values greater than 1 indicate increased variance due to clustering, with typical deff ranging from 2 to 5 in two-stage cluster designs, depending on intracluster and cluster sizes; higher values signal greater inefficiency, necessitating larger samples for equivalent . Software tools facilitate these computations by incorporating sampling weights and design structures. In , the survey package supports multistage analysis through functions like svymean() for weighted means and svyvar() for variances, using svydesign() to specify clusters, strata, and probabilities at each stage, enabling Taylor linearization-based inference.

References

  1. [1]
    Multistage Sampling | Introductory Guide & Examples - Scribbr
    Aug 16, 2021 · In multistage sampling, or multistage cluster sampling, you draw a sample from a population using smaller and smaller groups (units) at each stage.Single-stage vs multistage... · Single-stage sampling · Multistage sampling
  2. [2]
    Multistage Sampling - an overview | ScienceDirect Topics
    Multistage sampling is defined as a form of cluster sampling that involves selecting samples in a series of steps from different levels of units, ...
  3. [3]
    Sampling methods in Clinical Research; an Educational Review - NIH
    Cluster sampling (Multistage sampling). It is used when creating a sampling frame is nearly impossible due to the large size of the population. In this ...
  4. [4]
    Sample Design Methods - Landscape Toolbox
    In two-stage sampling, elements of the population are grouped together into large groups called primary sampling units. The individual sampling units within ...
  5. [5]
    Sampling units - CCSG
    The term 'primary sampling units' (PSUs) refers to the sampling units chosen in the first stage of selection. The term 'secondary sampling units' (SSUs) refers ...
  6. [6]
    [PDF] Sampling, Sample Size and Planning for Data Collection
    Multistage sampling. ▷ When design is complex and efficient to select units in stages. ▷ Randomly select primary sampling units at 1st stage. Example:.
  7. [7]
    NHANES Tutorials - Sample Design Module
    Rather, a complex, multistage, probability sampling design is used to select participants representative of the civilian, non-institutionalized US population.
  8. [8]
    [PDF] Methods and Applications Lecture 1: Complex Survey Sampling
    ▻ Cluster sampling: All units in the population are aggregated into larger units called clusters, known as primary sampling units. (PSUs), and clusters are ...
  9. [9]
    [PDF] 7 CLUSTER SAMPLING AND SYSTEMATIC SAMPLING
    The primary sampling units may be of different sizes. That is, the numbers of secondary sampling units in the primary sampling units are not all the same.
  10. [10]
    [PDF] Analysis of Large-Scale Social Surveys1 - Columbia University
    Aug 10, 2000 · Units selected at the rst stage of sampling are called primary sampling units (PSUs). Units selected at the nal stage are called ultimate ...Missing: definition | Show results with:definition
  11. [11]
    On the non‐recursive implementation of multistage sampling without ...
    Oct 20, 2021 · The hierarchical nesting of units leads to a tree whose nodes correspond to sampling units (primary sampling units, secondary sampling units and ...
  12. [12]
    [PDF] SAS/STAT - Introduction to Survey Sampling and Analysis Procedures
    Multistage Sampling. In multistage sampling, you select an initial (first-stage) sample that is based on groups of elements in the population, which are ...
  13. [13]
    Encyclopedia of Survey Research Methods - Multi-Stage Sample
    A multi-stage sample is one in which sampling is done sequentially across two or more hierarchical levels, such as first at the county ...
  14. [14]
    3. Chapter 3: Sampling Methods
    In reality, however, cluster/multistage sampling can be boiled down to the successive drawing of random samples from populations that are large and widely ...
  15. [15]
    Design Effects and Effective Sample Size
    Sep 29, 2025 · The design effect (deff) is the ratio of the variance of a survey statistic under the complex design to the variance of the survey statistic under an SRS.
  16. [16]
    [PDF] History and Development of the Theoretical Foundations of Survey ...
    Neyman (1934) broke out of this sampling straitjacket with his theories of stratified sampling with "optimal" allocation and cluster sampling with ratio ...
  17. [17]
    None
    ### Summary of Stratified Multistage Sampling from NIST Document
  18. [18]
    Sample Size: Stratified Sample - Stat Trek
    How to calculate sample size for each stratum of a stratified sample. Covers optimal allocation and Neyman allocation. Sample problem illustrates key ...
  19. [19]
    [PDF] A Simple Method of Exact Optimal Sample Allocation under ...
    Aug 21, 2014 · Neyman, J. (1934). “On the Two Different Aspects of the Representative Method: The Method of. Stratified Sampling and the Method of Purposive ...<|separator|>
  20. [20]
    Introducing an efficient sampling method for national surveys with ...
    Jul 17, 2021 · As opposed to SRS, stratified sampling is usually used to increase the efficiency of sampling designs [4, 5]. Stratified sampling classifies a ...
  21. [21]
    [PDF] Design and Estimation for the National Health Interview Survey ...
    NHIS is based on a stratified multistage sample design. The design's specific parameters, however, have changed over time; a new sample design is implemented ...<|control11|><|separator|>
  22. [22]
    National Health Interview Survey (NHIS) - Health, United States - CDC
    NHIS uses a stratified, multistage sample design that permits the representative sampling of households and noninstitutional group quarters. The sample design ...
  23. [23]
    Introduction to Survey Sampling - Cluster and Multi-Stage Sampling
    Sample design is key to all surveys, fundamental to data collection, and to the analysis and interpretation of the data. Introduction to Survey Sampling, ...
  24. [24]
    [PDF] pdf - SIAP
    ➢ As the name suggests, Multi-stage sampling involves multiple stages of sampling. ➢ The number of stages can be numerous, although it is rare to have more ...
  25. [25]
    [PDF] Section 2: Preparing the Sample Overview
    Jan 26, 2017 · Multi-stage cluster sampling is one of the most common sample designs for national surveys and it is the recommended method for most STEPS ...
  26. [26]
    [PDF] Design Effects for Totals in Multi-Stage Samples
    This approach takes advantage of the fact that the variance of the estimator ... Kish, L. (1965) Survey Sampling. New York: Wiley. Park, I., and Lee, H ...
  27. [27]
    (PDF) Multistage sampling - ResearchGate
    Aug 6, 2025 · Multi-stage sampling is a method of obtaining a portion of a population by segmenting it into progressively smaller groups until a desirable ...
  28. [28]
    Cluster and Multistage Sampling - Sage Research Methods
    If, as generally applies, the clusters are more internally homogeneous than would occur from a random allocation, the cluster means will be correspondingly more ...Missing: intra- | Show results with:intra-<|separator|>
  29. [29]
    Multistage Sampling: Definition, Guide, Examples | Appinio Blog
    Aug 1, 2024 · Multistage sampling is a sophisticated and flexible sampling technique that involves selecting samples in multiple stages.
  30. [30]
    [PDF] Chapter 4: Sample Design and Selection - Census.gov
    Nov 28, 2022 · The combined. MAF/TIGER database is used both for sampling and tabulation for the ACS as well as most. Census surveys and data products. The MAF ...
  31. [31]
    [PDF] Sampling and Household Listing Manual - The DHS Program
    DEMOGRAPHIC AND HEALTH SURVEYS SAMPLING POLICY ... The sampling procedures used in DHS surveys are usually complex involving multi-stage.
  32. [32]
    [PDF] The rise of survey sampling - CBS
    During the Second World War, market research activities almost come to a halt, but they were resumed at a large scale after the war. In 1946 a panel of ...Missing: adoption | Show results with:adoption
  33. [33]
    Fifty Years of Survey Sampling in the United States - jstor
    The first phase, which began in the middle 1930s, in- volved the development of basic probability sampling theory, the ac- ceptance of probability sampling as ...
  34. [34]
    [PDF] Designing Household Survey Samples: Practical Guidelines
    Serve as a practical guide for survey practitioners in designing and implementing efficient household sample surveys; c. Illustrate the interrelationship of ...<|separator|>
  35. [35]
    [PDF] R Companion for - Sampling: Design and Analysis, Third Edition
    1 Jun 2020 · Rescaled bootstrap for stratified multistage sampling. Survey Method ... Teaching survey sampling with the `sampling' R package. In.
  36. [36]
    3.2.2 Probability sampling - Statistique Canada
    Sep 2, 2021 · This is known as sampling with probability proportional to size (PPS). With this method, the bigger the size of the unit, the higher the chance ...
  37. [37]
    [PDF] Linearization Variance Estimators for Survey Data
    In this paper, a new approach to deriving Taylor linearization variance estimators is proposed. It leads directly to a variance estimator which satisfies the ...
  38. [38]
    [PDF] Sample size and design effect
    Mar 24, 2001 · This presentation is a brief introduction to the design effect, which is an adjustment that should be used to determine survey sample size.<|control11|><|separator|>