Probability-proportional-to-size sampling
Probability proportional to size (PPS) sampling is a probability-based sampling technique used in survey methodology where the probability of selecting a population unit is directly proportional to a predetermined measure of its size, such as population count or economic value, to improve efficiency in estimating population parameters.[1] This method was first formally introduced by Morris H. Hansen and William N. Hurwitz in their 1943 paper on sampling from finite populations, where they proposed PPS with replacement to allow unbiased estimation of totals using the Hansen-Hurwitz estimator.[2] Subsequent developments, including the Horvitz-Thompson estimator for PPS without replacement in 1952, extended its applicability to single-stage and multistage designs, particularly in cluster sampling where larger clusters receive higher selection probabilities while maintaining equal probabilities for ultimate elements through fixed subsampling.[1][3] PPS sampling is especially valuable in scenarios with heterogeneous unit sizes, such as national health surveys or business establishment frames, as it reduces variance in estimates compared to equal-probability sampling by allocating more resources to larger units.[4][5] Key implementation steps involve calculating cumulative sizes, determining a sampling interval, and selecting units via systematic random starts, enabling straightforward computation of inclusion probabilities and weights for unbiased inference.[3] Advantages include enhanced precision for skewed distributions and simplified fieldwork logistics in multistage surveys, though challenges such as requiring accurate size measures and handling without-replacement complexities persist.[1][5]
Definition and Basics
Definition
Probability-proportional-to-size (PPS) sampling is a probability-based sampling technique where the probability of selecting a population unit into the sample is directly proportional to a specified measure of its size, such as the number of elements it contains, its economic value, or an auxiliary variable like revenue. This approach assigns higher inclusion probabilities to larger units, which enhances the efficiency of estimating population totals or means by focusing selection efforts on units that contribute more substantially to the aggregate quantities of interest.[1][6] Mathematically, the first-order inclusion probability \pi_i for unit i is defined as \pi_i = n \cdot \frac{\text{size}_i}{\sum_{j=1}^N \text{size}_j}, where n denotes the desired sample size, \text{size}_i is the size measure for unit i, and \sum_{j=1}^N \text{size}_j is the total size measure across all N units in the population. This formulation ensures that the expected number of units selected equals n, while larger units are oversampled relative to their proportion in an equal-probability scheme. The size measure must be accurately known or reliably estimated from the sampling frame to implement PPS effectively.[2]
The primary purpose of PPS sampling is to mitigate inefficiencies in simple random sampling when dealing with heterogeneous or skewed populations, where a small number of large units account for a disproportionate share of the total variability or aggregate value, thereby reducing the variance of estimators without increasing sample size. In contrast to equal-probability methods, PPS leverages auxiliary size information to achieve greater precision, particularly when the size measure correlates positively with the study variable.[2][1]
PPS sampling was developed in the 1940s by statisticians Morris H. Hansen and William N. Hurwitz as part of advancements in survey methodology, initially applied to improve estimation in agricultural and economic surveys involving finite populations with varying unit sizes. Their foundational work established PPS as a key tool for handling unequal unit contributions in practical sampling scenarios.[2]
Key Concepts
Probability-proportional-to-size (PPS) sampling modifies equal-probability sampling by assigning selection probabilities to population units in proportion to a chosen size measure, thereby reducing sampling error when estimating totals or means in populations with substantial variability in unit sizes.[7] This adjustment leverages auxiliary information about unit sizes to overweight larger units, which are presumed to contribute more to the population total, leading to more efficient estimators compared to simple random sampling.[8] For instance, if the size measure correlates strongly with the study variable, the variance of the estimator can approach zero when the variable is exactly proportional to the size.[7]
Size measures in PPS sampling are auxiliary variables that quantify the relative importance or scale of each unit and must be positively correlated with the variable of interest to achieve efficiency gains.[8] Common examples include population counts for clusters, such as the number of students in school classes or residents in geographic areas; revenue figures for businesses; or land area for agricultural plots.[8][7] These measures are typically obtained from pre-survey data sources, such as administrative records or censuses, to compute selection probabilities prior to sampling.[8] Size variables can be continuous, such as exact revenue amounts, or discrete, such as rounded counts of elements like employees or households, with the choice depending on data availability and the nature of the population frame.[7] Accurate auxiliary data is essential for defining these sizes, as it directly influences the proportionality of selection probabilities.[2]
As a probability-based method, PPS sampling ensures design unbiasedness for appropriate estimators when selection probabilities are correctly specified based on the size measures.[2] However, if the size variable lacks positive correlation with the study variable, efficiency diminishes through increased variance, though unbiasedness is preserved provided the probabilities reflect the true design.[8][7]
Sampling Procedures
With-Replacement Sampling
In probability-proportional-to-size (PPS) sampling with replacement, each unit in the population is assigned a selection probability equal to its size measure divided by the total size measure across all units, and selections are made independently for a fixed sample size n, permitting the possibility of duplicate selections.[2] This approach ensures that larger units have a higher chance of being selected in each draw, while the independence of draws simplifies the sampling process compared to without-replacement variants.[9] The standard procedure for implementing PPS with replacement employs the cumulative total method, which facilitates efficient selection based on precomputed size accumulations. Consider a population of N units, each with a known positive size measure x_i > 0 for i = 1, \dots, N, and let T = \sum_{i=1}^N x_i denote the total size. The steps are as follows:
- List the units in any arbitrary order and compute the cumulative size sums: S_0 = 0 and S_j = \sum_{i=1}^j x_i for j = 1, \dots, N, so that S_N = T.
- For each of the n independent draws (k = 1, \dots, n), generate a uniform random variate u_k \sim U(0, T).
- Select the unit i_k as the smallest index j such that S_j \geq u_k; the probability of selecting unit i in any single draw is then p_i = x_i / T.
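The cumulative total method described above can be sketched in Python. This is a minimal illustration, not code from the source; the function name `pps_with_replacement` and the use of the standard-library `bisect` module for the search are implementation choices.

```python
import bisect
import random

def pps_with_replacement(sizes, n, rng=random):
    """Draw n unit indices PPS with replacement via the cumulative total method."""
    # Step 1: cumulative size sums S_1, ..., S_N; the last entry equals T.
    cumulative = []
    total = 0.0
    for x in sizes:
        total += x
        cumulative.append(total)
    sample = []
    for _ in range(n):
        # Step 2: draw u_k ~ U(0, T) independently for each of the n draws.
        u = rng.uniform(0.0, total)
        # Step 3: select the smallest index j with S_j >= u_k, so that
        # unit i is chosen with probability p_i = x_i / T.
        sample.append(bisect.bisect_left(cumulative, u))
    return sample
```

With sizes [10, 30, 60], each draw returns index 2 with probability 0.6; the binary search makes each draw O(log N) after the single O(N) cumulative pass.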
Without-Replacement Sampling
In probability-proportional-to-size (PPS) sampling without replacement, distinct units are selected from a finite population such that the inclusion probability of each unit is proportional to its size measure, ensuring a fixed sample size n with no duplicates. This method is essential for applications where redundant selections would waste resources, particularly when n is a meaningful proportion of the population size N. The procedure relies on ordered selection techniques to maintain proportionality while enforcing uniqueness.[11] The general procedure involves ordering the population units by their size measures to create a cumulative total frame, which facilitates systematic draws. A random starting point u is selected uniformly from [0, X/n], where X is the total size, and subsequent points are chosen at regular intervals equal to X/n. A unit is selected at each point by finding the smallest index j such that the cumulative size S_j \geq the point value. This systematic PPS approach, often attributed to early models of unequal probability sampling, guarantees exactly n unique units while approximating the desired inclusion probabilities. For smaller samples, Brewer's method offers a straightforward extension for n > 2, pairing units and using adjusted joint probabilities to select without replacement, as detailed in foundational work on systematic unequal probability designs.[12][13] The algorithm typically proceeds in three steps:
- First, assign initial inclusion probabilities \pi_i = n \cdot (x_i / X) for each unit i, where x_i is the size measure and X = \sum x_i, ensuring \pi_i \leq 1.
- Second, construct an ordered list of units (e.g., by increasing size) and employ a pivotal or sequential draw method to select units one at a time, rejecting any already chosen and renormalizing the remaining probabilities.
- Third, apply ordering adjustments, such as stratification by size, to preserve the proportional structure across the sample.
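The systematic selection just described (random start in the first interval, fixed interval X/n, cumulative sizes) can be sketched as follows. This is an illustrative implementation, not from the source; `systematic_pps` is a hypothetical name, and the sketch assumes every size satisfies x_i \leq X/n so that \pi_i = n \cdot x_i / X \leq 1 and no unit can be hit by two selection points.

```python
import random

def systematic_pps(sizes, n, rng=random):
    """Systematic PPS without replacement: random start, fixed interval X/n."""
    total = sum(sizes)                  # X, the total size
    interval = total / n                # sampling interval X/n
    # Assumption: all x_i <= X/n; otherwise a unit with pi_i > 1 must be
    # taken with certainty before applying this scheme.
    assert max(sizes) <= interval, "requires pi_i = n * x_i / X <= 1 for all units"
    start = rng.uniform(0.0, interval)  # random start u in [0, X/n)
    points = [start + k * interval for k in range(n)]
    sample, cum, j = [], 0.0, 0
    for p in points:
        # Advance to the smallest index j whose cumulative size S_j >= p.
        while cum + sizes[j] < p:
            cum += sizes[j]
            j += 1
        sample.append(j)
    return sample
```

Because the selection points are spaced exactly X/n apart and no unit's size exceeds that interval, the n selected indices are distinct, and unit i is included with probability n \cdot x_i / X.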
Size-based ordering, as a core concept in PPS designs, underpins these steps by aligning the frame with the probability structure.[14] Key variants include systematic PPS with a random start and fixed interval X/n, which is computationally efficient for ordered frames, and rejective sampling schemes that generate multiple candidate samples proportional to size and accept only those with exactly n distinct units, as cataloged in comprehensive reviews of unequal probability methods. Such variants, including those building on Brewer's framework, keep inclusion probabilities close to their targets and support unbiased estimation without complex adjustments.[15] In finite populations, without-replacement PPS offers an advantage over with-replacement alternatives by reducing sampling variance when n is relatively large compared to N, as it maximizes the diversity of selected units and avoids inefficient multiple inclusions.[11]
Estimation and Properties
Unbiased Estimators
In probability-proportional-to-size (PPS) sampling, unbiased estimators for population parameters are constructed using inverse inclusion probabilities to correct for the unequal selection chances of units based on their sizes. The Horvitz-Thompson (HT) estimator, adapted for PPS designs without replacement, provides an unbiased estimate of the population total Y = \sum_{i=1}^N y_i by weighting each observed value y_i in the sample by the inverse of its inclusion probability \pi_i. Specifically, the estimator is given by \hat{Y} = \sum_{i \in s} \frac{y_i}{\pi_i}, where s denotes the sample. This formulation ensures unbiasedness under the sampling design, as the expected value E(\hat{Y}) = Y, since E\left( \frac{y_i}{\pi_i} \mathbf{I}_i \right) = y_i for the indicator \mathbf{I}_i of unit i's inclusion.[16] In PPS without replacement, the inclusion probabilities \pi_i are proportional to the unit sizes x_i, typically \pi_i = n \cdot \frac{x_i}{\sum x_j} under certain approximations, though exact forms depend on the selection procedure; the HT estimator directly incorporates these \pi_i as weights, maintaining unbiasedness without requiring joint inclusion probabilities \pi_{ij} for the point estimate itself (though they are used in variance estimation).[17] For the population mean \bar{Y} = Y / N, where N is the known population size, the unbiased estimator is simply \bar{\hat{Y}} = \hat{Y} / N. This follows directly from the unbiasedness of \hat{Y}, yielding E(\bar{\hat{Y}}) = \bar{Y}.[16] In PPS sampling with replacement, the Hansen-Hurwitz estimator is employed instead, drawing n independent samples where each unit i has selection probability p_i proportional to its size x_i. The estimator for the total is \hat{Y}_{HH} = \frac{1}{n} \sum_{k=1}^n \frac{y_k}{p_k}, where the sum accounts for possible duplicates by including y_k / p_k for each draw k; averaging over the draws ensures unbiasedness, with E(\hat{Y}_{HH}) = Y, as each term's expectation is \sum_i y_i. 
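Both point estimators follow directly from their formulas and can be sketched in a few lines; the function names below are illustrative, not from the source.

```python
def horvitz_thompson_total(y, pi):
    """Horvitz-Thompson estimator of the population total for a
    without-replacement sample: sum of y_i / pi_i over sampled units."""
    return sum(yi / p for yi, p in zip(y, pi))

def hansen_hurwitz_total(y, p):
    """Hansen-Hurwitz estimator of the total from n independent PPS
    with-replacement draws: the average of y_k / p_k over the draws,
    with duplicate draws included as separate terms."""
    return sum(yk / pk for yk, pk in zip(y, p)) / len(y)
```

When y_i is exactly proportional to the size measure, y_i / p_i is constant across units and both estimators recover the true total with zero sampling variance, e.g. hansen_hurwitz_total([10, 30], [10/60, 30/60]) gives 60 up to floating-point rounding.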
The corresponding mean estimator is \bar{\hat{Y}}_{HH} = \hat{Y}_{HH} / N. For without-replacement PPS, joint inclusion probabilities inform higher-order adjustments but are not part of the basic HT point estimator.[2]
Variance Estimation
In probability-proportional-to-size (PPS) sampling with replacement, the exact design-based variance of the Hansen-Hurwitz estimator \hat{Y}_{HH} = \frac{1}{n} \sum_{k=1}^n \frac{y_k}{p_k} for the population total Y = \sum_{i=1}^N y_i (where p_i = x_i / X) is \Var(\hat{Y}_{HH}) = \frac{1}{n} \left( \sum_{i=1}^N \frac{y_i^2}{p_i} - Y^2 \right). An unbiased estimator of this variance is \hat{V}(\hat{Y}_{HH}) = \frac{1}{n(n-1)} \sum_{k=1}^n \left( \frac{y_k}{p_k} - \hat{Y}_{HH} \right)^2, which can equivalently be expressed as \hat{V}(\hat{Y}_{HH}) = \frac{X^2}{n(n-1)} \sum_{k=1}^n \left( \frac{y_k}{x_k} - \hat{\beta} \right)^2, where \hat{\beta} = n^{-1} \sum_{k=1}^n y_k / x_k; this estimator is exact under the design and does not rely on approximations.[18][8] For PPS sampling without replacement, the Sen-Yates-Grundy form gives the variance of the Horvitz-Thompson estimator \hat{Y}_{HT} = \sum_{i \in s} y_i / \pi_i, where the \pi_i are the inclusion probabilities (set proportional to the sizes x_i): \Var(\hat{Y}_{HT}) = \frac{1}{2} \sum_{i=1}^N \sum_{j=1}^N (\pi_i \pi_j - \pi_{ij}) \left( \frac{y_i}{\pi_i} - \frac{y_j}{\pi_j} \right)^2, with \pi_{ij} denoting the joint inclusion probability of units i and j.[18] The corresponding estimator is \hat{V}_{SYG}(\hat{Y}_{HT}) = \frac{1}{2} \sum_{i \in s} \sum_{j \in s, j \neq i} \frac{\pi_i \pi_j - \pi_{ij}}{\pi_{ij}} \left( \frac{y_i}{\pi_i} - \frac{y_j}{\pi_j} \right)^2, which requires the joint probabilities \pi_{ij} and applies to fixed-size designs such as systematic PPS without replacement.[18][19] For complex PPS designs, approximation methods such as the bootstrap or Taylor series linearization are commonly used to estimate variances when exact joint probabilities are unavailable or computationally intensive.
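Before turning to approximations, the two exact estimators above can be sketched directly from their formulas. This is an illustrative sketch (hypothetical function names); for the Sen-Yates-Grundy form, the joint inclusion probabilities `pi_joint` must be supplied, since they depend on the specific without-replacement design.

```python
def hh_variance_estimate(y, p):
    """Unbiased variance estimator for the Hansen-Hurwitz total (PPS with
    replacement): sum of (y_k/p_k - Y_hat)^2 over draws, divided by n(n-1)."""
    n = len(y)
    z = [yk / pk for yk, pk in zip(y, p)]   # z_k = y_k / p_k
    y_hat = sum(z) / n                      # Hansen-Hurwitz point estimate
    return sum((zk - y_hat) ** 2 for zk in z) / (n * (n - 1))

def syg_variance_estimate(y, pi, pi_joint):
    """Sen-Yates-Grundy variance estimator for the Horvitz-Thompson total.
    pi_joint[i][j] is the joint inclusion probability of sample units i and j;
    looping over i < j covers each unordered pair once, which absorbs the
    1/2 factor from the double sum in the formula."""
    n = len(y)
    v = 0.0
    for i in range(n):
        for j in range(i + 1, n):
            weight = (pi[i] * pi[j] - pi_joint[i][j]) / pi_joint[i][j]
            v += weight * (y[i] / pi[i] - y[j] / pi[j]) ** 2
    return v
```

In both sketches, a sample in which the expanded values y_i / \pi_i (or y_k / p_k) are constant yields a variance estimate of zero, mirroring the zero-variance property when the study variable is exactly proportional to size.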
Bootstrap procedures resample from the original PPS design to mimic the selection process, providing variance estimates for the Horvitz-Thompson estimator in without-replacement settings; for instance, algorithms tailored to PPS bootstrap the sample while preserving size-based probabilities.[20] Linearization approximates the variance for large samples by treating the estimator as a function of sample totals; it is implemented in software such as R's survey package, which supports PPS designs via the svydesign function with probability arguments and computes variances using linearization or replication methods.[21]
Variance estimation in PPS sampling faces challenges, including high variability when the size measure x_i correlates poorly with the study variable y_i, which reduces efficiency relative to equal-probability sampling and can inflate standard errors.[11] Additionally, stable estimation typically requires sample sizes of roughly n > 10, since denominator terms like n(n-1) in the with-replacement formulas make the variance estimator unstable for very small n.