Effects of Cluster Sizes on Variance Components in Two-Stage Sampling

Open access

Abstract

Determining sample sizes in multistage samples requires variance components for each stage of selection. The relative sizes of the variance components in a cluster sample are dramatically affected by how much the clusters vary in size, by the type of sample design, and by the form of estimator used. Measures of the homogeneity of survey variables within clusters are related to the variance components and affect the numbers of sample units that should be selected at each stage to achieve the desired precision levels. Measures of homogeneity can be estimated using standard software for random-effects models, but the model-based intracluster correlations may need to be transformed before they are appropriate for use with the sample design. We illustrate these points and their implications for sample size calculation in two-stage sample designs using a realistic population derived from household surveys and the decennial census in the U.S.
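To make the link between within-cluster homogeneity and sample size concrete, the sketch below uses the textbook approximation deff = 1 + (m - 1)ρ for the design effect of a clustered sample. It is an illustration only, not the calculation used in the article: it assumes clusters of equal size, the same number m of elements sampled in every cluster, negligible sampling fractions, and a measure of homogeneity ρ that is already on the scale appropriate to the design. The function names and input values are hypothetical. When cluster sizes vary, which is the case the abstract highlights, this approximation can be misleading.

```python
import math


def design_effect(rho, m):
    """Approximate design effect 1 + (m - 1) * rho.

    rho: measure of homogeneity within clusters (assumed known).
    m:   number of elements sampled per cluster (assumed equal).
    """
    return 1.0 + (m - 1) * rho


def clusters_needed(unit_cv, rho, m, target_cv):
    """Number of sample clusters needed to reach target_cv for a mean.

    unit_cv:   population coefficient of variation of the variable.
    target_cv: desired coefficient of variation of the estimated mean.
    Assumes equal-size clusters and negligible sampling fractions.
    """
    srs_n = (unit_cv / target_cv) ** 2        # elements needed under simple random sampling
    total_n = srs_n * design_effect(rho, m)   # inflate for clustering
    return math.ceil(total_n / m)             # whole clusters of m elements each


if __name__ == "__main__":
    # Hypothetical inputs: unit CV of 1.0, rho of 0.05, 20 elements per cluster.
    print(design_effect(0.05, 20))
    print(clusters_needed(1.0, 0.05, 20, 0.03))
```

Raising ρ or m increases the design effect, so more elements in total are needed for the same precision. This is why the abstract stresses that the homogeneity measure must match the sample design before it is used in a calculation of this kind.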

Journal of Official Statistics

The Journal of Statistics Sweden
