Methods for oversampling rare subpopulations in social surveys

Surveys are frequently required to produce estimates for subpopulations, sometimes for a single subpopulation and sometimes for several subpopulations in addition to the total population. When membership of a rare subpopulation (or domain) can be determined from the sampling frame, selecting the required domain sample size is relatively straightforward. In this case the main issue is the extent of oversampling to employ when survey estimates are required for several domains and for the total population. Sampling and oversampling rare domains whose members cannot be identified in advance present a major challenge. A variety of methods has been used in this situation. In addition to large-scale screening, these methods include disproportionate stratified sampling, two-phase sampling, the use of multiple frames, multiplicity sampling, location sampling, panel surveys, and the use of multi-purpose surveys. This paper illustrates the application of these methods in a range of social surveys.

[1]  Sharon L. Lohr,et al.  Inference from Dual Frame Surveys , 2000 .

[2]  Mario Callegaro,et al.  Computing Response Metrics for Online Panels , 2008 .

[3]  Colm O'Muircheartaigh,et al.  Design priorities and disciplinary perspectives: the case of the US National Children's Study , 2008 .

[4]  K. Fiscella,et al.  Use of geocoding and surname analysis to estimate race and ethnicity. , 2006, Health services research.

[5]  Clare K. Purvis,et al.  Using the American Community Survey: Benefits and Challenges , 2006 .

[6]  Sharon L. Lohr,et al.  Multiple-Frame Surveys , 2009 .

[7]  Seymour Sudman,et al.  NEW DEVELOPMENTS IN THE SAMPLING OF SPECIAL POPULATIONS , 1986 .

[8]  S Sudman,et al.  The Use of Network Sampling for Locating the Seriously III , 1988, Medical care.

[9]  Jean-Michel Durr The French new rolling census , 2005 .

[10]  P. Biernacki,et al.  TARGETED SAMPLING: OPTIONS FOR THE STUDY OF HIDDEN POPULATIONS , 1989 .

[11]  Robert D. Tortora,et al.  Multiplicity-based sampling for the mobile telephone population: coverage, nonresponse, and measurement issues , 2007 .

[12]  Graham Kalton,et al.  Sampling Considerations in Research on HIV Risk and Illness , 2002 .

[13]  Regional Offices National Health Interview Survey (NHIS) - Chicago Region - U.S. Census Bureau , 2009 .

[14]  Graves Ej,et al.  National Hospital Discharge Survey , 2004 .

[15]  G. Kalton,et al.  Small-area income and poverty estimates : priorities for 2000 and beyond , 2000 .

[16]  D. Heckathorn 6. Extensions of Respondent-Driven Sampling: Analyzing Continuous Variables and Controlling for Differential Recruitment , 2007 .

[17]  Leslie Kish Optima and Proxima in Linear Sample Designs , 1976 .

[18]  A. Scott,et al.  Using multiple frames in health surveys , 2009, Statistics in medicine.

[19]  Chris J. Skinner,et al.  On the Efficiency of Raking Ratio Estimation for Multiple Frame Surveys , 1991 .

[20]  C. Brooker,et al.  Survey methodology and the issue of response rate: The case example of a community mental health nursing census , 1997 .

[21]  J. Rao Small Area Estimation , 2003 .

[22]  Leslie Kish,et al.  Statistical design for research , 1988 .

[23]  F. Mecatti Center Sampling: a strategy for surveying difficult-to-sample populations , 2004 .

[24]  D. Blanc,et al.  Sampling and Weighting a Survey of Homeless Persons: A French Example , 2001 .

[25]  D. Hoaglin,et al.  Overview of the sampling design and statistical methods used in the National Immunization Survey. , 2001, American journal of preventive medicine.

[26]  G. Kalton,et al.  SURVEY DESIGN AND DATA COLLECTION ISSUES IN THE DISABILITY EVALUATION STUDY , 2002 .

[27]  Word Dl,et al.  Building a Spanish surname list for the 1990s--a new approach to an old problem. , 1996 .

[28]  Chris J. Skinner,et al.  Estimation in dual frame surveys with complex designs , 1996 .

[29]  R. Janssen,et al.  The Young Men's Survey: methods for estimating HIV seroprevalence and risk factors among young men who have sex with men. , 1996, Public health reports.

[30]  B. Schoenberg,et al.  Prevalence and Clinical Features of Epilepsy in a Biracial United States Population , 1986, Epilepsia.

[31]  Michael D. Bankier,et al.  Power Allocations: Determining Sample Sizes for Subnational Areas , 1988 .

[32]  Graham Kalton,et al.  Sampling Rare Populations , 1986 .

[33]  USE OF EXPERT RATINGS AS SAMPLING STRATA FOR A MORE COST-EFFECTIVE PROBABILITY SAMPLE OF A RARE POPULATION. , 2009, Public opinion quarterly.

[34]  H. Wiegand,et al.  Kish, L.: Survey Sampling. John Wiley & Sons, Inc., New York, London 1965, IX + 643 S., 31 Abb., 56 Tab., Preis 83 s. , 1968 .

[35]  Diane S. Lauderdale,et al.  Asian American ethnic identification by surname , 2000 .

[36]  Erik M. Volz,et al.  Probability based estimation theory for respondent driven sampling , 2008 .

[37]  Kosuke Imai,et al.  Survey Sampling , 1998, Nov/Dec 2017.

[38]  Philippe Flajolet,et al.  Adaptive Sampling , 1997 .

[39]  B. Erens,et al.  The health of minority ethnic groups '99 , 2001 .

[40]  Douglas D. Heckathorn,et al.  Respondent-driven sampling : A new approach to the study of hidden populations , 1997 .

[41]  Michael D. Bankier Estimators Based on Several Stratified Samples with Applications to Multiple Frame Surveys , 1986 .

[42]  K. Langa,et al.  The Aging, Demographics, and Memory Study: Study Design and Methods , 2005, Neuroepidemiology.

[43]  Seymour Sudman,et al.  On Finding and Interviewing the Needles in the Haystack: The Use of Multiplicity Sampling , 1982 .

[44]  Seymour Sudman On Sampling of Very Rare Human Populations , 1972 .

[45]  David J. McKenzie,et al.  Surveying migrant households: a comparison of census‐based, snowball and intercept point surveys , 2007 .

[46]  Monroe G. Sirken,et al.  Network Sampling Developments in Survey Research During the Past 40+ Years , 2005 .

[47]  H. Miller,et al.  Sample design and estimation procedures for a national health examination survey of children. , 1971, Vital and health statistics. Series 2, Data evaluation and methods research.

[48]  S Sudman,et al.  Sampling Rare and Elusive Populations , 1988, Science.

[49]  David G Steel,et al.  Sampling within households in household surveys , 2007 .

[50]  W. Deming An Essay on Screening, or on Two-Phase Sampling, Applied to Surveys of a Community , 1977 .

[51]  R. Folsom,et al.  NOTES ON A CCMIK)SITE SIZE MEASURE FOR SELF-WEIGHTING SAMPLES IN MULTIPLE DOMAINS , 2002 .

[52]  Leslie Kish,et al.  Cumulating/Combining population surveys , 1999 .

[53]  M. Elliott,et al.  Sample designs for measuring the health of small racial/ethnic subgroups , 2008, Statistics in medicine.

[54]  Graham Kalton,et al.  PRACTICAL METHODS FOR SAMPLING RARE AND MOBILE POPULATIONS , 2001 .

[55]  D. McCaffrey,et al.  Using the Census Bureau’s surname list to improve estimates of race/ethnicity and associated disparities , 2009, Health Services and Outcomes Research Methodology.

[56]  A. E. Bateman The Statistics of Canada , 1878 .

[57]  Adam Chu,et al.  Weights for Combining Surveys across Time or Space , 1999 .

[58]  David E. Kanouse,et al.  Drawing a probability sample of female street prostitutes in Los Angeles county , 1999 .

[59]  W. Kalsbeek Sampling minority groups in health surveys , 2003, Statistics in medicine.

[60]  J. Catania,et al.  The Effect of Venue Sampling on Estimates of HIV Prevalence and Sexual Risk Behaviors in Men Who Have Sex With Men , 2006, Sexually transmitted diseases.

[61]  Sharon L. Lohr,et al.  Estimation in Multiple-Frame Surveys , 2006 .

[62]  G. Kalton,et al.  Methods for Sampling Rare Populations in Telephone Surveys , 2007 .

[63]  Shawn A. Ross,et al.  Survey Methodology , 2005, The SAGE Encyclopedia of the Sociology of Religion.