5. Sampling and Estimation in Hidden Populations Using Respondent-Driven Sampling

Standard statistical methods often provide no way to make accurate estimates about the characteristics of hidden populations such as injection drug users, the homeless, and artists. In this paper, we further develop a sampling and estimation technique called respondent-driven sampling, which allows researchers to make asymptotically unbiased estimates about these hidden populations. The sample is selected with a snowball-type design that can be done more cheaply, quickly, and easily than other methods currently in use. Further, we can show that under certain specified (and quite general) conditions, our estimates for the percentage of the population with a specific trait are asymptotically unbiased. We further show that these estimates are asymptotically unbiased no matter how the seeds are selected. We conclude with a comparison of respondent-driven samples of jazz musicians in New York and San Francisco, with corresponding institutional samples of jazz musicians from these cities. The results show that some standard methods for studying hidden populations can produce misleading results.

[1]  Bonnie H. Erickson,et al.  Some Problems of Inference from Chain Data , 1979 .

[2]  Lillian S. Lin,et al.  A Venue-Based Method for Sampling Hard-to-Reach Populations , 2001, Public health reports.

[3]  H. Hogan The 1990 Post-Enumeration Survey: operations and results. , 1993, Journal of the American Statistical Association.

[4]  Philippe Flajolet,et al.  Adaptive Sampling , 1997 .

[5]  John G. Kemeny,et al.  Finite Markov chains , 1960 .

[6]  C. McCarty,et al.  Comparing Two Methods for Estimating Network Size , 2001 .

[7]  J. Watters,et al.  HIV seroprevalence among street-recruited injection drug and crack cocaine users in 16 US municipalities. , 1998, American journal of public health.

[8]  H. Russell Bernard,et al.  Estimation of Seroprevalence, Rape, and Homelessness in the United States Using a Social Network Approach , 1998, Evaluation review.

[9]  Graham Kalton,et al.  Introduction to Survey Sampling , 1983 .

[10]  Douglas D. Heckathorn,et al.  Respondent-driven sampling II: deriving valid population estimates from chain-referral samples of hi , 2002 .

[11]  Tom A. B. Snijders,et al.  The transition probabilities of the reciprocity model , 1999 .

[12]  Bradley P. Carlin,et al.  Markov Chain Monte Carlo conver-gence diagnostics: a comparative review , 1996 .

[13]  Olle Häggström Finite Markov Chains and Algorithmic Applications , 2002 .

[14]  D. Watts,et al.  An Experimental Study of Search in Global Social Networks , 2003, Science.

[15]  Douglas D. Heckathorn,et al.  Respondent-driven sampling : A new approach to the study of hidden populations , 1997 .

[16]  Bruce A. Reed,et al.  A Critical Point for Random Graphs with a Given Degree Sequence , 1995, Random Struct. Algorithms.

[17]  S. Welch SAMPLING BY REFERRAL IN A DISPERSED POPULATION , 1975 .

[18]  William H. Press,et al.  Book-Review - Numerical Recipes in Pascal - the Art of Scientific Computing , 1989 .

[19]  Linda M Collins,et al.  Adaptive sampling in research on risk-related behaviors. , 2002, Drug and alcohol dependence.

[20]  P. Biernacki,et al.  TARGETED SAMPLING: OPTIONS FOR THE STUDY OF HIDDEN POPULATIONS , 1989 .

[21]  Joan Jeffri,et al.  Finding the beat: Using respondent-driven sampling to study jazz musicians☆ , 2001 .

[22]  F. A. Seiler,et al.  Numerical Recipes in C: The Art of Scientific Computing , 1989 .

[23]  W. C. Carter,et al.  Detecting measurement bias in respondent reports of personal networks , 2002, Soc. Networks.

[24]  S. Berg Snowball Sampling—I , 2006 .

[25]  S. Feld Why Your Friends Have More Friends Than You Do , 1991, American Journal of Sociology.

[26]  Jennifer Lauby,et al.  Street and network sampling in evaluation studies of HIV risk-reduction interventions. , 2002, AIDS reviews.

[27]  D. Watts Networks, Dynamics, and the Small‐World Phenomenon1 , 1999, American Journal of Sociology.

[28]  M. Spreen Rare Populations, Hidden Populations, and Link-Tracing Designs: What and Why? , 1992 .

[29]  L. Asz Random Walks on Graphs: a Survey , 2022 .

[30]  M E J Newman Assortative mixing in networks. , 2002, Physical review letters.

[31]  Henk F. L. Garretsen,et al.  Snowball Sampling Applied to Opiate Addicts Outside the Treatment System , 1997 .

[32]  M. H. Hansen,et al.  On the Theory of Sampling from Finite Populations , 1943 .

[33]  D. Heckathorn,et al.  Extensions of Respondent-Driven Sampling: A New Approach to the Study of Injection Drug Users Aged 18–25 , 2002, AIDS and Behavior.

[34]  Edward Liebow,et al.  A note on implementation of a random-walk design to study adolescent social networks , 1995 .

[35]  David E. Kanouse,et al.  Drawing a probability sample of female street prostitutes in Los Angeles county , 1999 .

[36]  Mark E. J. Newman,et al.  Ego-centered networks and the ripple effect , 2001, Soc. Networks.

[37]  M. McPherson,et al.  Birds of a Feather: Homophily in Social Networks , 2001 .

[38]  M. McPherson,et al.  BIRDS OF A FEATHER: Homophily , 2001 .

[39]  Monroe G. Sirken,et al.  Household Surveys with Multiplicity , 1970 .

[40]  J. Hopcroft,et al.  Are randomly grown graphs really random? , 2001, Physical review. E, Statistical, nonlinear, and soft matter physics.

[41]  S. Friedman,et al.  Promising social network research results and suggestions for a research agenda. , 1995, NIDA research monograph.

[42]  A. Winsor Sampling techniques. , 2000, Nursing times.

[43]  D. Heckathorn,et al.  A Methodology for Reducing Respondent Duplication and Impersonation in Samples of Hidden Populations , 2001 .

[44]  L. O'donnell,et al.  Time-space sampling in minority communities: results with young Latino men who have sex with men. , 2001, American journal of public health.

[45]  T. Snijders Enumeration and simulation methods for 0–1 matrices with given marginals , 1991 .

[46]  Melvin Small,et al.  Northern Passage: American Vietnam War Resisters in Canada , 2002 .

[47]  S Sudman,et al.  Sampling Rare and Elusive Populations , 1988, Science.

[48]  O. Frank A Survey of Statistical Methods for Graph Analysis , 1981 .

[49]  Duncan J. Watts,et al.  Collective dynamics of ‘small-world’ networks , 1998, Nature.

[50]  J. Coleman Relational Analysis: The Study of Social Organizations with Survey Methods , 1958 .

[51]  P. Biernacki,et al.  Snowball Sampling: Problems and Techniques of Chain Referral Sampling , 1981 .

[52]  O. Geoffrey Okogbaa,et al.  A review of: “Adaptive Sampling” S. Thompson and G. Seber Wiley, 1996 , 1997 .

[53]  M. Newman,et al.  Random graphs with arbitrary degree distributions and their applications. , 2000, Physical review. E, Statistical, nonlinear, and soft matter physics.

[54]  H. Russell Bernard,et al.  A social network approach to estimating seroprevalence in the United States , 1998 .

[55]  Muhammad Hanif,et al.  Sampling With Unequal Probabilities , 1982 .

[56]  Mark E. J. Newman,et al.  The Structure and Function of Complex Networks , 2003, SIAM Rev..

[57]  Tom A. B. Snijders,et al.  Estimation On the Basis of Snowball Samples: How To Weight? , 1992 .

[58]  Tom A. B. Snijders,et al.  Estimating the Size of the Homeless Population in Budapest, Hungary , 2002 .

[59]  Ove Frank,et al.  CHAPTER 16 – ESTIMATION OF POPULATION TOTALS BY USE OF SNOWBALL SAMPLES , 1979 .

[60]  László Lovász,et al.  Random Walks on Graphs: A Survey , 1993 .

[61]  Holly Hagan,et al.  Using a jail-based survey to monitor HIV and risk behaviors among seattle area injection drug users , 2001, Journal of Urban Health.

[62]  John Scott What is social network analysis , 2010 .