Assessing respondent-driven sampling

Respondent-driven sampling (RDS) is a network-based technique for estimating traits in hard-to-reach populations, such as the prevalence of HIV among drug injectors. In recent years RDS has been used in more than 120 studies in more than 20 countries and by leading public health organizations, including the Centers for Disease Control and Prevention in the United States. Despite the widespread use and growing popularity of RDS, there has been little empirical validation of the methodology. Here we investigate the performance of RDS by simulating sampling from 85 known network populations. Across a variety of traits we find that RDS is substantially less accurate than generally acknowledged and that reported RDS confidence intervals are misleadingly narrow. Moreover, because we model a best-case scenario in which the theoretical RDS sampling assumptions hold exactly, it is unlikely that RDS performs any better in practice than in our simulations. Notably, the poor performance of RDS is driven not by bias but by the high variance of the estimates, a possibility that had been largely overlooked in the RDS literature. Given the consistency of our results across networks and our generous sampling conditions, we conclude that RDS as currently practiced may not be suitable for key aspects of public health surveillance where it is now extensively applied.
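The kind of simulation described above can be illustrated with a minimal sketch. It is not the procedure used in this study: the network here is a small synthetic one rather than any of the 85 real networks, recruitment is idealized as a with-replacement random walk in which each participant recruits exactly one neighbor, and the estimator is the standard inverse-degree-weighted mean. All function names and parameter values (`make_network`, `homophily`, `size`, and so on) are assumptions made for illustration.

```python
import random
import statistics


def make_network(n=1000, k=6, homophily=0.8, prevalence=0.3, seed=0):
    """Build a toy undirected network with a binary trait (hypothetical setup)."""
    rng = random.Random(seed)
    trait = [1 if rng.random() < prevalence else 0 for _ in range(n)]
    neighbors = [set() for _ in range(n)]
    # A ring guarantees that the network is connected.
    for i in range(n):
        j = (i + 1) % n
        neighbors[i].add(j)
        neighbors[j].add(i)
    edges = n
    while edges < n * k // 2:
        i, j = rng.randrange(n), rng.randrange(n)
        if i == j or j in neighbors[i]:
            continue
        # Reject most cross-trait ties so that similar nodes cluster together.
        if trait[i] != trait[j] and rng.random() < homophily:
            continue
        neighbors[i].add(j)
        neighbors[j].add(i)
        edges += 1
    return trait, [sorted(s) for s in neighbors]


def rds_sample(neighbors, size, rng):
    """Idealized recruitment: a random walk, sampling with replacement."""
    node = rng.randrange(len(neighbors))  # seed chosen uniformly at random
    sample = []
    for _ in range(size):
        node = rng.choice(neighbors[node])  # recruit one neighbor uniformly
        sample.append(node)
    return sample


def rds_estimate(sample, trait, neighbors):
    """Inverse-degree-weighted mean of the trait over the sample."""
    weights = [1.0 / len(neighbors[v]) for v in sample]
    return sum(w * trait[v] for w, v in zip(weights, sample)) / sum(weights)


def simulate(reps=200, size=200, seed=1):
    """Repeat the sampling and summarize bias and spread of the estimates."""
    trait, neighbors = make_network()
    rng = random.Random(seed)
    truth = sum(trait) / len(trait)
    estimates = [
        rds_estimate(rds_sample(neighbors, size, rng), trait, neighbors)
        for _ in range(reps)
    ]
    mean_estimate = statistics.mean(estimates)
    return {
        "truth": truth,
        "mean_estimate": mean_estimate,
        "bias": mean_estimate - truth,
        "sd": statistics.stdev(estimates),
        # Standard error of a simple random sample of the same size, for scale.
        "srs_sd": (truth * (1 - truth) / size) ** 0.5,
    }


if __name__ == "__main__":
    for name, value in simulate().items():
        print(f"{name}: {value:.4f}")
```

Comparing `sd` with `srs_sd` gives a rough sense of how much more variable the network-based estimates are than those from a simple random sample of the same size, while `bias` shows whether the estimates are centered on the true value. The numbers depend entirely on the toy network and should not be read as results of the study.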
