3. An Empirical Test of Respondent-Driven Sampling: Point Estimates, Variance, Degree Measures, and Out-of-Equilibrium Data

This paper, which is the first large-scale application of respondent-driven sampling (RDS) to nonhidden populations, tests three factors related to RDS estimation against institutional data using two Web RDS samples of university undergraduates. First, two methods of calculating RDS point estimates are compared. RDS estimates calculated using both methods coincide closely, but variance estimation, especially for small groups, is problematic for both methods. In one method, the bootstrap algorithm used to generate confidence intervals is found to underestimate variance. In the other method, where analytical variance estimation is possible, confidence intervals tend to overestimate variance. Second, RDS estimates are found to be robust against varying measures of individual degree. Results suggest the standard degree measure currently employed in most RDS studies is among the best-performing degree measures. Finally, RDS is found to be robust against the inclusion of out-of-equilibrium data. The results show that valid point estimates can be generated with RDS analysis using real data, but that further research is needed to improve variance estimation techniques.

[1]  Béla Bollobás,et al.  Random Graphs , 1985 .

[2]  Douglas D. Heckathorn,et al.  Respondent-driven sampling : A new approach to the study of hidden populations , 1997 .

[3]  Mohsen Malekinejad,et al.  Using Respondent-Driven Sampling Methodology for HIV Biological and Behavioral Surveillance in International Settings: A Systematic Review , 2008, AIDS and Behavior.

[4]  S. Berg Snowball Sampling—I , 2006 .

[5]  Matthew E. Brashears,et al.  Social Isolation in America: Changes in Core Discussion Networks over Two Decades , 2006 .

[6]  Robert Heimer,et al.  Critical Issues and Further Questions About Respondent-Driven Sampling: Comment on Ramirez-Valles, et al. (2005) , 2005, AIDS and Behavior.

[7]  Alan M. Frieze,et al.  Random graphs , 2006, SODA '06.

[8]  Duncan J. Watts,et al.  Collective dynamics of ‘small-world’ networks , 1998, Nature.

[9]  J. Coleman Relational Analysis: The Study of Social Organizations with Survey Methods , 1958 .

[10]  C. McCarty,et al.  Comparing Two Methods for Estimating Network Size , 2001 .

[11]  P. V. Marsden,et al.  Core Discussion Networks of Americans , 1987 .

[12]  D. Heckathorn,et al.  Extensions of Respondent-Driven Sampling: A New Approach to the Study of Injection Drug Users Aged 18–25 , 2002, AIDS and Behavior.

[13]  Cyprian Wejnert,et al.  Web-Based Network Sampling , 2008 .

[14]  Robert G Carlson,et al.  Respondent-driven sampling to recruit MDMA users: a methodological assessment. , 2005, Drug and alcohol dependence.

[15]  P. V. Marsden,et al.  NETWORK DATA AND MEASUREMENT , 1990 .

[16]  Cyprian Wejnert,et al.  RESPONDENT-DRIVEN SAMPLING FOR ONLINE RESEARCH , 2007 .

[17]  Douglas D. Heckathorn,et al.  Respondent-driven sampling II: deriving valid population estimates from chain-referral samples of hi , 2002 .

[18]  Muhammad Hanif,et al.  Sampling With Unequal Probabilities , 1982 .

[19]  Erik M. Volz,et al.  Probability based estimation theory for respondent driven sampling , 2008 .

[20]  Ali Haider,et al.  Partner naming and forgetting: Recall of network members , 2007, Soc. Networks.

[21]  A. Winsor Sampling techniques. , 2000, Nursing times.

[22]  Matthias Schonlau,et al.  Respondent-Driven Sampling , 2010 .

[23]  Matthew J. Salganik Variance Estimation, Design Effects, and Sample Size Calculations for Respondent-Driven Sampling , 2006, Journal of Urban Health.