Respondent Driven Sampling: Determinants of Recruitment and a Method to Improve Point Estimation

Introduction Respondent-driven sampling (RDS) is a variant of a link-tracing design intended for generating unbiased estimates of the composition of hidden populations that typically involves giving participants several coupons to recruit their peers into the study. RDS may generate biased estimates if coupons are distributed non-randomly or if potential recruits present for interview non-randomly. We explore if biases detected in an RDS study were due to either of these mechanisms, and propose and apply weights to reduce bias due to non-random presentation for interview. Methods Using data from the total population, and the population to whom recruiters offered their coupons, we explored how age and socioeconomic status were associated with being offered a coupon, and, if offered a coupon, with presenting for interview. Population proportions were estimated by weighting by the assumed inverse probabilities of being offered a coupon (as in existing RDS methods), and also of presentation for interview if offered a coupon by age and socioeconomic status group. Results Younger men were under-recruited primarily because they were less likely to be offered coupons. The under-recruitment of higher socioeconomic status men was due in part to them being less likely to present for interview. Consistent with these findings, weighting for non-random presentation for interview by age and socioeconomic status group greatly improved the estimate of the proportion of men in the lowest socioeconomic group, reducing the root-mean-squared error of RDS estimates of socioeconomic status by 38%, but had little effect on estimates for age. The weighting also improved estimates for tribe and religion (reducing root-mean-squared-errors by 19–29%), but had little effect for sexual activity or HIV status. Conclusions Data collected from recruiters on the characteristics of men to whom they offered coupons may be used to reduce bias in RDS studies. Further evaluation of this new method is required.

[1]  Krista Gile Improved Inference for Respondent-Driven Sampling Data With Application to HIV Prevalence Estimation , 2010, 1006.4837.

[2]  D. Heckathorn 6. Extensions of Respondent-Driven Sampling: Analyzing Continuous Variables and Controlling for Differential Recruitment , 2007 .

[3]  Richard G. White,et al.  Evaluation of the role of location and distance in recruitment in respondent-driven sampling , 2011, International journal of health geographics.

[4]  P. Kaye Infectious diseases of humans: Dynamics and control , 1993 .

[5]  Richard G. White,et al.  Community understanding of respondent-driven sampling in a medical research setting in Uganda: importance for the use of RDS for public health research , 2013, International journal of social research methodology.

[6]  Mark S Handcock,et al.  7. Respondent-Driven Sampling: An Assessment of Current Methodology , 2009, Sociological methodology.

[7]  Douglas D. Heckathorn,et al.  Respondent-driven sampling : A new approach to the study of hidden populations , 1997 .

[8]  Tobi Saidel,et al.  Review of sampling hard-to-reach and hidden populations for HIV surveillance. , 2005, AIDS.

[9]  Erik M. Volz,et al.  Probability based estimation theory for respondent driven sampling , 2008 .

[10]  Andrew J Copas,et al.  Evaluation of Respondent-driven Sampling , 2012, Epidemiology.

[11]  Douglas D. Heckathorn,et al.  Respondent-driven sampling II: deriving valid population estimates from chain-referral samples of hi , 2002 .

[12]  A. Kamali,et al.  HIV prevalence and incidence are no longer falling in southwest Uganda: evidence from a rural population cohort 1989–2005 , 2008, AIDS.