Birds of a feather: using a rotational box plot to assess ascertainment bias.

BACKGROUND: Comparability of study participants with non-participants is customarily assessed by contrasting the distributions of their sociodemographic characteristics. Such comparisons do not necessarily show whether participants in a given subgroup are similar to non-participants in the same subgroup. A geographical information system (GIS) may provide such insight by visually displaying the spatial distributions of participants and non-participants. In a previously reported study of heterosexuals at elevated risk for human immunodeficiency virus (HIV), traditional methods suggested distributional differences in the demographic characteristics of participants and non-participants.

METHODS: Based on residential address co-ordinates for each subgroup member, we used the subgroup's centroid as the origin and constructed a 360-degree series of overlapping box plots of the distance of subgroup members to the origin, thereby producing a closed polygon for each of the box plot demarcators.

RESULTS: These rotational box plots revealed similar geographical distributions for most participant and non-participant subgroups, with the exception of African-American men and women.

CONCLUSIONS: The observed differences resulted in part from the study design, and provided some insight into sampling problems encountered in social network studies. Based on Tobler's supposition that 'nearby things tend to be alike', the rotational box plot is a useful additional tool for investigating sample bias.
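The construction described in the methods can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not state the angular spacing, the sector width, or which demarcators were drawn, so the function name `rotational_box_plot`, the parameters `step_deg` and `window_deg`, and the choice of quartiles and median are all assumptions made for illustration. Co-ordinates are assumed to be planar (already projected).

```python
import numpy as np

def rotational_box_plot(xy, step_deg=10.0, window_deg=45.0):
    """Box plot demarcators of distance-to-centroid, by direction.

    xy         : (n, 2) array of planar co-ordinates (hypothetical input)
    step_deg   : angular spacing between successive box plots (assumed)
    window_deg : sector width; window_deg > step_deg makes neighbouring
                 sectors overlap (assumed)
    """
    xy = np.asarray(xy, dtype=float)
    centroid = xy.mean(axis=0)              # origin of the plot
    dx, dy = (xy - centroid).T
    dist = np.hypot(dx, dy)                 # distance of each member to origin
    bearing = np.degrees(np.arctan2(dy, dx)) % 360.0

    angles = np.arange(0.0, 360.0, step_deg)
    stats = np.full((angles.size, 3), np.nan)   # columns: Q1, median, Q3
    for i, a in enumerate(angles):
        # smallest angular difference between each bearing and sector centre
        diff = np.abs((bearing - a + 180.0) % 360.0 - 180.0)
        in_sector = diff <= window_deg / 2.0
        if in_sector.any():                 # empty sectors stay NaN
            stats[i] = np.percentile(dist[in_sector], [25, 50, 75])

    # Place each demarcator along its sector's direction; joining the
    # vertices in angular order gives one closed polygon per demarcator.
    theta = np.radians(angles)
    unit = np.column_stack([np.cos(theta), np.sin(theta)])
    names = ("q1", "median", "q3")
    polygons = {n: centroid + unit * stats[:, [k]] for k, n in enumerate(names)}
    return centroid, angles, stats, polygons
```

Running the function separately on the participants and the non-participants of one subgroup and overlaying the resulting polygons would give the kind of visual comparison the abstract describes. Whisker or outlier demarcators could be added as further columns in the same way.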
