OBJECTIVE
To efficiently estimate race/ethnicity using administrative records to facilitate health care organizations' efforts to address disparities when self-reported race/ethnicity data are unavailable.
DATA SOURCE
Surname, geocoded residential address, and self-reported race/ethnicity from 1,973,362 enrollees of a national health plan.
STUDY DESIGN
We compare the accuracy of a Bayesian approach to combining surname and geocoded information to estimate race/ethnicity to two other indirect methods: a non-Bayesian method that combines surname and geocoded information and geocoded information alone. We assess accuracy with respect to estimating (1) individual race/ethnicity and (2) overall racial/ethnic prevalence in a population.
PRINCIPAL FINDINGS
The Bayesian approach was 74 percent more efficient than geocoding alone in estimating individual race/ethnicity and 56 percent more efficient in estimating the prevalence of racial/ethnic groups, outperforming the non-Bayesian hybrid on both measures. The non-Bayesian hybrid was more efficient than geocoding alone in estimating individual race/ethnicity but less efficient with respect to prevalence (p<.05 for all differences).
CONCLUSIONS
The Bayesian Surname and Geocoding (BSG) method presented here efficiently integrates administrative data, substantially improving upon what is possible with a single source or from other hybrid methods; it offers a powerful tool that can help health care organizations address disparities until self-reported race/ethnicity data are available.
[1]
G. Brier.
VERIFICATION OF FORECASTS EXPRESSED IN TERMS OF PROBABILITY
,
1950
.
[2]
G. Brier,et al.
External correspondence: Decompositions of the mean probability score
,
1982
.
[3]
Yi Zeng,et al.
Causes and implications of the recent increase in the reported sex ratio at birth in China.
,
1993
.
[4]
Surname analysis for estimating local concentration of Hispanics and Asians
,
1994
.
[5]
Diane S. Lauderdale,et al.
Asian American ethnic identification by surname
,
2000
.
[6]
B. Smedley,et al.
Unequal Treatment: Con-fronting Racial and Ethnic Disparities in Health Care
,
2002
.
[7]
E. Perrin,et al.
Eliminating Health Disparities: Measurement and Data Needs
,
2004
.
[8]
K. Fiscella,et al.
Use of geocoding and surname analysis to estimate race and ethnicity.
,
2006,
Health services research.
[9]
Daniel F McCaffrey,et al.
Power of tests for a dichotomous independent variable measured with error.
,
2008,
Health services research.