Validation of the Symptom Pattern Method for Analyzing Verbal Autopsy Data

Background Cause of death data are a critical input to formulating good public health policy. In the absence of reliable vital registration data, information collected after death from household members, called verbal autopsy (VA), is commonly used to study causes of death. VA data are usually analyzed by physician-coded verbal autopsy (PCVA). PCVA is expensive and its comparability across regions is questionable. Nearly all validation studies of PCVA have allowed physicians access to information collected from the household members' recall of medical records or contact with health services, thus exaggerating accuracy of PCVA in communities where few deaths had any interaction with the health system. In this study we develop and validate a statistical strategy for analyzing VA data that overcomes the limitations of PCVA. Methods and Findings We propose and validate a method that combines the advantages of methods proposed by King and Lu, and Byass, which we term the symptom pattern (SP) method. The SP method uses two sources of VA data. First, it requires a dataset for which we know the true cause of death, but which need not be representative of the population of interest; this dataset might come from deaths that occur in a hospital. The SP method can then be applied to a second VA sample that is representative of the population of interest. From the hospital data we compute the properties of each symptom; that is, the probability of responding yes to each symptom, given the true cause of death. These symptom properties allow us first to estimate the population-level cause-specific mortality fractions (CSMFs), and to then use the CSMFs as an input in assigning a cause of death to each individual VA response. Finally, we use our individual cause-of-death assignments to refine our population-level CSMF estimates. The results from applying our method to data collected in China are promising. At the population level, SP estimates the CSMFs with 16% average relative error and 0.7% average absolute error, while PCVA results in 27% average relative error and 1.1% average absolute error. At the individual level, SP assigns the correct cause of death in 83% of the cases, while PCVA does so for 69% of the cases. We also compare the results of SP and PCVA when both methods have restricted access to the information from the medical record recall section of the VA instrument. At the population level, without medical record recall, the SP method estimates the CSMFs with 14% average relative error and 0.6% average absolute error, while PCVA results in 70% average relative error and 3.2% average absolute error. For individual estimates without medical record recall, SP assigns the correct cause of death in 78% of cases, while PCVA does so for 38% of cases. Conclusions Our results from the data collected in China suggest that the SP method outperforms PCVA, both at the population and especially at the individual level. Further study is needed on additional VA datasets in order to continue validation of the method, and to understand how the symptom properties vary as a function of culture, language, and other factors. Our results also suggest that PCVA relies heavily on household recall of medical records and related information, limiting its applicability in low-resource settings. SP does not require that additional information to adequately estimate causes of death.

[1]  D Chandramohan,et al.  Diagnostic accuracy of physician review, expert algorithms and data-derived algorithms in adult verbal autopsies. , 1999, International journal of epidemiology.

[2]  Alan D. Lopez,et al.  Validation of cause-of-death statistics in urban China. , 2007, International journal of epidemiology.

[3]  P A Singer,et al.  Grand Challenges in Global Health , 2003, Science.

[4]  D Chandramohan,et al.  Validity of data‐derived algorithms for ascertaining causes of adult death in two African sites using verbal autopsy , 2000, Tropical medicine & international health : TM & IH.

[5]  Alan D. Lopez,et al.  Validity of verbal autopsy procedures for determining cause of death in Tanzania , 2006, Tropical medicine & international health : TM & IH.

[6]  Ying Lu,et al.  Verbal Autopsy Methods with Multiple Causes of Death , 2008, 0808.0645.

[7]  S. Khoury,et al.  Mortality and causes of death in Jordan 1995-96: assessment by verbal autopsy. , 1999, Bulletin of the World Health Organization.

[8]  Alan D. Lopez,et al.  Global mortality, disability, and the contribution of risk factors: Global Burden of Disease Study , 1997, The Lancet.

[9]  B Zaba,et al.  The impact of HIV/AIDS on mortality and household mobility in rural Tanzania , 2001, AIDS.

[10]  Alan D. Lopez,et al.  Setting international standards for verbal autopsy. , 2007, Bulletin of the World Health Organization.

[11]  C. Donnelly,et al.  Creating and Validating an Algorithm to Measure AIDS Mortality in the Adult Population using Verbal Autopsy , 2006, PLoS medicine.

[12]  B. Reeves,et al.  A review of data-derived methods for assigning causes of death from verbal autopsy data. , 1997, International journal of epidemiology.

[13]  R. Black,et al.  Validation of postmortem interviews to ascertain selected causes of death in children. , 1990, International journal of epidemiology.

[14]  A Boulle,et al.  A case study of using artificial neural networks for classifying cause of death from verbal autopsy. , 2001, International journal of epidemiology.

[15]  Gonghuan Yang,et al.  Validation of verbal autopsy procedures for adult deaths in China. , 2006, International journal of epidemiology.

[16]  Peter Byass,et al.  Refining a probabilistic model for interpreting verbal autopsy data , 2006, Scandinavian journal of public health.

[17]  D Chandramohan,et al.  Effect of misclassification of causes of death in verbal autopsy: can it be adjusted? , 2001, International journal of epidemiology.

[18]  S. Ahmed,et al.  Maternal mortality in rural Bangladesh. , 1974, Studies in family planning.

[19]  R. Peto,et al.  Prospective Study of One Million Deaths in India: Rationale, Design, and Validation Results , 2005, PLoS medicine.

[20]  Robert E Black,et al.  Where and why are 10 million children dying every year? , 2003, The Lancet.

[21]  D. Ross,et al.  The effect of different sensitivity, specificity and cause-specific mortality fractions on the estimation of differences in cause-specific mortality rates in children from studies using verbal autopsies. , 1997, International journal of epidemiology.

[22]  Alan D. Lopez,et al.  Evaluating national cause-of-death statistics: principles and application to the case of China. , 2005, Bulletin of the World Health Organization.

[23]  Alan D. Lopez,et al.  Core Verbal Autopsy Procedures with Comparative Validation Results from Two Countries , 2006, PLoS medicine.

[24]  J. Katz,et al.  Evaluation of neonatal verbal autopsy using physician review versus algorithm-based cause-of-death assignment in rural Nepal. , 2005, Paediatric and perinatal epidemiology.

[25]  Daniel Chandramohan,et al.  Verbal autopsy: current practices and challenges. , 2006, Bulletin of the World Health Organization.

[26]  S. Tollman,et al.  Validation and application of verbal autopsies in a rural area of South Africa , 2000, Tropical medicine & international health : TM & IH.

[27]  R. J. Hayes,et al.  The validity of verbal autopsies for assessing the causes of institutional maternal death. , 1998, Studies in family planning.

[28]  Alan D. Lopez,et al.  Global and regional burden of disease and risk factors, 2001: systematic analysis of population health data , 2006, The Lancet.

[29]  C. Murray,et al.  Understanding the Coronary Heart Disease Versus Total Cardiovascular Mortality Paradox: A Method to Enhance the Comparability of Cardiovascular Death Statistics in the United States , 2006, Circulation.

[30]  Peter Byass,et al.  A probabilistic approach to interpreting verbal autopsies: methodology and preliminary validation in Vietnam , 2003, Scandinavian journal of public health. Supplement.

[31]  M. Anker,et al.  The effect of misclassification error on reported cause-specific mortality fractions from verbal autopsy. , 1997, International journal of epidemiology.