Robust metrics for assessing the performance of different verbal autopsy cause assignment methods in validation studies

Background: Verbal autopsy (VA) is an important method for obtaining cause of death information in settings without vital registration and medical certification of causes of death. An array of methods, including physician review and computer-automated methods, have been proposed and used. Choosing the best method for VA requires the appropriate metrics for assessing performance. Currently used metrics such as sensitivity, specificity, and cause-specific mortality fraction (CSMF) errors do not provide a robust basis for comparison.

Methods: We use simple simulations of populations with three causes of death to demonstrate that most metrics used in VA validation studies are extremely sensitive to the CSMF composition of the test dataset. Simulations also demonstrate that an inferior method can appear to have better performance than an alternative due strictly to the CSMF composition of the test set.

Results: VA methods need to be evaluated across a set of test datasets with widely varying CSMF compositions. We propose two metrics for assessing the performance of a proposed VA method. For assessing how well a method does at individual cause of death assignment, we recommend the average chance-corrected concordance across causes. This metric is insensitive to the CSMF composition of the test sets and corrects for the degree to which a method will get the cause correct due strictly to chance. For the evaluation of CSMF estimation, we propose CSMF accuracy. CSMF accuracy is defined as one minus the sum of all absolute CSMF errors across causes divided by the maximum total error. It is scaled from zero to one and can generalize a method's CSMF estimation capability regardless of the number of causes. Performance of a VA method for CSMF estimation by cause can be assessed by examining the relationship across test datasets between the estimated CSMF and the true CSMF.

Conclusions: With an increasing range of VA methods available, it will be critical to objectively assess their performance in assigning cause of death. Chance-corrected concordance and CSMF accuracy assessed across a large number of test datasets with widely varying CSMF composition provide a robust strategy for this assessment.
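The two proposed metrics can be illustrated with a minimal Python sketch. The abstract does not spell out the maximum total error or the exact chance correction, so two assumptions are made here: the maximum total error is taken as 2 × (1 − smallest true CSMF), and the chance-corrected concordance for a cause is taken as (sensitivity − 1/N) / (1 − 1/N) for N causes. The function names and the toy data are hypothetical and are not drawn from the study.

```python
def csmf_accuracy(true_csmf, est_csmf):
    """One minus the summed absolute CSMF error over the maximum total error.

    Assumption: the maximum total error is 2 * (1 - min true CSMF), i.e. the
    error made when every death is assigned to the rarest true cause.
    Both arguments map cause -> fraction and should cover the same causes.
    """
    total_abs_error = sum(
        abs(true_csmf[cause] - est_csmf.get(cause, 0.0)) for cause in true_csmf
    )
    max_total_error = 2.0 * (1.0 - min(true_csmf.values()))
    return 1.0 - total_abs_error / max_total_error


def chance_corrected_concordance(true_causes, predicted_causes, causes):
    """Per-cause concordance, corrected for random assignment among N causes.

    Assumption: a random method gets a cause right with probability 1/N,
    so the corrected value is (sensitivity - 1/N) / (1 - 1/N).
    Causes with no true deaths in the test set are skipped.
    """
    n_causes = len(causes)
    chance = 1.0 / n_causes
    result = {}
    for cause in causes:
        predictions = [
            pred for truth, pred in zip(true_causes, predicted_causes)
            if truth == cause
        ]
        if not predictions:
            continue
        sensitivity = sum(pred == cause for pred in predictions) / len(predictions)
        result[cause] = (sensitivity - chance) / (1.0 - chance)
    return result


def mean_ccc(ccc_by_cause):
    """Average chance-corrected concordance across causes."""
    return sum(ccc_by_cause.values()) / len(ccc_by_cause)


# Toy example with three causes of death (illustrative values only).
true_csmf = {"A": 0.5, "B": 0.3, "C": 0.2}
est_csmf = {"A": 0.4, "B": 0.4, "C": 0.2}
accuracy = csmf_accuracy(true_csmf, est_csmf)

true_causes = ["A", "A", "B", "B", "C", "C"]
predicted_causes = ["A", "B", "B", "B", "C", "A"]
ccc = chance_corrected_concordance(true_causes, predicted_causes, ["A", "B", "C"])
average_ccc = mean_ccc(ccc)
```

Under these assumptions, a method that assigns every death to the rarest true cause scores a CSMF accuracy of zero, and a method that assigns causes at random scores a chance-corrected concordance near zero. In line with the abstract, both metrics would be computed across many test datasets with varying CSMF composition rather than on a single one.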
