Markov blanket-based approach for learning multi-dimensional Bayesian network classifiers: An application to predict the European Quality of Life-5 Dimensions (EQ-5D) from the 39-item Parkinson's Disease Questionnaire (PDQ-39)

Multi-dimensional Bayesian network classifiers (MBCs) are probabilistic graphical models recently proposed for multi-dimensional classification problems, in which each instance in the data set must be assigned a value for more than one class variable. In this paper, we propose a Markov blanket-based approach for learning MBCs from data. The approach first determines the Markov blanket around each class variable using the HITON algorithm and then specifies the directionality over the MBC subgraphs. We apply it to the problem of predicting the European Quality of Life-5 Dimensions (EQ-5D) from the 39-item Parkinson's Disease Questionnaire (PDQ-39) in order to estimate the health-related quality of life of Parkinson's patients. Five-fold cross-validation experiments were carried out on randomly generated synthetic data sets, on the Yeast data set, and on a real-world Parkinson's disease data set containing 488 patients. The experimental study, which includes comparisons with additional Bayesian network-based approaches, back propagation for multi-label learning, multi-label k-nearest neighbor, multinomial logistic regression, ordinary least squares, and censored least absolute deviations, shows encouraging results in terms of both predictive accuracy and the identification of dependence relationships among class and feature variables.
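To make the two-stage idea concrete, the following is a minimal Python sketch of the second stage only: given a set of variables found adjacent to each class variable, it sorts the resulting edges into an MBC class subgraph and a bridge subgraph. It is not the procedure from the paper. The function name `assemble_mbc`, the `neighbours` input (standing in for the output of a local discovery step such as HITON, which is not implemented here), and the variable names are all hypothetical. Two simplifying assumptions are made: bridge edges are always directed from class to feature, as the MBC structure requires, and class-class edges are oriented by the order in which the class variables are listed, which is just one arbitrary way to keep the class subgraph acyclic. The feature subgraph is omitted.

```python
from typing import Dict, List, Set, Tuple

Edge = Tuple[str, str]  # (parent, child)


def assemble_mbc(
    class_vars: List[str],
    neighbours: Dict[str, Set[str]],
) -> Dict[str, List[Edge]]:
    """Split locally discovered adjacencies into MBC subgraphs.

    neighbours[c] is a hypothetical set of variables found adjacent to
    class variable c by some local discovery routine (assumed, not shown).
    """
    classes = set(class_vars)
    # Assumption: the listing order of class_vars fixes edge directions
    # inside the class subgraph, which guarantees acyclicity.
    order = {c: i for i, c in enumerate(class_vars)}
    class_edges: Set[Edge] = set()
    bridge_edges: Set[Edge] = set()

    for c in class_vars:
        for v in neighbours.get(c, set()):
            if v == c:
                continue  # ignore self-adjacency
            if v in classes:
                # Class-class adjacency: orient from earlier to later class.
                parent, child = sorted((c, v), key=order.__getitem__)
                class_edges.add((parent, child))
            else:
                # Class-feature adjacency: an MBC only allows class -> feature.
                bridge_edges.add((c, v))

    return {"class": sorted(class_edges), "bridge": sorted(bridge_edges)}


if __name__ == "__main__":
    # Toy input with made-up names; not taken from the paper's data.
    toy = {
        "mobility": {"self_care", "pdq_item_a"},
        "self_care": {"mobility", "pdq_item_b"},
    }
    print(assemble_mbc(["mobility", "self_care"], toy))
```

A full learner would also need the local discovery step itself, a feature subgraph, and parameter estimation; the sketch only shows how per-class neighbourhoods can be merged into one directed structure that respects the MBC edge restrictions.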
