A hybrid method for learning multi-dimensional Bayesian network classifiers based on an optimization model

Bayesian networks, which have a solid mathematical basis as classifiers, take the prior information of samples into consideration. They have gained considerable popularity for solving classification problems. However, many real-world applications can be viewed as classification problems in which instances have to be assigned to a set of different classes at the same time. To address this problem, multi-dimensional Bayesian network classifiers (MBCs), which organize class and feature variables as three subgraphs, have recently been proposed. Because each subgraph has different structural restrictions, three different learning algorithms are needed. In this paper, we present for the first time an MBC learning algorithm based on an optimization model (MBC-OM) that is inspired by the constraint-based Bayesian network structure learning method. MBC-OM uses the chi-squared statistic and mutual information to estimate the dependence coefficients among variables, and these are used to construct an objective function as an overall measure of the dependence for a classifier structure. Therefore, the problem of searching for an optimal classifier becomes one of finding the maximum value of the objective function in feasible fields. We prove the existence and uniqueness of the numerical solution. Moreover, we validate our method on five benchmark data sets. Experimental results are competitive, and outperform state-of-the-art algorithms for multi-dimensional classification.

[1]  Grey Giddins,et al.  Statistics , 2016, The Journal of hand surgery, European volume.

[2]  John Z. Zhang,et al.  Enhancing multi-label music genre classification through ensemble techniques , 2011, SIGIR.

[3]  Lun-Ping Hung,et al.  A data driven ensemble classifier for credit scoring analysis , 2010, Expert Syst. Appl..

[4]  S. Appavu alias Balamurugan,et al.  NB+: An improved Naïve Bayesian algorithm , 2011, Knowl. Based Syst..

[5]  Zhi-Hua Zhou,et al.  Multilabel Neural Networks with Applications to Functional Genomics and Text Categorization , 2006, IEEE Transactions on Knowledge and Data Engineering.

[6]  Gary Chartrand,et al.  Introduction to Graph Theory , 2004 .

[7]  Liangxiao Jiang,et al.  A Novel Bayes Model: Hidden Naive Bayes , 2009, IEEE Transactions on Knowledge and Data Engineering.

[8]  Zhi-Hua Zhou,et al.  ML-KNN: A lazy learning approach to multi-label learning , 2007, Pattern Recognit..

[9]  Solomon Kullback,et al.  Information Theory and Statistics , 1970, The Mathematical Gazette.

[10]  Christian Borgelt,et al.  A conditional independence algorithm for learning undirected graphical models , 2010, J. Comput. Syst. Sci..

[11]  Liangxiao Jiang,et al.  Improving Tree augmented Naive Bayes for class probability estimation , 2012, Knowl. Based Syst..

[12]  Luis Enrique Sucar,et al.  A Two-Step Method to Learn Multidimensional Bayesian Network Classifiers Based on Mutual Information Measures , 2011, FLAIRS.

[13]  Solomon Kullback,et al.  Information Theory and Statistics , 1960 .

[14]  Linda C. van der Gaag,et al.  Inference and Learning in Multi-dimensional Bayesian Network Classifiers , 2007, ECSQARU.

[15]  Jiawei Han,et al.  SRDA: An Efficient Algorithm for Large-Scale Discriminant Analysis , 2008, IEEE Transactions on Knowledge and Data Engineering.

[16]  Rudolf Kruse,et al.  Editor's foreword , 2010, J. Comput. Syst. Sci..

[17]  Concha Bielza,et al.  Multi-dimensional classification with Bayesian networks , 2011, Int. J. Approx. Reason..

[18]  Haytham Elghazel,et al.  A hybrid algorithm for Bayesian network structure learning with application to multi-label learning , 2014, Expert Syst. Appl..

[19]  Neelam Sharma,et al.  INTRUSION DETECTION USING NAIVE BAYES CLASSIFIER WITH FEATURE REDUCTION , 2012 .

[20]  Concha Bielza,et al.  Multi-label classification with Bayesian network-based chain classifiers , 2014, Pattern Recognit. Lett..

[21]  José Antonio Lozano Alonso,et al.  Learning Bayesian network classifiers for multidimensional supervised classification problems by means of a multiobjective approach , 2010 .

[22]  Concha Bielza,et al.  Markov blanket-based approach for learning multi-dimensional Bayesian network classifiers: An application to predict the European Quality of Life-5 Dimensions (EQ-5D) from the 39-item Parkinson's Disease Questionnaire (PDQ-39) , 2012, J. Biomed. Informatics.

[23]  David Maxwell Chickering,et al.  On the incompatibility of faithfulness and monotone DAG faithfulness , 2006, Artif. Intell..

[24]  Nir Friedman,et al.  Bayesian Network Classifiers , 1997, Machine Learning.

[25]  Xue-wen Chen,et al.  Improving Bayesian Network Structure Learning with Mutual Information-Based Node Ordering in the K2 Algorithm , 2008, IEEE Transactions on Knowledge and Data Engineering.

[26]  Min-Ling Zhang,et al.  A Review on Multi-Label Learning Algorithms , 2014, IEEE Transactions on Knowledge and Data Engineering.

[27]  Richard E. Neapolitan,et al.  Learning Bayesian networks , 2007, KDD '07.

[28]  Jiebo Luo,et al.  Learning multi-label scene classification , 2004, Pattern Recognit..

[29]  José Antonio Lozano,et al.  Multi-Objective Learning of Multi-Dimensional Bayesian Classifiers , 2008, 2008 Eighth International Conference on Hybrid Intelligent Systems.

[30]  Claire Cardie,et al.  39. Opinion mining and sentiment analysis , 2014 .

[31]  Yiming Yang,et al.  Multilabel classification with meta-level features , 2010, SIGIR.

[32]  Concha Bielza,et al.  Bayesian network modeling of the consensus between experts: An application to neuron classification , 2014 .

[33]  Luis M. de Campos,et al.  A Scoring Function for Learning Bayesian Networks based on Mutual Information and Conditional Independence Tests , 2006, J. Mach. Learn. Res..

[34]  Linda C. van der Gaag,et al.  Multi-dimensional Bayesian Network Classifiers , 2006, Probabilistic Graphical Models.

[35]  Bernard Yannou,et al.  Identifying product failure rate based on a conditional Bayesian network classifier , 2011, Expert Syst. Appl..

[36]  José Antonio Lozano,et al.  Sensitivity Analysis of k-Fold Cross Validation in Prediction Error Estimation , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[37]  Wensheng Zhang,et al.  Improved heuristic equivalent search algorithm based on Maximal Information Coefficient for Bayesian Network Structure Learning , 2013, Neurocomputing.

[38]  Michael G. Madden,et al.  On the classification performance of TAN and general Bayesian networks , 2008, Knowl. Based Syst..

[39]  Iñaki Inza,et al.  Approaching Sentiment Analysis by using semi-supervised learning of multi-dimensional classifiers , 2012, Neurocomputing.

[40]  Lillian Lee,et al.  Opinion Mining and Sentiment Analysis , 2008, Found. Trends Inf. Retr..