Multiple Hypothesis Testing and Quasi Essential Graph for Comparing Two Sets of Bayesian Networks

In machine learning, graphical models like Bayesian networks are one of important visualization tools that can be learned from data to represent pictorially a complex system. In order to compare two complex systems (or one complex system functioning in two different contexts), one usually compares directly their representative graphs. However, with small sample size data, it is hard to learn the graph that represents precisely the system. That's why ensemble methods (e.g. Bootstrapping, evolutionary algorithm, etc...) are proposed to learn from data of each system a set of graphs that represents more precisely this system. Then, for comparing two systems, one needs a mechanism to compare two sets of graphs. We propose in this work an approach based on multiple hypothesis testing and quasi essential graph (QEG) to compare two sets of Bayesian networks.

[1]  Y. Benjamini,et al.  Controlling the false discovery rate: a practical and powerful approach to multiple testing , 1995 .

[2]  Philippe Leray,et al.  Summarizing and visualizing a set of bayesian networks with quasi essential graphs , 2011 .

[3]  Tong Wang,et al.  A heuristic method for learning Bayesian networks using discrete particle swarm optimization , 2010, Knowledge and Information Systems.

[4]  Hubert Cardot,et al.  Two evolutionary methods for learning Bayesian network structures , 2006, 2006 International Conference on Computational Intelligence and Security.

[5]  Sushmita Mitra,et al.  Applications of Fuzzy Sets Theory, 7th International Workshop on Fuzzy Logic and Applications, WILF 2007, Camogli, Italy, July 7-10, 2007, Proceedings , 2007, WILF.

[6]  Vincent Frouin,et al.  Learning Transcriptional Regulatory Networks with Evolutionary Algorithms Enhanced with Niching , 2007, WILF.

[7]  S. Dudoit,et al.  Multiple Hypothesis Testing in Microarray Experiments , 2003 .

[8]  Pedro Larrañaga,et al.  Structure Learning of Bayesian Networks by Genetic Algorithms: A Performance Analysis of Control Parameters , 1996, IEEE Trans. Pattern Anal. Mach. Intell..

[9]  S. Holm A Simple Sequentially Rejective Multiple Test Procedure , 1979 .

[10]  Nir Friedman,et al.  Data Analysis with Bayesian Networks: A Bootstrap Approach , 1999, UAI.

[11]  D. Madigan,et al.  Bayesian model averaging and model selection for markov equivalence classes of acyclic digraphs , 1996 .

[12]  Andrei S. Rodin,et al.  Mining genetic epidemiology data with Bayesian networks I: Bayesian networks and example application (plasma apoE levels) , 2005, Bioinform..

[13]  H. Abdi The Bonferonni and Šidák Corrections for Multiple Comparisons , 2006 .

[14]  D. Opitz,et al.  Popular Ensemble Methods: An Empirical Study , 1999, J. Artif. Intell. Res..

[15]  David Maxwell Chickering,et al.  Learning Equivalence Classes of Bayesian Network Structures , 1996, UAI.