BNFinder2: Faster Bayesian network learning and Bayesian classification

Summary: Bayesian Networks (BNs) are versatile probabilistic models applicable to many different biological phenomena. In biological applications the structure of the network is usually unknown and needs to be inferred from experimental data. BNFinder is a fast software implementation of an exact algorithm for finding the optimal structure of the network given a number of experimental observations. Its second version, presented in this article, represents a major improvement over the previous version. The improvements include (i) a parallelized learning algorithm leading to an order of magnitude speed-ups in BN structure learning time; (ii) inclusion of an additional scoring function based on mutual information criteria; (iii) possibility of choosing the resulting network specificity based on statistical criteria and (iv) a new module for classification by BNs, including cross-validation scheme and classifier quality measurements with receiver operator characteristic scores. Availability and implementation: BNFinder2 is implemented in python and freely available under the GNU general public license at the project Web site https://launchpad.net/bnfinder, together with a user’s manual, introductory tutorial and supplementary methods. Contact: dojer@mimuw.edu.pl or bartek@mimuw.edu.pl Supplementary information: Supplementary data are available at Bioinformatics online.

[1]  Michael A. Beer,et al.  Predicting Gene Expression from Sequence , 2004, Cell.

[2]  Xuan Vinh Nguyen,et al.  GlobalMIT: learning globally optimal dynamic bayesian network with the mutual information test criterion , 2011, Bioinform..

[3]  Norbert Dojer,et al.  Learning Bayesian Networks Does Not Have to Be NP-Hard , 2006, MFCS.

[4]  Thomas Lengauer,et al.  ROCR: visualizing classifier performance in R , 2005, Bioinform..

[5]  M. Gerstein,et al.  A Bayesian Networks Approach for Predicting Protein-Protein Interactions from Genomic Data , 2003, Science.

[6]  Guillaume J. Filion,et al.  Bayesian network analysis of targeting interactions in chromatin. , 2010, Genome research.

[7]  Bartek Wilczynski,et al.  BNFinder: exact and efficient method for learning Bayesian networks , 2008, Bioinform..

[8]  David Maxwell Chickering,et al.  Learning Bayesian Networks is NP-Complete , 2016, AISTATS.

[9]  Andrew J. Bulpitt,et al.  A Primer on Learning in Bayesian Networks for Computational Biology , 2007, PLoS Comput. Biol..

[10]  Henry Tirri,et al.  Classifier Learning with Supervised Marginal Likelihood , 2001, UAI.

[11]  Bartek Wilczynski,et al.  Predicting Spatial and Temporal Gene Expression Using an Integrative Model of Transcription Factor Occupancy and Chromatin State , 2012, PLoS Comput. Biol..

[12]  Bozena Kaminska,et al.  Comparative analysis of cis-regulation following stroke and seizures in subspaces of conserved eigensystems , 2010, BMC Systems Biology.

[13]  Bartek Wilczynski,et al.  Applying dynamic Bayesian networks to perturbed gene expression data , 2006, BMC Bioinformatics.

[14]  E. Furlong,et al.  Tissue-specific analysis of chromatin state identifies temporal signatures of enhancer activity during embryonic development , 2012, Nature Genetics.