A Sparse Structure Learning Algorithm for Gaussian Bayesian Network Identification from High-Dimensional Data

Structure learning of Bayesian Networks (BNs) is an important topic in machine learning. Driven by modern applications in genetics and brain sciences, accurate and efficient learning of large-scale BN structures from high-dimensional data becomes a challenging problem. To tackle this challenge, we propose a Sparse Bayesian Network (SBN) structure learning algorithm that employs a novel formulation involving one L1-norm penalty term to impose sparsity and another penalty term to ensure that the learned BN is a Directed Acyclic Graph (DAG)—a required property of BNs. Through both theoretical analysis and extensive experiments on 11 moderate and large benchmark networks with various sample sizes, we show that SBN leads to improved learning accuracy, scalability, and efficiency as compared with 10 existing popular BN learning algorithms. We apply SBN to a real-world application of brain connectivity modeling for Alzheimer's disease (AD) and reveal findings that could lead to advancements in AD research.

[1]  Joe Suzuki,et al.  A Construction of Bayesian Networks from Databases Based on an MDL Principle , 1993, UAI.

[2]  Luis M. de Campos,et al.  Independency relationships and learning algorithms for singly connected networks , 1998, J. Exp. Theor. Artif. Intell..

[3]  Tso-Jung Yen,et al.  Discussion on "Stability Selection" by Meinshausen and Buhlmann , 2010 .

[4]  Keith A. Johnson,et al.  Disruption of Functional Connectivity in Clinically Normal Older Adults Harboring Amyloid Burden , 2009, The Journal of Neuroscience.

[5]  David Maxwell Chickering,et al.  Optimal Structure Identification With Greedy Search , 2002, J. Mach. Learn. Res..

[6]  Mark W. Schmidt,et al.  Learning Graphical Model Structure Using L1-Regularization Paths , 2007, AAAI.

[7]  Constantin F. Aliferis,et al.  The max-min hill-climbing Bayesian network structure learning algorithm , 2006, Machine Learning.

[8]  Sebastian Thrun,et al.  Bayesian Network Induction via Local Neighborhoods , 1999, NIPS.

[9]  Constantin F. Aliferis,et al.  Generating Realistic Large Bayesian Networks by Tiling , 2006, FLAIRS.

[10]  H. Braak,et al.  Evolution of the neuropathology of Alzheimer's disease , 1996, Acta neurologica Scandinavica. Supplementum.

[11]  Ernesto Estrada,et al.  Communicability in complex networks. , 2007, Physical review. E, Statistical, nonlinear, and soft matter physics.

[12]  Nir Friedman,et al.  Learning Bayesian Network Structure from Massive Datasets: The "Sparse Candidate" Algorithm , 1999, UAI.

[13]  Luis M. de Campos,et al.  A new approach for learning belief networks using independence criteria , 2000, Int. J. Approx. Reason..

[14]  Jing Li,et al.  Knowledge discovery from observational data for process control using causal Bayesian networks , 2007 .

[15]  A. Fleisher,et al.  Altered default mode network connectivity in alzheimer's disease—A resting functional MRI and bayesian network study , 2011, Human brain mapping.

[16]  M. Greicius,et al.  Default-mode network activity distinguishes Alzheimer's disease from healthy aging: Evidence from functional MRI , 2004, Proc. Natl. Acad. Sci. USA.

[17]  Justin L. Vincent,et al.  Disruption of Large-Scale Brain Systems in Advanced Aging , 2007, Neuron.

[18]  André Elisseeff,et al.  Using Markov Blankets for Causal Structure Learning , 2008, J. Mach. Learn. Res..

[19]  Xindong Wu,et al.  A Study of Causal Discovery With Weak Links and Small Samples , 1997, IJCAI.

[20]  C. N. Liu,et al.  Approximating discrete probability distributions with dependence trees , 1968, IEEE Trans. Inf. Theory.

[21]  E. Bullmore,et al.  Brain mechanisms of successful compensation during learning in Alzheimer disease , 2006, Neurology.

[22]  Daniel L. Rubin,et al.  Network Analysis of Intrinsic Functional Brain Connectivity in Alzheimer's Disease , 2008, PLoS Comput. Biol..

[23]  Bernhard Schölkopf,et al.  Nonlinear causal discovery with additive noise models , 2008, NIPS.

[24]  Kevin B. Korb,et al.  Bayesian Artificial Intelligence , 2004, Computer science and data analysis series.

[25]  Judea Pearl,et al.  Equivalence and Synthesis of Causal Models , 1990, UAI.

[26]  Pedro Larrañaga,et al.  Structure Learning of Bayesian Networks by Genetic Algorithms: A Performance Analysis of Control Parameters , 1996, IEEE Trans. Pattern Anal. Mach. Intell..

[27]  O. Sporns,et al.  Organization, development and function of complex brain networks , 2004, Trends in Cognitive Sciences.

[28]  N. Meinshausen,et al.  Stability selection , 2008, 0809.2932.

[29]  David Heckerman,et al.  Learning Bayesian Networks: Search Methods and Experimental Results , 1995 .

[30]  G. Alexander,et al.  Longitudinal PET Evaluation of Cerebral Metabolic Decline in Dementia: A Potential Outcome Measure in Alzheimer's Disease Treatment Studies. , 2002, The American journal of psychiatry.

[31]  Constantin F. Aliferis,et al.  HITON: A Novel Markov Blanket Algorithm for Optimal Variable Selection , 2003, AMIA.

[32]  Pei Wang,et al.  Partial Correlation Estimation by Joint Sparse Regression Models , 2008, Journal of the American Statistical Association.

[33]  Wray L. Buntine A Guide to the Literature on Learning Probabilistic Networks from Data , 1996, IEEE Trans. Knowl. Data Eng..

[34]  Michal Linial,et al.  Using Bayesian Networks to Analyze Expression Data , 2000, J. Comput. Biol..

[35]  Christopher Meek,et al.  Causal inference and causal explanation with background knowledge , 1995, UAI.

[36]  Martin J. McKeown,et al.  Dynamic Bayesian network modeling of fMRI: A comparison of group-analysis methods , 2008, NeuroImage.

[37]  Pedro Larrañaga,et al.  Learning Bayesian network structures by searching for the best ordering with genetic algorithms , 1996, IEEE Trans. Syst. Man Cybern. Part A.

[38]  E. Ziegel Permutation, Parametric, and Bootstrap Tests of Hypotheses (3rd ed.) , 2005 .

[39]  Tom Burr,et al.  Causation, Prediction, and Search , 2003, Technometrics.

[40]  Y. Stern Cognitive Reserve and Alzheimer Disease , 2006, Alzheimer disease and associated disorders.

[41]  N. Tzourio-Mazoyer,et al.  Automated Anatomical Labeling of Activations in SPM Using a Macroscopic Anatomical Parcellation of the MNI MRI Single-Subject Brain , 2002, NeuroImage.

[42]  Xin-She Yang,et al.  Introduction to Algorithms , 2021, Nature-Inspired Optimization Algorithms.

[43]  Karl J. Friston Functional and effective connectivity in neuroimaging: A synthesis , 1994 .

[44]  Gregory F. Cooper,et al.  A Bayesian method for the induction of probabilistic networks from data , 1992, Machine Learning.

[45]  H. Braak,et al.  Staging of Alzheimer-related cortical destruction. , 1997, International psychogeriatrics.

[46]  W. Klunk,et al.  Imaging brain amyloid in Alzheimer's disease with Pittsburgh Compound‐B , 2004, Annals of neurology.

[47]  Constantin F. Aliferis,et al.  Towards Principled Feature Selection: Relevancy, Filters and Wrappers , 2003 .

[48]  Andrei S. Rodin,et al.  Mining genetic epidemiology data with Bayesian networks I: Bayesian networks and example application (plasma apoE levels) , 2005, Bioinform..

[49]  Wai Lam,et al.  LEARNING BAYESIAN BELIEF NETWORKS: AN APPROACH BASED ON THE MDL PRINCIPLE , 1994, Comput. Intell..

[50]  Robert Castelo,et al.  On Inclusion-Driven Learning of Bayesian Networks , 2003, J. Mach. Learn. Res..

[51]  Patricia A. Reuter-Lorenz,et al.  A split-brain model of Alzheimer's disease? Behavioral evidence for comparable intra and interhemispheric decline , 2005, Neuropsychologia.

[52]  B. Marcot,et al.  Using Bayesian belief networks to evaluate fish and wildlife population viability under land management alternatives from an environmental impact statement , 2001 .

[53]  Alan Bundy,et al.  Symbolic and Quantitative Approaches to Reasoning and Uncertainty , 1993 .

[54]  Kuncheng Li,et al.  Altered functional connectivity in early Alzheimer's disease: A resting‐state fMRI study , 2007, Human brain mapping.

[55]  David Heckerman,et al.  A Tutorial on Learning with Bayesian Networks , 1999, Innovations in Bayesian Networks.

[56]  Remco R. Bouckaert,et al.  Probalistic Network Construction Using the Minimum Description Length Principle , 1993, ECSQARU.

[57]  Nir Friedman,et al.  Being Bayesian About Network Structure. A Bayesian Approach to Structure Discovery in Bayesian Networks , 2004, Machine Learning.

[58]  S. DeKosky,et al.  Post-mortem correlates of in vivo PiB-PET amyloid imaging in a typical case of Alzheimer's disease , 2008, Brain : a journal of neurology.

[59]  E. Bigio,et al.  Lateralization on Neuroimaging Does Not Differentiate Frontotemporal Lobar Degeneration from Alzheimer’s Disease , 2004, Dementia and Geriatric Cognitive Disorders.

[60]  R. Tibshirani Regression Shrinkage and Selection via the Lasso , 1996 .

[61]  M. Kenward,et al.  An Introduction to the Bootstrap , 2007 .

[62]  Nir Friedman,et al.  Learning Bayesian Networks with Local Structure , 1996, UAI.

[63]  R. Tibshirani,et al.  PATHWISE COORDINATE OPTIMIZATION , 2007, 0708.1485.

[64]  P. Good Permutation, Parametric, and Bootstrap Tests of Hypotheses , 2005 .

[65]  David J. C. MacKay,et al.  Information Theory, Inference, and Learning Algorithms , 2004, IEEE Transactions on Information Theory.

[66]  David Maxwell Chickering,et al.  Learning Bayesian Networks: The Combination of Knowledge and Statistical Data , 1994, Machine Learning.

[67]  Luis M. de Campos,et al.  Searching for Bayesian Network Structures in the Space of Restricted Acyclic Partially Directed Graphs , 2011, J. Artif. Intell. Res..

[68]  Xue-wen Chen,et al.  Improving Bayesian Network Structure Learning with Mutual Information-Based Node Ordering in the K2 Algorithm , 2008, IEEE Transactions on Knowledge and Data Engineering.

[69]  Juan Zhou,et al.  Learning effective brain connectivity with dynamic Bayesian networks , 2007, NeuroImage.

[70]  Mark E. Borsuk,et al.  A Bayesian network of eutrophication models for synthesis, prediction, and uncertainty analysis , 2004 .