Objective Bayesian Search of Gaussian Directed Acyclic Graphical Models for Ordered Variables with Non‐Local Priors

Directed acyclic graphical (DAG) models are increasingly employed in the study of physical and biological systems to model direct influences between variables. Identifying the graph from data is a challenging endeavor, which can be more reasonably tackled if the variables are assumed to satisfy a given ordering; in this case we simply have to estimate the presence or absence of each potential edge. Working under this assumption, we propose an objective Bayesian method for searching the space of Gaussian DAG models, which provides a rich output from minimal input. We base our analysis on non-local parameter priors, which are especially suited for learning sparse graphs, because they allow a faster learning rate, relative to ordinary local parameter priors, when the true unknown sampling distribution belongs to a simple model. We implement an efficient stochastic search algorithm, which deals effectively with data sets having sample size smaller than the number of variables, and apply our method to a variety of simulated and real data sets. Our approach compares favorably, in terms of the ROC curve for edge hit rate versus false alarm rate, to current state-of-the-art frequentist methods relying on the assumption of ordered variables; under this assumption it exhibits a competitive advantage over the PC-algorithm, which can be considered as a frequentist benchmark for unordered variables. Importantly, we find that our method is still at an advantage for learning the skeleton of the DAG, when the ordering of the variables is only moderately mis-specified. Prospectively, our method could be coupled with a strategy to learn the order of the variables, thus dropping the known ordering assumption.

[1]  James G. Scott,et al.  Feature-Inclusion Stochastic Search for Gaussian Graphical Models , 2008 .

[2]  James G. Scott,et al.  Bayes and empirical-Bayes multiplicity adjustment in the variable-selection problem , 2010, 1011.2333.

[3]  N. Meinshausen,et al.  High-dimensional graphs and variable selection with the Lasso , 2006, math/0608017.

[4]  N. Wermuth,et al.  Sequences of regressions and their independences , 2011, 1110.1986.

[5]  Guido Consonni,et al.  Moment priors for Bayesian model choice with applications to directed acyclic graphs , 2011 .

[6]  Alexandre d'Aspremont,et al.  Model Selection Through Sparse Max Likelihood Estimation Model Selection Through Sparse Maximum Likelihood Estimation for Multivariate Gaussian or Binary Data , 2022 .

[7]  Pierre Baldi,et al.  Assessing the accuracy of prediction algorithms for classification: an overview , 2000, Bioinform..

[8]  J. Forster,et al.  Enhanced objective Bayesian testing for the equality of two proportions , 2010 .

[9]  Guido Consonni,et al.  Objective Bayes Factors for Gaussian Directed Acyclic Graphical Models , 2012 .

[10]  D. Edwards Introduction to graphical modelling , 1995 .

[11]  M. Drton,et al.  Multiple Testing and Error Control in Gaussian Graphical Model Selection , 2005, math/0508267.

[12]  W. Wong,et al.  Learning Causal Bayesian Network Structures From Experimental Data , 2008 .

[13]  Alexandre d'Aspremont,et al.  Model Selection Through Sparse Maximum Likelihood Estimation , 2007, ArXiv.

[14]  A. O'Hagan,et al.  Fractional Bayes factors for model comparison , 1995 .

[15]  David Heckerman,et al.  Parameter Priors for Directed Acyclic Graphical Models and the Characteriration of Several Probability Distributions , 1999, UAI.

[16]  James G. Scott,et al.  Objective Bayesian model selection in Gaussian graphical models , 2009 .

[17]  A. Dawid,et al.  Posterior Model Probabilities , 2011 .

[18]  M. Drton,et al.  Model selection for Gaussian concentration graphs , 2004 .

[19]  Elías Moreno,et al.  Bayes factors for intrinsic and fractional priors in nested models. Bayesian robustness , 1997 .

[20]  Peter Bühlmann,et al.  Estimating High-Dimensional Directed Acyclic Graphs with the PC-Algorithm , 2007, J. Mach. Learn. Res..

[21]  V. Johnson,et al.  Bayesian Model Selection in High-Dimensional Settings , 2012, Journal of the American Statistical Association.

[22]  Rainer Spang,et al.  Inferring cellular networks – a review , 2007, BMC Bioinformatics.

[23]  Luis R. Pericchi,et al.  Model Selection and Hypothesis Testing based on Objective Probabilities and Bayes Factors , 2005 .

[24]  Kevin P. Murphy,et al.  Bayesian structure learning using dynamic programming and MCMC , 2007, UAI.

[25]  K. Sachs,et al.  Causal Protein-Signaling Networks Derived from Multiparameter Single-Cell Data , 2005, Science.

[26]  Ali Shojaie,et al.  Penalized likelihood methods for estimation of sparse high-dimensional directed acyclic graphs. , 2009, Biometrika.

[27]  J. Berger,et al.  Optimal predictive model selection , 2004, math/0406464.

[28]  Nir Friedman,et al.  Being Bayesian About Network Structure. A Bayesian Approach to Structure Discovery in Bayesian Networks , 2004, Machine Learning.

[29]  Michael I. Jordan,et al.  Probabilistic Networks and Expert Systems , 1999 .

[30]  V. Johnson,et al.  On the use of non‐local prior densities in Bayesian hypothesis tests , 2010 .

[31]  Daphne Koller,et al.  Ordering-Based Search: A Simple and Effective Algorithm for Learning Bayesian Networks , 2005, UAI.

[32]  James O. Berger,et al.  Posterior model probabilities via path‐based pairwise priors , 2005 .

[33]  P. Green,et al.  Decomposable graphical Gaussian model determination , 1999 .