Learning mixed graphical models from data with p larger than n

Structure learning of Gaussian graphical models is an extensively studied problem in the classical multivariate setting where the sample size n is larger than the number of random variables p, as well as in the more challenging setting when p>>n. However, analogous approaches for learning the structure of graphical models with mixed discrete and continuous variables when p>>n remain largely unexplored. Here we describe a statistical learning procedure for this problem based on limited-order correlations and assess its performance with synthetic and real data.

[1]  Nicholas C. Wormald,et al.  Generating Random Regular Graphs Quickly , 1999, Combinatorics, Probability and Computing.

[2]  Robert Gentleman,et al.  Using GOstats to test gene lists for GO term association , 2007, Bioinform..

[3]  Victor Chubukov,et al.  Dynamics and Design Principles of a Basic Regulatory Architecture Controlling Metabolic Pathways , 2008, PLoS Biology.

[4]  Frank Harary,et al.  Graph Theory , 2016 .

[5]  Steffen L. Lauritzen,et al.  Graphical models in R , 1996 .

[6]  Robert Castelo,et al.  A Robust Procedure For Gaussian Graphical Model Search From Microarray Data With p Larger Than n , 2006, J. Mach. Learn. Res..

[7]  Rachel B. Brem,et al.  The landscape of genetic complexity across 5,700 gene expression traits in yeast. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[8]  Calyampudi R. Rao,et al.  Linear statistical inference and its applications , 1965 .

[9]  Michael A. West,et al.  Covariance decomposition in undirected Gaussian graphical models , 2005 .

[10]  Robert Castelo,et al.  Reverse Engineering Molecular Regulatory Networks from Microarray Data with qp-Graphs , 2009, J. Comput. Biol..

[11]  Calyampudi R. Rao,et al.  Linear Statistical Inference and Its Applications. , 1975 .

[12]  K. Broman,et al.  A Guide to QTL Mapping with R/qtl , 2009 .

[13]  N. Wermuth,et al.  Graphical Models for Associations between Variables, some of which are Qualitative and some Quantitative , 1989 .

[14]  Nir Friedman,et al.  Learning Module Networks , 2002, J. Mach. Learn. Res..

[15]  R. Tibshirani,et al.  Sparse inverse covariance estimation with the graphical lasso. , 2008, Biostatistics.

[16]  David Edwards,et al.  Selecting high-dimensional mixed graphical models using minimal AIC or BIC forests , 2010, BMC Bioinformatics.

[17]  Robert Castelo,et al.  On Inclusion-Driven Learning of Bayesian Networks , 2003, J. Mach. Learn. Res..

[18]  David Maxwell Chickering,et al.  Optimal Structure Identification With Greedy Search , 2002, J. Mach. Learn. Res..

[19]  M. Rockman,et al.  Reverse engineering the genotype–phenotype map with natural genetic variation , 2008, Nature.

[20]  D. Edwards Introduction to graphical modelling , 1995 .

[21]  Jingyuan Fu,et al.  Genetical Genomics: Spotlight on QTL Hotspots , 2008, PLoS genetics.