Predicting microbial interactions from time series data with network information

The evolution of biotechnological knowledge poses some new challenges to study microbial interactions. Vector autoregressive (VAR) model was proved to be an efficient approach to infer dynamic interactions in biological systems. However, high-throughput metagenomics or 16S-rRNA sequencing data is high dimension, which means that the number of covariates is much larger than the number of observations. Reducing the dimension of data or selecting suitable covariates became a critical component VAR modelling. In this paper, we develop a graph-regularised vector autoregressive model incorporating network information to infer causal relationships among microbial entities. The method not only considers the signs of the network connections among any two covariates, but also constructs a network weighted matrix by microbial topology information. The coordinate descent algorithm for estimating model parameters improves the accuracy of prediction. The experimental results on a time series data set of human gut microbiomes indicate that the proposed approach has better performance than other VAR-based models with penalty functions.

[1]  K. Lange,et al.  Coordinate descent algorithms for lasso penalized regression , 2008, 0803.3876.

[2]  Kirill Peskov,et al.  Agent Based Modeling of Human Gut Microbiome Interactions and Perturbations , 2016, PloS one.

[3]  H. Zou,et al.  Regularization and variable selection via the elastic net , 2005 .

[4]  Steven C. Wheelwright,et al.  Forecasting methods and applications. , 1979 .

[5]  Yan Wang,et al.  Weighted fusion regularisation and predicting microbial interactions with vector autoregressive model , 2015, Int. J. Data Min. Bioinform..

[6]  Niranjan Nagarajan,et al.  Predicting microbial interactions through computational approaches. , 2016, Methods.

[7]  Jian Huang,et al.  The Sparse Laplacian Shrinkage Estimator for High-Dimensional Regression. , 2011, Annals of statistics.

[8]  J. Raes,et al.  Microbial interactions: from networks to models , 2012, Nature Reviews Microbiology.

[9]  N. Ravin,et al.  Metagenomic Analysis of the Dynamic Changes in the Gut Microbiome of the Participants of the MARS-500 Experiment, Simulating Long Term Space Flight , 2013, Acta naturae.

[10]  Jonathan Friedman,et al.  Inferring Correlation Networks from Genomic Survey Data , 2012, PLoS Comput. Biol..

[11]  Hongzhe Li,et al.  VARIABLE SELECTION AND REGRESSION ANALYSIS FOR GRAPH-STRUCTURED COVARIATES WITH AN APPLICATION TO GENOMICS. , 2010, The annals of applied statistics.

[12]  Curtis Huttenhower,et al.  Microbial Co-occurrence Relationships in the Human Microbiome , 2012, PLoS Comput. Biol..

[13]  S. Horvath,et al.  A General Framework for Weighted Gene Co-Expression Network Analysis , 2005, Statistical applications in genetics and molecular biology.

[14]  Xingpeng Jiang,et al.  Predicting Microbial Interactions Using Vector Autoregressive Model with Graph Regularization , 2015, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[15]  Jianqing Fan,et al.  Variable Selection via Nonconcave Penalized Likelihood and its Oracle Properties , 2001 .

[16]  W. Z. Lidicker,et al.  A Clarification of Interactions in Ecological Systems , 1979 .

[17]  R. Tibshirani Regression Shrinkage and Selection via the Lasso , 1996 .

[18]  Z. John Daye,et al.  Shrinkage and model selection with correlated variables via weighted fusion , 2009, Comput. Stat. Data Anal..

[19]  Rui Ma,et al.  Exploring the Dynamic Core Microbiome of Plaque Microbiota during Head-and-Neck Radiotherapy Using Pyrosequencing , 2013, PloS one.

[20]  Trevor Hastie,et al.  Regularization Paths for Generalized Linear Models via Coordinate Descent. , 2010, Journal of statistical software.

[21]  D. Relman,et al.  Incomplete recovery and individualized responses of the human distal gut microbiota to repeated antibiotic perturbation , 2010, Proceedings of the National Academy of Sciences.

[22]  Beverley D. Causey,et al.  A Frequentist Analysis of a Class of Ridge Regression Estimators , 1980 .

[23]  Georg K Gerber,et al.  The dynamic microbiome , 2014, FEBS letters.

[24]  R. Tibshirani,et al.  PATHWISE COORDINATE OPTIMIZATION , 2007, 0708.1485.

[25]  Xingpeng Jiang,et al.  Microbiome Data Representation by Joint Nonnegative Matrix Factorization with Laplacian Regularization , 2017, IEEE/ACM Transactions on Computational Biology and Bioinformatics.