Reconstructing ecological networks with hierarchical Bayesian regression and Mondrian processes

Ecological systems consist of complex sets of interactions among species and their environment, the understanding of which has implications for predicting environmental response to perturbations such as invading species and climate change. However, the revelation of these interactions is not straightforward, nor are the interactions necessarily stable across space. Machine learning can enable the recovery of such complex, spatially varying interactions from relatively easily obtained species abundance data. Here, we describe a novel Bayesian regression and Mondrian process model (BRAMP) for reconstructing species interaction networks from observed field data. BRAMP enables robust inference of species interactions considering autocorrelation in species abundances and allowing for variation in the interactions across space. We evaluate the model on spatially explicit simulated data, produced using a trophic niche model combined with stochastic population dynamics. We compare the model’s performance against L1-penalized sparse regression (LASSO) and non-linear Bayesian networks with the BDe scoring scheme. Finally, we apply BRAMP to real ecological data.

[1]  J Memmott,et al.  Infiltration of a Hawaiian Community by Introduced Biological Control Agents , 2001, Science.

[2]  D. Haydon,et al.  Alternative stable states in ecology , 2003 .

[3]  P. Green Reversible jump Markov chain Monte Carlo computation and Bayesian model determination , 1995 .

[4]  Terry V. Callaghan,et al.  The balance between positive and negative plant interactions and its relationship to environmental gradients : a model , 1998 .

[5]  Alexander J. Hartemink,et al.  Principled computational methods for the validation discovery of genetic regulatory networks , 2001 .

[6]  W. Hagemeijer,et al.  The EBCC Atlas of European Breeding Birds , 1997 .

[7]  Mark Goadrich,et al.  The relationship between Precision-Recall and ROC curves , 2006, ICML.

[8]  Colin M. Beale,et al.  Are richness patterns of common and rare species equally well explained by environmental variables , 2011 .

[9]  Mark A. Girolami,et al.  Bayesian ranking of biochemical system models , 2008, Bioinform..

[10]  H. Charles J. Godfray,et al.  A positive trait-mediated indirect effect involving the natural enemies of competing herbivores , 2009, Oecologia.

[11]  Quentin Paynter,et al.  The invertebrate fauna on broom, Cytisus scoparius, in two native and two exotic habitats , 2000 .

[12]  J. Dahlgren,et al.  Alternative regression methods are not considered in Murtaugh (2009) or by ecologists in general. , 2010, Ecology letters.

[13]  Marcel J. T. Reinders,et al.  Least absolute regression network analysis of the murine osteoblast differentiation network , 2006, Bioinform..

[14]  Kathryn B. Laskey,et al.  Nonparametric Bayesian Co-clustering Ensembles , 2011, SDM.

[15]  Mark A. Girolami,et al.  Bayesian ranking of biochemical system models , 2008, Bioinform..

[16]  J. Bruno,et al.  Inclusion of facilitation into ecological theory , 2003 .

[17]  S. Carpenter,et al.  Global Consequences of Land Use , 2005, Science.

[18]  S. Peacor,et al.  A REVIEW OF TRAIT-MEDIATED INDIRECT INTERACTIONS IN ECOLOGICAL COMMUNITIES , 2003 .

[19]  N. D. Clarke,et al.  Towards a Rigorous Assessment of Systems Biology Models: The DREAM3 Challenges , 2010, PloS one.

[20]  R. Lande,et al.  Stochastic Population Dynamics in Ecology and Conservation , 2003 .

[21]  Christophe Andrieu,et al.  Bayesian curve fitting using MCMC with applications to signal segmentation , 2002, IEEE Trans. Signal Process..

[22]  Yves Grandvalet Least Absolute Shrinkage is Equivalent to Quadratic Penalization , 1998 .

[23]  Colin M Beale,et al.  Revealing ecological networks using Bayesian network inference algorithms. , 2010, Ecology.

[24]  Marco Grzegorczyk,et al.  Bayesian regularization of non-homogeneous dynamic Bayesian networks by globally coupling interaction parameters , 2012, AISTATS.

[25]  Antti Honkela,et al.  Model-based method for transcription factor target identification with limited data , 2010, Proceedings of the National Academy of Sciences.

[26]  R. Tibshirani Regression Shrinkage and Selection via the Lasso , 1996 .

[27]  Alexander J. Hartemink,et al.  Learning Non-Stationary Dynamic Bayesian Networks , 2010, J. Mach. Learn. Res..

[28]  Neo D. Martinez,et al.  Simple rules yield complex food webs , 2000, Nature.

[29]  Paul P. Wang,et al.  Advances to Bayesian network inference for generating causal networks from observational biological data , 2004, Bioinform..

[30]  Dirk Husmeier,et al.  Inferring species interaction networks from species abundance data: A comparative evaluation of various statistical and machine learning methods , 2010, Ecol. Informatics.

[31]  Michal Linial,et al.  Using Bayesian Networks to Analyze Expression Data , 2000, J. Comput. Biol..

[32]  Jing Yu,et al.  Computational Inference of Neural Information Flow Networks , 2006, PLoS Comput. Biol..

[33]  Michael P. H. Stumpf,et al.  Statistical inference of the time-varying structure of gene-regulation networks , 2010, BMC Systems Biology.

[34]  Jack J. Lennon,et al.  Red-shifts and red herrings in geographical ecology , 2000 .

[35]  Christophe Andrieu,et al.  Joint Bayesian model selection and estimation of noisy sinusoids via reversible jump MCMC , 1999, IEEE Trans. Signal Process..

[36]  Neo D. Martinez,et al.  Network structure and biodiversity loss in food webs: robustness increases with connectance , 2002, Ecology Letters.

[37]  Yee Whye Teh,et al.  The Mondrian Process , 2008, NIPS.