A Bayesian approach for structure learning in oscillating regulatory networks

Motivation: Oscillations lie at the core of many biological processes, from the cell cycle, to circadian oscillations and developmental processes. Time-keeping mechanisms are essential to enable organisms to adapt to varying conditions in environmental cycles, from day/night to seasonal. Transcriptional regulatory networks are one of the mechanisms behind these biological oscillations. However, while identifying cyclically expressed genes from time series measurements is relatively easy, determining the structure of the interaction network underpinning the oscillation is a far more challenging problem. Results: Here, we explicitly leverage the oscillatory nature of the transcriptional signals and present a method for reconstructing network interactions tailored to this special but important class of genetic circuits. Our method is based on projecting the signal onto a set of oscillatory basis functions using a Discrete Fourier Transform. We build a Bayesian Hierarchical model within a frequency domain linear model in order to enforce sparsity and incorporate prior knowledge about the network structure. Experiments on real and simulated data show that the method can lead to substantial improvements over competing approaches if the oscillatory assumption is met, and remains competitive also in cases it is not. Availability: DSS, experiment scripts and data are available at http://homepages.inf.ed.ac.uk/gsanguin/DSS.zip. Contact: d.trejo-banos@sms.ed.ac.uk Supplementary information: Supplementary data are available at Bioinformatics online.

[1]  Ting Wang,et al.  An improved map of conserved regulatory sites for Saccharomyces cerevisiae , 2006, BMC Bioinformatics.

[2]  Dirk Husmeier,et al.  ODE parameter inference using adaptive gradient matching with Gaussian processes , 2013, AISTATS.

[3]  Michael Q. Zhang,et al.  SCPD: a promoter database of the yeast Saccharomyces cerevisiae , 1999, Bioinform..

[4]  James Sharpe,et al.  An atlas of gene regulatory networks reveals multiple three-gene mechanisms for interpreting morphogen gradients , 2010, Molecular systems biology.

[5]  Bryan J Venters,et al.  A comprehensive genomic binding map of gene and chromatin regulatory proteins in Saccharomyces. , 2011, Molecular cell.

[6]  D. Floreano,et al.  Revealing strengths and weaknesses of methods for gene network inference , 2010, Proceedings of the National Academy of Sciences.

[7]  E. R. Morrissey,et al.  Inferring the time-invariant topology of a nonlinear sparse gene regulatory network using fully Bayesian spline autoregression. , 2011, Biostatistics.

[8]  Se-Ran Jun,et al.  Alignment-free genome comparison with feature frequency profiles (FFP) and optimal resolutions , 2009, Proceedings of the National Academy of Sciences.

[9]  D. Husmeier,et al.  Dynamic Bayesian networks in molecular plant science: inferring gene regulatory networks from multiple gene expression time series , 2012, Euphytica.

[10]  Yiming Yang,et al.  Inferring regulatory networks using a hierarchical Bayesian graphical Gaussian model , 2005 .

[11]  William Stafford Noble,et al.  The Forkhead transcription factor Hcm1 regulates chromosome segregation genes and fills the S-phase gap in the transcriptional circuitry of the cell cycle. , 2006, Genes & development.

[12]  Takeshi Mizuno,et al.  Data assimilation constrains new connections and components in a complex, eukaryotic circadian clock model , 2010, Molecular Systems Biology.

[13]  Katherine C. Chen,et al.  Integrative analysis of cell cycle control in budding yeast. , 2004, Molecular biology of the cell.

[14]  J. Skotheim,et al.  Cell Size Control in Yeast , 2012, Current Biology.

[15]  C. Wittenberg,et al.  Topology and Control of the Cell-Cycle-Regulated Transcriptional Circuitry , 2014, Genetics.

[16]  Neil Dalchau,et al.  Understanding biological timing using mechanistic and black-box models. , 2012, The New phytologist.

[17]  D. Stillman,et al.  Dancing the cell cycle two-step: regulation of yeast G1-cell-cycle genes by chromatin structure. , 2013, Trends in biochemical sciences.

[18]  Christophe Ambroise,et al.  Statistical Applications in Genetics and Molecular Biology Weighted-LASSO for Structured Network Inference from Time Course Data , 2011 .

[19]  Thomas R. Ioerger,et al.  A Hidden Markov Model for identifying essential and growth-defect regions in bacterial genomes from transposon insertion sequencing data , 2013, BMC Bioinformatics.

[20]  R. Tibshirani Regression Shrinkage and Selection via the Lasso , 1996 .

[21]  J. S. Rao,et al.  Spike and slab variable selection: Frequentist and Bayesian strategies , 2005, math/0505633.

[22]  J. Bähler Cell-cycle control of gene expression in budding and fission yeast. , 2005, Annual review of genetics.

[23]  P. Hardin,et al.  Circadian rhythms from multiple oscillators: lessons from diverse organisms , 2005, Nature Reviews Genetics.

[24]  Eric L. Weiss,et al.  Mitotic Exit and Separation of Mother and Daughter Cells , 2012, Genetics.

[25]  Paul E. Brown,et al.  Quantitative analysis of regulatory flexibility under changing environmental conditions , 2010, Molecular systems biology.

[26]  Michael Ruogu Zhang,et al.  Comprehensive identification of cell cycle-regulated genes of the yeast Saccharomyces cerevisiae by microarray hybridization. , 1998, Molecular biology of the cell.

[27]  C J Oates,et al.  Network Inference and Biological Dynamics. , 2011, The annals of applied statistics.

[28]  Richard Bonneau,et al.  Robust data-driven incorporation of prior knowledge into the inference of dynamic regulatory networks , 2013, Bioinform..

[29]  P. Geurts,et al.  Inferring Regulatory Networks from Expression Data Using Tree-Based Methods , 2010, PloS one.

[30]  A. Shevchenko,et al.  Forkhead transcription factors, Fkh1p and Fkh2p, collaborate with Mcm1p to control transcription required for M-phase , 2000, Current Biology.

[31]  Jean-Philippe Vert,et al.  TIGRESS: Trustful Inference of Gene REgulation using Stability Selection , 2012, BMC Systems Biology.

[32]  A. Millar,et al.  The clock gene circuit in Arabidopsis includes a repressilator with additional feedback loops , 2012, Molecular systems biology.

[33]  I. Glad,et al.  Weighted Lasso with Data Integration , 2011, Statistical applications in genetics and molecular biology.

[34]  D. Higgins,et al.  Fast, scalable generation of high-quality protein multiple sequence alignments using Clustal Omega , 2011, Molecular systems biology.

[35]  J. Collins,et al.  Large-Scale Mapping and Validation of Escherichia coli Transcriptional Regulation from a Compendium of Expression Profiles , 2007, PLoS biology.

[36]  David Lydall,et al.  NDD1, a High-Dosage Suppressor ofcdc28-1N, Is Essential for Expression of a Subset of Late-S-Phase-Specific Genes in Saccharomyces cerevisiae , 1999, Molecular and Cellular Biology.

[37]  Michael Q. Zhang Inferring Gene Regulatory Networks , 2008 .

[38]  Simon Tavaré,et al.  Fkh1 and Fkh2 Bind Multiple Chromosomal Elements in the S. cerevisiae Genome with Distinct Specificities and Cell Cycle Dynamics , 2014, PloS one.

[39]  Chris Wiggins,et al.  ARACNE: An Algorithm for the Reconstruction of Gene Regulatory Networks in a Mammalian Cellular Context , 2004, BMC Bioinformatics.

[40]  Ralf Zimmer,et al.  Inferring gene regulatory networks by ANOVA , 2012, Bioinform..

[41]  Richard Bonneau,et al.  The Inferelator: an algorithm for learning parsimonious regulatory networks from systems-biology data sets de novo , 2006, Genome Biology.

[42]  Joshua E. S. Socolar,et al.  Global control of cell-cycle transcription by coupled CDK and network oscillators , 2008, Nature.

[43]  Dhundy Bastola,et al.  Alignment-free genetic sequence comparisons: a review of recent approaches by word analysis , 2014, Briefings Bioinform..

[44]  Guido Sanguinetti,et al.  Combining tree-based and dynamical systems for the inference of gene regulatory networks , 2015, Bioinform..

[45]  C Robertson McClung,et al.  Provided for Non-commercial Research and Educational Use Only. Not for Reproduction, Distribution or Commercial Use. the Genetics of Plant Clocks , 2022 .

[46]  B. Mirkin Additive clustering and qualitative factor analysis methods for similarity matrices , 1989 .

[47]  Q. Ouyang,et al.  The yeast cell-cycle network is robustly designed. , 2003, Proceedings of the National Academy of Sciences of the United States of America.