Robust non-linear differential equation models of gene expression evolution across Drosophila development

BackgroundThis paper lies in the context of modeling the evolution of gene expression away from stationary states, for example in systems subject to external perturbations or during the development of an organism. We base our analysis on experimental data and proceed in a top-down approach, where we start from data on a system's transcriptome, and deduce rules and models from it without a priori knowledge. We focus here on a publicly available DNA microarray time series, representing the transcriptome of Drosophila across evolution from the embryonic to the adult stage.ResultsIn the first step, genes were clustered on the basis of similarity of their expression profiles, measured by a translation-invariant and scale-invariant distance that proved appropriate for detecting transitions between development stages. Average profiles representing each cluster were computed and their time evolution was analyzed using coupled differential equations. A linear and several non-linear model structures involving a transcription and a degradation term were tested. The parameters were identified in three steps: determination of the strongest connections between genes, optimization of the parameters defining these connections, and elimination of the unnecessary parameters using various reduction schemes. Different solutions were compared on the basis of their abilities to reproduce the data, to keep realistic gene expression levels when extrapolated in time, to show the biologically expected robustness with respect to parameter variations, and to contain as few parameters as possible.ConclusionsWe showed that the linear model did very well in reproducing the data with few parameters, but was not sufficiently robust and yielded unrealistic values upon extrapolation in time. In contrast, the non-linear models all reached the latter two objectives, but some were unable to reproduce the data. A family of non-linear models, constructed from the exponential of linear combinations of expression levels, reached all the objectives. It defined networks with a mean number of connections equal to two, when restricted to the embryonic time series, and equal to five for the full time series. These networks were compared with experimental data about gene-transcription factor and protein-protein interactions. The non-uniqueness of the solutions was discussed in the context of plasticity and cluster versus single-gene networks.

[1]  Kevin Kontos,et al.  Information-Theoretic Inference of Large Transcriptional Regulatory Networks , 2007, EURASIP J. Bioinform. Syst. Biol..

[2]  Xiwei Wu,et al.  From microarray to biological networks: Analysis of gene expression profiles. , 2006, Methods in molecular biology.

[3]  Stephen Guest,et al.  DroID 2011: a comprehensive, integrated resource for protein, transcription factor, RNA and gene interactions for Drosophila , 2010, Nucleic Acids Res..

[4]  Edward R. Dougherty,et al.  Inferring gene regulatory networks from time series data using the minimum description length principle , 2006, Bioinform..

[5]  Jaroslav Albert,et al.  Robustness Analysis of a Linear Dynamical Model of the Drosophila Gene Expression , 2010, CIBB.

[6]  Dong Xu,et al.  Projecting Gene Expression Trajectories through Inducing Differential Equations from Microarray Time Series Experiments , 2008, J. Signal Process. Syst..

[7]  T. Gregory Dewey,et al.  From Microarray to Biological Networks , 2006 .

[8]  Wenxuan Zhong,et al.  A data-driven clustering method for time course gene expression data , 2006, Nucleic acids research.

[9]  John Quackenbush Microarray data normalization and transformation , 2002, Nature Genetics.

[10]  Ziv Bar-Joseph,et al.  Analyzing time series gene expression data , 2004, Bioinform..

[11]  Denis Thieffry,et al.  Segmenting the fly embryo: a logical analysis of the pair-rule cross-regulatory module. , 2003, Journal of theoretical biology.

[12]  Paulo J. G. Lisboa,et al.  Proceedings of the 7th international conference on Computational intelligence methods for bioinformatics and biostatistics , 2010 .

[13]  Chao Sima,et al.  Inference of Gene Regulatory Networks Using Time-Series Data: A Survey , 2009, Current genomics.

[14]  H. Jäckle,et al.  From gradients to stripes in Drosophila embryogenesis: filling in the gaps. , 1996, Trends in genetics : TIG.

[15]  G. Gibson,et al.  Microarray Analysis , 2020, Definitions.

[16]  N. Lee,et al.  Computational and experimental approaches for modeling gene regulatory networks. , 2007, Current pharmaceutical design.

[17]  Masaru Tomita,et al.  Indeterminacy of Reverse Engineering of Gene Regulatory Networks: The Curse of Gene Elasticity , 2007, PloS one.

[18]  P. Brazhnik,et al.  Linking the genes: inferring quantitative gene networks from microarray data. , 2002, Trends in genetics : TIG.

[19]  Jürgen Kurths,et al.  Unraveling gene regulatory networks from time-resolved gene expression data -- a measures comparison study , 2011, BMC Bioinformatics.

[20]  D. Thieffry,et al.  Segmenting the fly embryo: logical analysis of the role of the segment polarity cross-regulatory module. , 2008, The International journal of developmental biology.

[21]  Juergen Hahn,et al.  Parameter Set Selection via Clustering of Parameters into Pairwise Indistinguishable Groups of Parameters , 2009 .

[22]  Marianne Rooman,et al.  Gene expression model (in)validation by Fourier analysis , 2010, BMC Systems Biology.

[23]  Martin Dufva,et al.  Introduction to Microarray Technology , 2018, Predictive Biomarkers in Oncology.

[24]  Terence P. Speed,et al.  A comparison of normalization methods for high density oligonucleotide array data based on variance and bias , 2003, Bioinform..

[25]  J. Doyle,et al.  Mutation, specialization, and hypersensitivity in highly optimized tolerance , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[26]  Araceli M. Huerta,et al.  From specific gene regulation to genomic networks: a global analysis of transcriptional regulation in Escherichia coli. , 1998, BioEssays : news and reviews in molecular, cellular and developmental biology.

[27]  M. Rooman,et al.  Detection of Perturbation Phases and Developmental Stages in Organisms from DNA Microarray Time Series Data , 2011, PloS one.

[28]  Philippe Bogaerts,et al.  Modeling the temporal evolution of the Drosophila gene expression from DNA microarray time series , 2009, Physical biology.

[29]  Lakhdar Chiter DIRECT algorithm: A new definition of potentially optimal hyperrectangles , 2006, Appl. Math. Comput..

[30]  H. Kitano Towards a theory of biological robustness , 2007, Molecular systems biology.

[31]  I. Androulakis,et al.  Analysis of time-series gene expression data: methods, challenges, and opportunities. , 2007, Annual review of biomedical engineering.

[32]  Leon D. Segal,et al.  Functions , 1995 .

[33]  Jaroslav Albert,et al.  Dynamic modeling of gene expression in prokaryotes: application to glucose-lactose diauxie in Escherichia coli , 2011, Systems and Synthetic Biology.

[34]  J. Stelling,et al.  Robustness of Cellular Functions , 2004, Cell.

[35]  E. Furlong,et al.  Challenges for modeling global gene regulatory networks during development: insights from Drosophila. , 2010, Developmental biology.

[36]  B. S. Baker,et al.  Gene Expression During the Life Cycle of Drosophila melanogaster , 2002, Science.