Identification of Global Transcriptional Dynamics

Background One of the challenges in exploiting high throughput measurement techniques such as microarrays is the conversion of the vast amounts of data obtained into relevant knowledge. Of particular importance is the identification of the intrinsic response of a transcriptional experiment and the characterization of the underlying dynamics. Methodology and Findings The proposed algorithm seeks to provide the researcher a summary as to various aspects relating to the dynamic progression of a biological system, rather than that of individual genes. The approach is based on the identification of smaller number of expression motifs that define the transcriptional state of the system which quantifies the deviation of the cellular response from a control state in the presence of an external perturbation. The approach is demonstrated with a number of data sets including a synthetic base case and four animal studies. The synthetic dataset will be used to establish the response of the algorithm on a “null” dataset, whereas the four different experimental datasets represent a spectrum of possible time course experiments in terms of the degree of perturbation associated with the experiment as well as representing a wide range of temporal sampling strategies. This wide range of experimental datasets will thus allow us to explore the performance of the proposed algorithm and determine its ability identify relevant information. Conclusions and Significance In this work, we present a computational approach which operates on high throughput temporal gene expression data to assess the information content of the experiment, identify dynamic markers of important processes associated with the experimental perturbation, and summarize in a concise manner the evolution of the system over time with respect to the experimental perturbation.

[1]  Richard R. Almon,et al.  Gene arrays and temporal patterns of drug response: corticosteroid effects on rat liver , 2003, Functional & Integrative Genomics.

[2]  Martin L. Yarmush,et al.  Dynamics of gene expression in rat hepatocytes under stress. , 2000, Metabolic engineering.

[3]  Debra C DuBois,et al.  Pharmacodynamics and pharmacogenomics of methylprednisolone during 7-day infusions in rats. , 2002, The Journal of pharmacology and experimental therapeutics.

[4]  J. Mesirov,et al.  Molecular classification of cancer: class discovery and class prediction by gene expression monitoring. , 1999, Science.

[5]  Martin L. Yarmush,et al.  Expression profiling analysis of the metabolic and inflammatory changes following burn injury in rats. , 2004, Physiological genomics.

[6]  Gabriel S. Eichler,et al.  Cell fates as high-dimensional attractor states of a complex gene regulatory network. , 2005, Physical review letters.

[7]  V. Cassone,et al.  Circadian profiling of the transcriptome in immortalized rat SCN cells. , 2005, Physiological genomics.

[8]  S. Stürzenbaum,et al.  'Systems toxicology' approach identifies coordinated metabolic responses to copper in a terrestrial non-model invertebrate, the earthworm Lumbricus rubellus , 2008, BMC Biology.

[9]  Alexander Schliep,et al.  Using hidden Markov models to analyze gene expression time course data , 2003, ISMB.

[10]  Eamonn J. Keogh,et al.  HOT SAX: efficiently finding the most unusual time series subsequence , 2005, Fifth IEEE International Conference on Data Mining (ICDM'05).

[11]  Atul J. Butte,et al.  Quantifying the relationship between co-expression, co-regulation and gene function , 2004, BMC Bioinformatics.

[12]  R. Tibshirani,et al.  Significance analysis of microarrays applied to the ionizing radiation response , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[13]  Jin Y. Jin,et al.  Modeling of Corticosteroid Pharmacogenomics in Rat Liver Using Gene Microarrays , 2003, Journal of Pharmacology and Experimental Therapeutics.

[14]  Ioannis P. Androulakis,et al.  Bioinformatics analysis of the early inflammatory response in a rat thermal injury model , 2007, BMC Bioinformatics.

[15]  M. Dougados,et al.  Tolerance and short term efficacy of rituximab in 43 patients with systemic autoimmune diseases , 2004, Annals of the rheumatic diseases.

[16]  Arul Jayaraman,et al.  Evaluation of an in vitro model of hepatic inflammatory response by gene expression profiling. , 2005, Tissue engineering.

[17]  Varun Garg,et al.  Comparison of four basic models of indirect pharmacodynamic responses , 1993, Journal of Pharmacokinetics and Biopharmaceutics.

[18]  Ziv Bar-Joseph,et al.  Clustering short time series gene expression data , 2005, ISMB.

[19]  D K Agrafiotis,et al.  Kolmogorov-Smirnov statistic and its application in library design. , 2000, Journal of molecular graphics & modelling.

[20]  Murray R. Spiegel,et al.  Schaum's outline of theory and problems of statistics. , 1961 .

[21]  Scott N Peterson,et al.  The complexity of simplicity , 2001, Genome Biology.

[22]  Eyke Hüllermeier,et al.  Clustering of gene expression data using a local shape-based similarity measure , 2005, Bioinform..

[23]  Jeffrey T. Leek,et al.  Gene expression EDGE : extraction and analysis of differential gene expression , 2006 .

[24]  Ziv Bar-Joseph,et al.  STEM: a tool for the analysis of short time series gene expression data , 2006, BMC Bioinformatics.

[25]  Michal Linial,et al.  Novel Unsupervised Feature Filtering of Biological Data , 2006, ISMB.

[26]  Dennis B. Troup,et al.  NCBI GEO: mining millions of expression profiles—database and tools , 2004, Nucleic Acids Res..

[27]  Bruno R. Preiss,et al.  Data Structures and Algorithms with Object-Oriented Design Patterns in Java , 1999 .

[28]  G. Churchill Using ANOVA to analyze microarray data. , 2004, BioTechniques.

[29]  Richard J. Fox,et al.  A two-sample Bayesian t-test for microarray data , 2006, BMC Bioinformatics.