eXPatGen: Generating Dynamic Expression Patterns for the Systematic Evaluation of Analytical Methods

MOTIVATION Experimental gene expression data sets, such as those generated by microarray or gene chip experiments, typically have significant noise and complicated interconnectivities that make understanding even simple regulatory patterns difficult. Given these complications, characterizing the effectiveness of different analysis techniques to uncover network groups and structures remains a challenge. Generating simulated expression patterns with known biological features of expression complexity, diversity and interconnectivities provides a more controlled means of investigating the appropriateness of different analysis methods. A simulation-based approach can systematically evaluate different gene expression analysis techniques and provide a basis for improved methods in dynamic metabolic network reconstruction. RESULTS We have developed an on-line simulator, called eXPatGen, to generate dynamic gene expression patterns typical of microarray experiments. eXPatGen provides a quantitative network structure to represent key biological features, including the induction, repression, and cascade regulation of messenger RNA (mRNA). The simulation is modular such that the expression model can be replaced with other representations, depending on the level of biological detail required by the user. Two example gene networks, of 25 and 100 genes respectively, were simulated. Two standard analysis techniques, clustering and PCA analysis, were performed on the resulting expression patterns in order to demonstrate how the simulator might be used to evaluate different analysis methods and provide experimental guidance for biological studies of gene expression. AVAILABILITY http://www.che.udel.edu/eXPatGen/

[1]  D. Botstein,et al.  Singular value decomposition for genome-wide expression data processing and modeling. , 2000, Proceedings of the National Academy of Sciences of the United States of America.

[2]  W. Michalek,et al.  Identification of genes specifically expressed in maternal and filial tissues of barley caryopses: a cDNA array analysis , 2001, Molecular Genetics and Genomics.

[3]  P. Brown,et al.  Exploring the metabolic and genetic control of gene expression on a genomic scale. , 1997, Science.

[4]  Gary A. Churchill,et al.  Analysis of Variance for Gene Expression Microarray Data , 2000, J. Comput. Biol..

[5]  Jizhong Zhou,et al.  DNA Microarray Analysis of the Hyperthermophilic Archaeon Pyrococcus furiosus: Evidence for a New Type of Sulfur-Reducing Enzyme Complex , 2001, Journal of bacteriology.

[6]  Rainer Fuchs,et al.  Analysis of temporal gene expression profiles: clustering by simulated annealing and determining the optimal number of clusters , 2001, Bioinform..

[7]  J. Barker,et al.  Large-scale temporal gene expression mapping of central nervous system development. , 1998, Proceedings of the National Academy of Sciences of the United States of America.

[8]  D. Botstein,et al.  Cluster analysis and display of genome-wide expression patterns. , 1998, Proceedings of the National Academy of Sciences of the United States of America.

[9]  D Haussler,et al.  Knowledge-based analysis of microarray gene expression data by using support vector machines. , 2000, Proceedings of the National Academy of Sciences of the United States of America.

[10]  Nello Cristianini,et al.  Support vector machine classification and validation of cancer tissue samples using microarray expression data , 2000, Bioinform..

[11]  R. Nuttall,et al.  An evaluation of the performance of cDNA microarrays for detecting changes in global mRNA expression. , 2001, Nucleic acids research.

[12]  Denis C. Shields,et al.  Integrating Genotypic Data With Transcriptomic and Proteomic Data , 2002, Comparative and functional genomics.

[13]  Nikolaus Rajewsky,et al.  The evolution of DNA regulatory regions for proteo-gamma bacteria by interspecies comparisons. , 2002, Genome research.

[14]  Neal S. Holter,et al.  Fundamental patterns underlying gene expression profiles: simplicity from complexity. , 2000, Proceedings of the National Academy of Sciences of the United States of America.

[15]  Terence Hwa,et al.  Extracting transcriptional events from temporal gene expression patterns during Dictyostelium development , 2002, Bioinform..

[16]  A G Marsh,et al.  Gene expression and enzyme activities of the sodium pump during sea urchin development: implications for indices of physiological state. , 2000, The Biological bulletin.

[17]  Ka Yee Yeung,et al.  Validating clustering for gene expression data , 2001, Bioinform..

[18]  Joshua M. Stuart,et al.  MICROARRAY EXPERIMENTS : APPLICATION TO SPORULATION TIME SERIES , 1999 .

[19]  William H. Press,et al.  Numerical Recipes in FORTRAN - The Art of Scientific Computing, 2nd Edition , 1987 .

[20]  Lars Juhl Jensen,et al.  Automatic discovery of regulatory patterns in promoter regions based on whole cell expression data and functional annotation , 2000, Bioinform..

[21]  D. Botstein,et al.  Genomic expression programs in the response of yeast cells to environmental changes. , 2000, Molecular biology of the cell.

[22]  Partha S. Vasisht Computational Analysis of Microarray Data , 2003 .

[23]  L. Hood,et al.  A Genomic Regulatory Network for Development , 2002, Science.

[24]  John Quackenbush,et al.  Computational genetics: Computational analysis of microarray data , 2001, Nature Reviews Genetics.

[25]  J. Mesirov,et al.  Interpreting patterns of gene expression with self-organizing maps: methods and application to hematopoietic differentiation. , 1999, Proceedings of the National Academy of Sciences of the United States of America.

[26]  M. Vidal,et al.  Combined Functional Genomic Maps of the C. elegans DNA Damage Response , 2002, Science.

[27]  A. Goffeau,et al.  Genome microarray analysis of transcriptional activation in multidrug resistance yeast mutants , 2000, FEBS letters.

[28]  A. Brazma,et al.  Gene expression data analysis , 2000, FEBS letters.

[29]  Michal Linial,et al.  Using Bayesian Networks to Analyze Expression Data , 2000, J. Comput. Biol..

[30]  F. A. Seiler,et al.  Numerical Recipes in C: The Art of Scientific Computing , 1989 .

[31]  Hidde de Jong,et al.  Modeling and Simulation of Genetic Regulatory Systems: A Literature Review , 2002, J. Comput. Biol..

[32]  D. Botstein,et al.  The transcriptional program in the response of human fibroblasts to serum. , 1999, Science.