Finding Significantly Expressed genes from time-course expression profiles

This paper proposes a statistical method for finding Significantly Expressed (SE) genes from time-course expression. SE genes are time-dependent while non-SE genes are time-independent. This method models time-dependent gene expression profiles by autoregressive equations plus Gaussian noises, and time-independent ones by Gaussian noises. The statistical F-testing is used to calculate the probability (p-value) that a profile is time-independent. Both a synthetic dataset and a biological dataset were employed to evaluate the performance of this method, measured by the False Discovery Rate (FDR) and the False Non-discovery Rate (FNR). Results show that the proposed method outperforms traditional methods.

[1]  George A. F. Seber,et al.  Linear regression analysis , 1977 .

[2]  P. Kuwabara DNA Microarrays and Gene Expression: From Experiments to Data Analysis and Modeling , 2003 .

[3]  Thomas G. Dietterich,et al.  Bioinformatics The Machine Learning Approach 2nd ed. , 2001 .

[4]  H. McAdams,et al.  Global analysis of the genetic network controlling a bacterial cell cycle. , 2000, Science.

[5]  Alan J. Lee,et al.  Linear Regression Analysis: Seber/Linear , 2003 .

[6]  Gordon K. Smyth,et al.  Use of within-array replicate spots for assessing differential expression in microarray experiments , 2005, Bioinform..

[7]  D. Botstein,et al.  Genomic expression programs in the response of yeast cells to environmental changes. , 2000, Molecular biology of the cell.

[8]  Ziv Bar-Joseph,et al.  Analyzing time series gene expression data , 2004, Bioinform..

[9]  R. Tibshirani,et al.  Significance analysis of microarrays applied to the ionizing radiation response , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[10]  Fang-Xiang Wu,et al.  Dynamic Model-based Clustering for Time-course Gene Expression Data , 2005, J. Bioinform. Comput. Biol..

[11]  Michael Ruogu Zhang,et al.  Comprehensive identification of cell cycle-regulated genes of the yeast Saccharomyces cerevisiae by microarray hybridization. , 1998, Molecular biology of the cell.

[12]  R. Melnik Dynamic system evolution and markov chain approximation , 1998 .

[13]  J. Mesirov,et al.  Molecular classification of cancer: class discovery and class prediction by gene expression monitoring. , 1999, Science.

[14]  Ash A. Alizadeh,et al.  Distinct types of diffuse large B-cell lymphoma identified by gene expression profiling , 2000, Nature.

[15]  C. Ball,et al.  Identification of genes periodically expressed in the human cell cycle and their expression in tumors. , 2002, Molecular biology of the cell.

[16]  D. Lockhart,et al.  Mitotic misregulation and human aging. , 2000, Science.

[17]  H J Keselman,et al.  Controlling the rate of Type I error over a large set of statistical tests. , 2002, The British journal of mathematical and statistical psychology.

[18]  Y. Benjamini,et al.  On the Adaptive Control of the False Discovery Rate in Multiple Testing With Independent Statistics , 2000 .

[19]  J. Claverie Computational methods for the identification of differential and coordinated gene expression. , 1999, Human molecular genetics.

[20]  Pierre Baldi,et al.  Bioinformatics - the machine learning approach (2. ed.) , 2000 .

[21]  Alexander Schliep,et al.  Analyzing Gene Expression Time-Courses , 2005, IEEE ACM Trans. Comput. Biol. Bioinform..

[22]  D. Botstein,et al.  The transcriptional program in the response of human fibroblasts to serum. , 1999, Science.

[23]  Paola Sebastiani,et al.  Cluster analysis of gene expression dynamics , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[24]  Ivan G. Costa,et al.  Analyzing gene expression time-courses , 2005, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[25]  Y. Benjamini,et al.  Controlling the false discovery rate: a practical and powerful approach to multiple testing , 1995 .

[26]  D. Botstein,et al.  Cluster analysis and display of genome-wide expression patterns. , 1998, Proceedings of the National Academy of Sciences of the United States of America.

[27]  Adrian E. Raftery,et al.  Model-based clustering and data transformations for gene expression data , 2001, Bioinform..

[28]  A. Harvey Time series models , 1983 .