Identification of significant periodic genes in microarray gene expression data

Background: One frequent application of microarray experiments is monitoring gene activity in a cell during the cell cycle or cell division. A new challenge in analyzing these experiments is to identify genes whose periodic expression during the cell cycle is statistically significant. The challenge arises from the large number of genes measured simultaneously, the moderate to small number of measurements per gene taken at different time points, and the high levels of non-normal random noise inherent in the data.

Results: Based on two statistical hypothesis testing methods for identifying periodic time series, a novel statistical inference approach, the C&G procedure, is proposed to screen effectively for genes with statistically significant periodic expression. The approach is applied to yeast and bacterial cell cycle gene expression data sets, as well as to human fibroblast and human cancer cell line data sets, and significantly periodically expressed genes are successfully identified.

Conclusion: The proposed C&G procedure is an effective method for identifying statistically significant periodic genes in microarray time series gene expression data.
