Spectral Preprocessing for Clustering Time-Series Gene Expressions

Based on gene expression profiles, genes can be partitioned into clusters, which might be associated with biological processes or functions, for example, cell cycle, circadian rhythm, and so forth. This paper proposes a novel clustering preprocessing strategy which combines clustering with spectral estimation techniques so that the time information present in time series gene expressions is fully exploited. By comparing the clustering results with a set of biologically annotated yeast cell-cycle genes, the proposed clustering strategy is corroborated to yield significantly different clusters from those created by the traditional expression-based schemes. The proposed technique is especially helpful in grouping genes participating in time-regulated processes.

[1]  N. Lomb Least-squares frequency analysis of unequally spaced data , 1976 .

[2]  Francis D. Gibbons,et al.  Judging the quality of gene expression-based clustering methods using gene annotation. , 2002, Genome research.

[3]  Petre Stoica,et al.  Spectral analysis of irregularly-sampled data: Paralleling the regularly-sampled data approaches , 2006, Digit. Signal Process..

[4]  Edward R. Dougherty,et al.  Detecting Periodic Genes from Irregularly Sampled Gene Expressions: A Comparison Study , 2008, EURASIP J. Bioinform. Syst. Biol..

[5]  Patrik D'haeseleer,et al.  How does gene expression clustering work? , 2005, Nature Biotechnology.

[6]  P. Bartholdi,et al.  VARIABLE STARS : WHICH NYQUIST FREQUENCY? , 1999 .

[7]  G. Church,et al.  Systematic determination of genetic network architecture , 1999, Nature Genetics.

[8]  J. Scargle Studies in astronomical time series analysis. II - Statistical aspects of spectral analysis of unevenly spaced data , 1982 .

[9]  Atul J. Butte,et al.  Comparing the Similarity of Time-Series Gene Expression Using Signal Processing Metrics , 2001, J. Biomed. Informatics.

[10]  Xiaobo Zhou,et al.  Gene Clustering Based on Clusterwide Mutual Information , 2004, J. Comput. Biol..

[11]  Jian Li,et al.  Nonparametric spectral analysis with missing data via the EM algorithm , 2004, Conference Record of the Thirty-Eighth Asilomar Conference on Signals, Systems and Computers, 2004..

[12]  Michael Ruogu Zhang,et al.  Comprehensive identification of cell cycle-regulated genes of the yeast Saccharomyces cerevisiae by microarray hybridization. , 1998, Molecular biology of the cell.

[13]  M. Meilă Comparing clusterings---an information based distance , 2007 .

[14]  Paola Sebastiani,et al.  Cluster analysis of gene expression dynamics , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[15]  David R. Brillinger,et al.  SECOND-ORDER MOMENTS AND MUTUAL INFORMATION IN THE ANALYSIS OF TIME SERIES , 2002 .

[16]  S Fuhrman,et al.  Reveal, a general reverse engineering algorithm for inference of genetic network architectures. , 1998, Pacific Symposium on Biocomputing. Pacific Symposium on Biocomputing.

[17]  Jaakko Astola,et al.  Fast Iterative Gene Clustering Based on Information Theoretic Criteria for Selecting the Cluster Structure , 2004, J. Comput. Biol..

[18]  J. Mesirov,et al.  Interpreting patterns of gene expression with self-organizing maps: methods and application to hematopoietic differentiation. , 1999, Proceedings of the National Academy of Sciences of the United States of America.

[19]  Edward R. Dougherty,et al.  Inferring gene regulatory networks from time series data using the minimum description length principle , 2006, Bioinform..

[20]  Ziv Bar-Joseph,et al.  Clustering short time series gene expression data , 2005, ISMB.

[21]  Jaakko Astola,et al.  Clustering the non-uniformly sampled time series of gene expression data , 2003, Seventh International Symposium on Signal Processing and Its Applications, 2003. Proceedings..

[22]  William M. Rand,et al.  Objective Criteria for the Evaluation of Clustering Methods , 1971 .

[23]  Ronald K. Pearson,et al.  BMC Bioinformatics BioMed Central Methodology article , 2005 .

[24]  C. Ball,et al.  Identification of genes periodically expressed in the human cell cycle and their expression in tumors. , 2002, Molecular biology of the cell.

[25]  Korbinian Strimmer,et al.  Identifying periodically expressed transcripts in microarray time series data , 2008, Bioinform..

[26]  Korbinian Strimmer,et al.  Identifying periodically expressed transcripts in microarray time series data , 2008, Bioinform..

[27]  E. L. Robinson,et al.  The 1051 s period of the interacting binary white dwarf amcvn , 1984 .

[28]  D. Botstein,et al.  Cluster analysis and display of genome-wide expression patterns. , 1998, Proceedings of the National Academy of Sciences of the United States of America.

[29]  C. Mallows,et al.  A Method for Comparing Two Hierarchical Clusterings , 1983 .

[30]  Jie Chen,et al.  Bioinformatics Original Paper Detecting Periodic Patterns in Unevenly Spaced Gene Expression Time Series Using Lomb–scargle Periodograms , 2022 .

[31]  I. Simon,et al.  Combined static and dynamic analysis for determining the quality of time-series expression profiles , 2005, Nature Biotechnology.