Identification of Under-Detected Periodicity in Time-Series Microarray Data by Using Empirical Mode Decomposition

Detecting periodicity signals from time-series microarray data is commonly used to facilitate the understanding of the critical roles and underlying mechanisms of regulatory transcriptomes. However, time-series microarray data are noisy. How the temporal data structure affects the performance of periodicity detection has remained elusive. We present a novel method based on empirical mode decomposition (EMD) to examine this effect. We applied EMD to a yeast microarray dataset and extracted a series of intrinsic mode function (IMF) oscillations from the time-series data. Our analysis indicated that many periodically expressed genes might have been under-detected in the original analysis because of interference between decomposed IMF oscillations. By validating a protein complex coexpression analysis, we revealed that 56 genes were newly determined as periodic. We demonstrated that EMD can be used incorporating with existing periodicity detection methods to improve their performance. This approach can be applied to other time-series microarray studies.

[1]  Earl F. Glynn,et al.  Comparison of Pattern Detection Methods in Microarray Time Series of the Segmentation Clock , 2008, PloS one.

[2]  Andrzej Kudlicki,et al.  SCEPTRANS: an online tool for analyzing periodic transcription in yeast , 2007, Bioinform..

[3]  Kai Li,et al.  Exploring the functional landscape of gene expression: directed search of large microarray compendia , 2007, Bioinform..

[4]  Ker-Chau Li,et al.  Patterns of co-expression for protein complexes by size in Saccharomyces cerevisiae , 2008, Nucleic acids research.

[5]  Swanirbhar Majumder,et al.  A Novel EMD based Watermarking of Fingerprint Biometric Using GEP , 2013 .

[6]  I. Simon,et al.  Deconvolving cell cycle expression data with complementary information , 2004, ISMB/ECCB.

[7]  Peer Bork,et al.  Comparison of computational methods for the identification of cell cycle-regulated genes , 2005, Bioinform..

[8]  L. Jensen,et al.  The more the merrier: comparative analysis of microarray studies on cell cycle‐regulated genes in fission yeast , 2006, Yeast.

[9]  Catherine Stamoulis,et al.  A novel signal processing approach for the detection of copy number variations in the human genome , 2011, Bioinform..

[10]  Jean YH Yang,et al.  Bioconductor: open software development for computational biology and bioinformatics , 2004, Genome Biology.

[11]  Hanlee P. Ji,et al.  The MicroArray Quality Control (MAQC) project shows inter- and intraplatform reproducibility of gene expression measurements. , 2006, Nature biotechnology.

[12]  David Botstein,et al.  GO: : TermFinder--open source software for accessing Gene Ontology information and finding significantly enriched Gene Ontology terms associated with a list of genes , 2004, Bioinform..

[13]  Hee-Seok Oh,et al.  EMD: A Package for Empirical Mode Decomposition and Hilbert Spectrum , 2009 .

[14]  Steven B. Haase,et al.  Design and analysis of large-scale biological rhythm studies: a comparison of algorithms for detecting periodic signals in biological data , 2013, Bioinform..

[15]  S. Brunak,et al.  New weakly expressed cell cycle‐regulated genes in yeast , 2005, Yeast.

[16]  Michael Ruogu Zhang,et al.  Comprehensive identification of cell cycle-regulated genes of the yeast Saccharomyces cerevisiae by microarray hybridization. , 1998, Molecular biology of the cell.

[17]  Zu-Guo Yu,et al.  Fuzzy C-means method with empirical mode decomposition for clustering microarray data , 2010, 2010 IEEE International Conference on Bioinformatics and Biomedicine (BIBM).

[18]  A. Kudlicki,et al.  Logic of the Yeast Metabolic Cycle: Temporal Compartmentalization of Cellular Processes , 2005, Science.

[19]  Konan Peck,et al.  Matching gene activity with physiological functions , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[20]  Christian Lovis,et al.  Empirical Mode Decomposition and k-Nearest Embedding Vectors for Timely Analyses of Antibiotic Resistance Trends , 2013, PloS one.

[21]  Yongjin Park,et al.  How networks change with time , 2012, Bioinform..

[22]  C. Ball,et al.  Identification of genes periodically expressed in the human cell cycle and their expression in tumors. , 2002, Molecular biology of the cell.

[23]  Maqc Consortium The MicroArray Quality Control ( MAQC )-II study of common practices for the development and validation of microarray-based predictive models , 2012 .

[24]  Kerby Shedden,et al.  Analysis of cell-cycle gene expression in Saccharomyces cerevisiae using microarrays and multiple synchronization methods , 2002, Nucleic Acids Res..

[25]  Claudio Altafini,et al.  mRNA stability and the unfolding of gene expression in the long-period yeast metabolic cycle , 2009, BMC Systems Biology.

[26]  D. Murray,et al.  A genomewide oscillation in transcription gates DNA replication and cell cycle. , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[27]  Alfredo Germani,et al.  Stochastic Modeling of Expression Kinetics Identifies Messenger Half-Lives and Reveals Sequential Waves of Co-ordinated Transcription and Decay , 2012, PLoS Comput. Biol..

[28]  Pi-Cheng Tung,et al.  Data analysis using a combination of independent component analysis and empirical mode decomposition. , 2009, Physical review. E, Statistical, nonlinear, and soft matter physics.

[29]  Matthias E. Futschik,et al.  Are we Overestimating the Number of Cell-Cycling Genes? The Impact of Background Models , 2008, German Conference on Bioinformatics.

[30]  Dmitrij Frishman,et al.  MIPS: a database for genomes and protein sequences , 1999, Nucleic Acids Res..

[31]  Alex E. Lash,et al.  Gene Expression Omnibus: NCBI gene expression and hybridization array data repository , 2002, Nucleic Acids Res..

[32]  Chung-Kang Peng,et al.  Temporal Associations between Weather and Headache: Analysis by Empirical Mode Decomposition , 2011, PloS one.

[33]  Max A. Little,et al.  Highly comparative time-series analysis: the empirical structure of time series and their methods , 2013, Journal of The Royal Society Interface.

[34]  N. Huang,et al.  The empirical mode decomposition and the Hilbert spectrum for nonlinear and non-stationary time series analysis , 1998, Proceedings of the Royal Society of London. Series A: Mathematical, Physical and Engineering Sciences.

[35]  Ronald W. Davis,et al.  A genome-wide transcriptional analysis of the mitotic cell cycle. , 1998, Molecular cell.

[36]  Slawomir J. Nasuto,et al.  Neural Correlates of True and False Memory in Mild Cognitive Impairment , 2012, PloS one.

[37]  S. Dudoit,et al.  Normalization for cDNA microarray data: a robust composite method addressing single and multiple slide systematic variation. , 2002, Nucleic acids research.

[38]  Zhaohua Wu,et al.  On the trend, detrending, and variability of nonlinear and nonstationary time series , 2007, Proceedings of the National Academy of Sciences.

[39]  M. Ashburner,et al.  Gene Ontology: tool for the unification of biology , 2000, Nature Genetics.

[40]  Attila Csikász-Nagy,et al.  Computational systems biology of the cell cycle , 2009, Briefings Bioinform..

[41]  Benjamin M. Bolstad,et al.  affy - analysis of Affymetrix GeneChip data at the probe level , 2004, Bioinform..

[42]  Kun Hu,et al.  A Nonlinear Dynamic Approach Reveals a Long-Term Stroke Effect on Cerebral Blood Flow Regulation at Multiple Time Scales , 2012, PLoS Comput. Biol..