Spectral estimation in unevenly sampled space of periodically expressed microarray time series data.

Background: Periodogram analysis of time series is widespread in biology. A new challenge in analyzing microarray time series data is to identify genes that are periodically expressed. The challenge arises because the observed time series usually exhibit non-idealities such as noise, short length, and unevenly sampled time points. Most methods in the literature operate on evenly sampled time series and are not suitable for unevenly sampled ones.

Results: For evenly sampled data, methods based on the classical Fourier periodogram are often used to detect periodically expressed genes. Recently, the Lomb-Scargle algorithm has been applied to unevenly sampled gene expression data for spectral estimation. However, because the Lomb-Scargle method assumes a single stationary sinusoid with infinite support, it introduces spurious periodic components into the periodogram for data of finite length. In this paper, we propose a new spectral estimation algorithm for unevenly sampled gene expression data. The new method is based on signal reconstruction in a shift-invariant signal space, where a direct spectral estimation procedure is developed using the B-spline basis. Experiments on simulated noisy gene expression profiles show that our algorithm outperforms both the Lomb-Scargle algorithm and the classical Fourier periodogram based method in detecting periodically expressed genes. We have applied our algorithm to Plasmodium falciparum and yeast gene expression data, and the results show that it detects biologically meaningful periodically expressed genes.

Conclusion: We have proposed an effective method for identifying periodic genes in unevenly sampled microarray time series gene expression data. The method can also be used as an effective tool for interpolating or resampling gene expression time series.
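For readers unfamiliar with the Lomb-Scargle periodogram used as the baseline above, the following is a minimal sketch of its textbook form applied to one unevenly sampled profile. It is not the B-spline method proposed in the paper. The function name `lomb_scargle`, the 48-hour window, the 40 sample times, the 12-hour period, and the noise level are all illustrative assumptions, not values taken from the paper.

```python
import math
import random


def lomb_scargle(t, y, freqs):
    """Normalized Lomb-Scargle periodogram (textbook form).

    t     : sample times (need not be evenly spaced)
    y     : observed values at those times
    freqs : trial frequencies in cycles per time unit (all > 0)
    """
    n = len(t)
    mean = sum(y) / n
    var = sum((v - mean) ** 2 for v in y) / (n - 1)
    power = []
    for f in freqs:
        w = 2.0 * math.pi * f
        # The time offset tau makes the sine and cosine terms orthogonal
        # on the actual (uneven) sample times.
        s2 = sum(math.sin(2.0 * w * tj) for tj in t)
        c2 = sum(math.cos(2.0 * w * tj) for tj in t)
        tau = math.atan2(s2, c2) / (2.0 * w)
        cs = [math.cos(w * (tj - tau)) for tj in t]
        sn = [math.sin(w * (tj - tau)) for tj in t]
        yc = sum((v - mean) * c for v, c in zip(y, cs))
        ys = sum((v - mean) * s for v, s in zip(y, sn))
        cc = sum(c * c for c in cs)
        ss = sum(s * s for s in sn)
        power.append((yc * yc / cc + ys * ys / ss) / (2.0 * var))
    return power


# Hypothetical example: a noisy 12-hour rhythm observed at 40 random
# time points over 48 hours (all values are assumptions for illustration).
random.seed(0)
t = sorted(random.uniform(0.0, 48.0) for _ in range(40))
y = [math.sin(2.0 * math.pi * tj / 12.0) + random.gauss(0.0, 0.3) for tj in t]

freqs = [k / 200.0 for k in range(1, 51)]  # 0.005 to 0.25 cycles per hour
power = lomb_scargle(t, y, freqs)
peak_freq = freqs[power.index(max(power))]
print("peak frequency (cycles/hour):", peak_freq)
```

In a screening setting, the peak power of each gene would then be compared against a significance threshold, with a multiple-testing correction applied across genes.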
