A random-periods model for expression of cell-cycle genes.

We propose a nonlinear regression model for quantitatively analyzing periodic gene expression in studies of experimentally synchronized cells. Our model accounts for the observed attenuation in cycle amplitude by a simple and biologically plausible mechanism. We represent the expression level for each gene as an average across a large number of cells. For a given cell-cycle gene, we model its expression in each cell in the culture as following the same sinusoidal function except that the period, which in any individual cell must be the same for all cell-cycle genes, varies randomly across cells. We model these random periods by using a lognormal distribution. The variability in period causes the measured amplitude of the cyclic expression trajectory to attenuate over time as cells fall increasingly out of synchrony. Gene-specific parameters include initial amplitude and phase angle. Applying the model to data from Whitfield et al. [Whitfield, M. L., Sherlock, G., Saldanha, A. J., Murray, J, I., Ball, C. A., et al. (2002) Mol. Biol. Cell 13, 1977-2000], we fit the trajectories of 18 well characterized phase-marker genes and find that the fit does not suffer when a common lognormal distribution is assumed for all 18 genes compared with a separate distribution for each. We then use the model to identify 337 periodically expressed transcripts, including the 18 phase-marker genes. The model permits estimation of and hypothesis testing about biologically meaningful parameters that characterize cycling genes.

[1]  Asymptotic theory in heteroscedastic nonlinear models , 1990 .

[2]  김삼묘,et al.  “Bioinformatics” 특집을 내면서 , 2000 .

[3]  H. Künsch The Jackknife and the Bootstrap for General Stationary Observations , 1989 .

[4]  Laurie J. Heyer,et al.  Exploring expression data: identification and analysis of coexpressed genes. , 1999, Genome research.

[5]  C. Ball,et al.  Identification of genes periodically expressed in the human cell cycle and their expression in tumors. , 2002, Molecular biology of the cell.

[6]  Neal S. Holter,et al.  Dynamic modeling of gene expression data. , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[7]  Ronald W. Davis,et al.  Transcriptional regulation and function during the human cell cycle , 2001, Nature Genetics.

[8]  Paola Sebastiani,et al.  Cluster analysis of gene expression dynamics , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[9]  Anders Berglund,et al.  A multivariate approach applied to microarray data for identification of genes with cell cycle-coupled transcription , 2003, Bioinform..

[10]  D. Botstein,et al.  The transcriptional program of sporulation in budding yeast. , 1998, Science.

[11]  L. P. Zhao,et al.  Statistical modeling of large microarray data sets to identify stimulus-response profiles , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[12]  Shyamal D. Peddada,et al.  Gene Selection and Clustering for Time-course and Dose-response Microarray Experiments Using Order-restricted Inference , 2003, Bioinform..

[13]  Neal S. Holter,et al.  Fundamental patterns underlying gene expression profiles: simplicity from complexity. , 2000, Proceedings of the National Academy of Sciences of the United States of America.

[14]  Shyamal D. Peddada,et al.  21 Jackknife variance estimation and bias reduction , 1993, Computational Statistics.

[15]  Bradley Efron,et al.  Statistics in the 21st century , 1993 .

[16]  Hongzhe Li,et al.  Clustering of time-course gene expression data using a mixed-effects model with B-splines , 2003, Bioinform..

[17]  Ronald W. Davis,et al.  A genome-wide transcriptional analysis of the mitotic cell cycle. , 1998, Molecular cell.

[18]  D. Botstein,et al.  Singular value decomposition for genome-wide expression data processing and modeling. , 2000, Proceedings of the National Academy of Sciences of the United States of America.

[19]  Shyamal D. Peddada,et al.  Jackknife variance estimators in linear models , 1992 .

[20]  Michael Ruogu Zhang,et al.  Comprehensive identification of cell cycle-regulated genes of the yeast Saccharomyces cerevisiae by microarray hybridization. , 1998, Molecular biology of the cell.

[21]  Mike Tyers,et al.  Systematic Identification of Pathways That Couple Cell Growth and Division in Yeast , 2002, Science.

[22]  A. Gallant,et al.  Nonlinear Statistical Models , 1988 .

[23]  Calyampudi R. Rao Handbook of statistics , 1980 .