Significance analysis of time course microarray experiments.

Characterizing the genome-wide dynamic regulation of gene expression is important and will be of much interest in the future. However, there is currently no established method for identifying differentially expressed genes in a time course study. Here we propose a significance method for analyzing time course microarray studies that can be applied to the typical types of comparisons and sampling schemes. This method is applied to two studies on humans. In one study, genes are identified that show differential expression over time in response to in vivo endotoxin administration. By using our method, 7,409 genes are called significant at a 1% false-discovery rate level, whereas several existing approaches fail to identify any genes. In another study, 417 genes are identified at a 10% false-discovery rate level that show expression changing with age in the kidney cortex. Here it is also shown that as many as 47% of the genes change with age in a manner more complex than simple exponential growth or decay. The methodology proposed here has been implemented in the freely distributed and open-source edge software package.

[1]  R. Pearl Biometrics , 1914, The American Naturalist.

[2]  Mitchell J. Mergenthaler Nonparametrics: Statistical Methods Based on Ranks , 1979 .

[3]  Robert Tibshirani,et al.  An Introduction to the Bootstrap CHAPMAN & HALL/CRC , 1993 .

[4]  B. Silverman,et al.  Nonparametric Regression and Generalized Linear Models: A roughness penalty approach , 1993 .

[5]  P. Diggle Analysis of Longitudinal Data , 1995 .

[6]  P. Diggle,et al.  Analysis of Longitudinal Data. , 1997 .

[7]  Yuedong Wang Mixed effects smoothing spline analysis of variance , 1998 .

[8]  D. Botstein,et al.  Cluster analysis and display of genome-wide expression patterns. , 1998, Proceedings of the National Academy of Sciences of the United States of America.

[9]  D. Salant,et al.  Lack of chemokine receptor CCR1 enhances Th1 responses and glomerular injury during nephrotoxic nephritis. , 1999, The Journal of clinical investigation.

[10]  Don Foster,et al.  TACI and BCMA are receptors for a TNF homologue implicated in B-cell autoimmune disease , 2000, Nature.

[11]  D. Botstein,et al.  Singular value decomposition for genome-wide expression data processing and modeling. , 2000, Proceedings of the National Academy of Sciences of the United States of America.

[12]  R. Majeti,et al.  An Inactivating Point Mutation in the Inhibitory Wedge of CD45 Causes Lymphoproliferation and Autoimmunity , 2000, Cell.

[13]  D. Botstein,et al.  Two yeast forkhead genes regulate the cell cycle and pseudohyphal growth , 2000, Nature.

[14]  김삼묘,et al.  “Bioinformatics” 특집을 내면서 , 2000 .

[15]  R. Tibshirani,et al.  Significance analysis of microarrays applied to the ionizing radiation response , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[16]  A. DeFranco,et al.  Lupus-like kidney disease in mice deficient in the Src family tyrosine kinases Lyn and Fyn , 2001, Current Biology.

[17]  Colin O. Wu,et al.  Nonparametric Mixed Effects Models for Unequally Sampled Noisy Curves , 2001, Biometrics.

[18]  Gareth M. James,et al.  Functional linear discriminant analysis for irregularly sampled curves , 2001 .

[19]  Ana Ivelisse Avilés,et al.  Linear Mixed Models for Longitudinal Data , 2001, Technometrics.

[20]  R. Irizarry,et al.  Assessing Homeostasis Through Circadian Patterns , 2001, Biometrics.

[21]  C. Li,et al.  Model-based analysis of oligonucleotide arrays: expression index computation and outlier detection. , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[22]  E. Dougherty,et al.  Gene-expression profiles in hereditary breast cancer. , 2001, The New England journal of medicine.

[23]  D. Slonim From patterns to pathways: gene expression data analysis comes of age , 2002, Nature Genetics.

[24]  E. Haddad,et al.  Hypomagnesemia with secondary hypocalcemia is caused by mutations in TRPM6, a new member of the TRPM gene family , 2002, Nature Genetics.

[25]  John D. Storey A direct approach to false discovery rates , 2002 .

[26]  C1q Deficiency and Autoimmunity: The Effects of Genetic Background on Disease Expression1 , 2002, The Journal of Immunology.

[27]  Catherine A. Sugar,et al.  Clustering for Sparsely Sampled Functional Data , 2003 .

[28]  Tommi S. Jaakkola,et al.  Continuous Representations of Time-Series Gene Expression Data , 2003, J. Comput. Biol..

[29]  X. Cui,et al.  Statistical tests for differential expression in cDNA microarray experiments , 2003, Genome Biology.

[30]  Hongzhe Li,et al.  Clustering of time-course gene expression data using a mixed-effects model with B-splines , 2003, Bioinform..

[31]  John D. Storey,et al.  Statistical significance for genomewide studies , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[32]  T. Jaakkola,et al.  Comparing the continuous representation of time-series expression profiles to identify differentially expressed genes , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[33]  Kelli Montgomery,et al.  Gene expression in the normal adult human kidney assessed by complementary DNA microarray. , 2003, Molecular biology of the cell.

[34]  Lingli Wang,et al.  A Transcriptional Profile of Aging in the Human Kidney , 2004, PLoS biology.

[35]  S. Cohen,et al.  Connecting proliferation and apoptosis in development and disease , 2004, Nature Reviews Molecular Cell Biology.

[36]  John D. Storey,et al.  A network-based analysis of systemic inflammation in humans , 2005, Nature.

[37]  Malik Beshir Malik,et al.  Applied Linear Regression , 2005, Technometrics.

[38]  Stephen E. Fienberg,et al.  Testing Statistical Hypotheses , 2005 .

[39]  James O. Ramsay Functional Data Analysis , 2005 .

[40]  M. Kenward,et al.  An Introduction to the Bootstrap , 2007 .