On the gene ranking of replicated microarray time course data

Consider the gene ranking problem of replicated microarray time course experiments where there are multiple biological conditions, and genes of interest are those whose temporal profiles are different across conditions. We derive the multi-sample multivariate empirical Bayes statistic for ranking genes in the order of differential expression, from both longitudinal and cross-sectional replicated developmental microarray time course data. Our longitudinal multi-sample model assumes that time course replicates are i.i.d. multivariate normal vectors. On the other hand, we construct our cross-sectional model using a normal regression framework with any appropriate basis for the design matrices. In both cases, we use natural conjugate priors in our empirical Bayes setting which guarantee closed form solutions 1 for the posterior odds. Our simulations and two case studies using published worm and mouse microarray time course datasets indicate that the proposed approaches work well. keywords: longitudinal; cross-sectional; microarray time course; gene ranking; empirical Bayes.

[1]  N. L. Johnson,et al.  Multivariate Analysis , 1958, Nature.

[2]  Howard Raiffa,et al.  Applied Statistical Decision Theory. , 1961 .

[3]  John Aitchison,et al.  Statistical Prediction Analysis , 1975 .

[4]  S. J. Press,et al.  Applied multivariate analysis : using Bayesian and frequentist methods of inference , 1984 .

[5]  Kazuo Shigemasu,et al.  Bayesian manova and manocova under exchangeability , 1985 .

[6]  Peter J. Rousseeuw,et al.  Robust regression and outlier detection , 1987 .

[7]  S. James Press,et al.  Bayesian inference in the multivariate mixed model MANOVA , 1997 .

[8]  D. Botstein,et al.  The transcriptional program of sporulation in budding yeast. , 1998, Science.

[9]  Ronald W. Davis,et al.  A genome-wide transcriptional analysis of the mitotic cell cycle. , 1998, Molecular cell.

[10]  Michael Ruogu Zhang,et al.  Comprehensive identification of cell cycle-regulated genes of the yeast Saccharomyces cerevisiae by microarray hybridization. , 1998, Molecular biology of the cell.

[11]  Kai-Florian Storch,et al.  Extensive and divergent circadian gene expression in liver and heart , 2002, Nature.

[12]  Stuart K. Kim,et al.  Global analysis of dauer gene expression in Caenorhabditis elegans , 2003, Development.

[13]  T. Jaakkola,et al.  Comparing the continuous representation of time-series expression profiles to identify differentially expressed genes , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[14]  Kazuo Yagi,et al.  Gene expression profile of long‐lived Ames dwarf mice and Little mice , 2004, Aging cell.

[15]  B. Ripley,et al.  Robust Statistics , 2018, Encyclopedia of Mathematical Geosciences.

[16]  Gordon K Smyth,et al.  Statistical Applications in Genetics and Molecular Biology Linear Models and Empirical Bayes Methods for Assessing Differential Expression in Microarray Experiments , 2011 .

[17]  Gordon K. Smyth,et al.  limma: Linear Models for Microarray Data , 2005 .

[18]  John D. Storey,et al.  Significance analysis of time course microarray experiments. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[19]  T. Speed,et al.  Statistical Analysis of Microarray Time Course Data , 2005 .

[20]  Christina Kendziorski,et al.  Hidden Markov Models for Microarray Time Course Data in Multiple Biological Conditions , 2006 .

[21]  T. Speed,et al.  A multivariate empirical Bayes statistic for replicated microarray time course data , 2006, math/0702685.

[22]  F Hong,et al.  Functional Hierarchical Models for Identifying Genes with Different Time‐Course Expression Profiles , 2006, Biometrics.