Survival analysis of longitudinal microarrays

MOTIVATION The development of methods for linking gene expressions to various clinical and phenotypic characteristics is an active area of genomic research. Scientists hope that such analysis may, for example, describe relationships between gene function and clinical events such as death or recovery. Methods are available for relating gene expression to measurements that are categorized or continuous, but there is less work in relating expressions to an observed event time such as time to death, response or relapse. When gene expressions are measured over time, there are methods for differentiating temporal patterns. However, methods have not yet been proposed for the survival analysis of longitudinally collected microarrays. RESULTS We describe an approach for the survival analysis of longitudinal gene expression data. We construct a measure of association between the time to an event and gene expressions collected over time. Statistical significance is addressed using permutations and control of the false discovery rate. Our proposed method is illustrated on a dataset from a multi-center research study of inflammation and response to injury that aims to uncover the biological reasons why patients can have dramatically different outcomes after suffering a traumatic injury (www.gluegrant.org).

[1]  Meland,et al.  THE USE OF MOLECULAR PROFILING TO PREDICT SURVIVAL AFTER CHEMOTHERAPY FOR DIFFUSE LARGE-B-CELL LYMPHOMA , 2002 .

[2]  Michael P. Jones,et al.  A general class of nonparametric tests for survival analysis. , 1989 .

[3]  Lu Tian,et al.  Linking gene expression data with patient survival times using partial least squares , 2002, ISMB.

[4]  Wing Hung Wong,et al.  Model-based analysis of oligonucleotide arrays and issues in cDNA microarray analysis , 2003 .

[5]  N. Jewell,et al.  Hypothesis testing of regression parameters in semiparametric generalized linear models for cluster correlated data , 1990 .

[6]  Kouros Owzar,et al.  A multiple testing procedure to associate gene expression levels with survival , 2005, Statistics in medicine.

[7]  Michael Peacock,et al.  Hierarchical Clustering Analysis of Tissue Microarray Immunostaining Data Identifies Prognostically Significant Groups of Breast Carcinoma , 2004, Clinical Cancer Research.

[8]  D.,et al.  Regression Models and Life-Tables , 2022 .

[9]  R. Tibshirani,et al.  Gene expression patterns of breast carcinomas distinguish tumor subclasses with clinical implications , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[10]  M. Zelen,et al.  Urn Sampling and the Proportional Hazard Model , 1998, Lifetime data analysis.

[11]  R. Tibshirani,et al.  Significance analysis of microarrays applied to the ionizing radiation response , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[12]  Van,et al.  A gene-expression signature as a predictor of survival in breast cancer. , 2002, The New England journal of medicine.

[13]  Roger E Bumgarner,et al.  Clustering gene-expression data with repeated measurements , 2003, Genome Biology.

[14]  D. Ross,et al.  Microarrays and molecular markers for tumor classification , 2002, Genome Biology.

[15]  J. Crowley,et al.  A general class of nonparametric tests for survival analysis. , 1989, Biometrics.

[16]  Danh V. Nguyen,et al.  Partial least squares proportional hazard regression for application to DNA microarray survival data , 2002, Bioinform..

[17]  Yudong D. He,et al.  Gene expression profiling predicts clinical outcome of breast cancer , 2002, Nature.

[18]  Danh V. Nguyen,et al.  Assessing Patient Survival Using Microarray Gene Expression Data Via Partial Least Squares Proportional Hazard Regression , 2003 .

[19]  Wei Pan,et al.  Gene expression A note on using permutation-based false discovery rate estimates to compare different analysis methods for microarray data , 2005 .

[20]  S. Young,et al.  p Value Adjustments for Multiple Tests in Multivariate Binomial Models , 1989 .

[21]  Yudong D. He,et al.  A Gene-Expression Signature as a Predictor of Survival in Breast Cancer , 2002 .

[22]  Jiang Gui,et al.  Penalized Cox regression analysis in the high-dimensional and low-sample size settings, with applications to microarray gene expression data , 2005, Bioinform..

[23]  Hongzhe Li,et al.  Kernel Cox Regression Models for Linking Gene Expression Profiles to Censored Survival Data , 2002, Pacific Symposium on Biocomputing.

[24]  Cheng Li,et al.  DNA-Chip Analyzer (dChip) , 2003 .

[25]  John D. Storey,et al.  Statistical significance for genomewide studies , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[26]  John D. Storey,et al.  Empirical Bayes Analysis of a Microarray Experiment , 2001 .

[27]  John D. Storey,et al.  Significance analysis of time course microarray experiments. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[28]  D. Cox Regression Models and Life-Tables , 1972 .

[29]  J. Mesirov,et al.  Molecular classification of cancer: class discovery and class prediction by gene expression monitoring. , 1999, Science.

[30]  Anastasios A. Tsiatis,et al.  A semiparametric estimator for the proportional hazards model with longitudinal covariates measured with error , 2001 .

[31]  Jiang Gui,et al.  Partial Cox regression analysis for high-dimensional microarray gene expression data , 2004, ISMB/ECCB.

[32]  Ash A. Alizadeh,et al.  Distinct types of diffuse large B-cell lymphoma identified by gene expression profiling , 2000, Nature.

[33]  Raymond J. Carroll,et al.  Conditional scores and optimal scores for generalized linear measurement-error models , 1987 .

[34]  C. Li,et al.  Model-based analysis of oligonucleotide arrays: expression index computation and outlier detection. , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[35]  Hongzhe Li,et al.  Clustering of time-course gene expression data using a mixed-effects model with B-splines , 2003, Bioinform..

[36]  Marie Davidian,et al.  A Semiparametric Likelihood Approach to Joint Modeling of Longitudinal and Time‐to‐Event Data , 2002, Biometrics.

[37]  R. Tibshirani,et al.  Semi-Supervised Methods to Predict Patient Survival from Gene Expression Data , 2004, PLoS biology.