Testing association of a pathway with survival using gene expression data

MOTIVATION A recent surge of interest in survival as the primary clinical endpoint of microarray studies has called for an extension of the Global Test methodology to survival. RESULTS We present a score test for association of the expression profile of one or more groups of genes with a (possibly censored) survival time. Groups of genes may be pathways, areas of the genome, clusters from a cluster analysis or all genes on a chip. The test allows one to test hypotheses about the influence of these groups of genes on survival directly, without the intermediary of single gene testing. The test is based on the Cox proportional hazards model and is calculated using martingale residuals. It is possible to adjust the test for the presence of covariates. We also present a diagnostic graph to assist in the interpretation of the test result, visualizing the influence of genes. The test is applied to a tumor dataset, revealing pathways from the gene ontology database that are associated with survival of patients. AVAILABILITY The Global Test for survival has been incorporated into the R-package globaltest (version 3.0), available at http://www.bioconductor.org

[1]  Jun Yan Survival Analysis: Techniques for Censored and Truncated Data , 2004 .

[2]  Y. Benjamini,et al.  THE CONTROL OF THE FALSE DISCOVERY RATE IN MULTIPLE TESTING UNDER DEPENDENCY , 2001 .

[3]  Niels Keiding,et al.  Statistical Models Based on Counting Processes , 1993 .

[4]  M. Kendall Theoretical Statistics , 1956, Nature.

[5]  Yudong D. He,et al.  Gene expression profiling predicts clinical outcome of breast cancer , 2002, Nature.

[6]  Jelle J. Goeman,et al.  A global test for groups of genes: testing association with a clinical outcome , 2004, Bioinform..

[7]  David E. Misek,et al.  Gene-expression profiles predict survival of patients with lung adenocarcinoma , 2002, Nature Medicine.

[8]  R. Tibshirani The lasso method for variable selection in the Cox model. , 1997, Statistics in medicine.

[9]  Y Pawitan,et al.  Gene expression profiling for prognosis using Cox regression , 2004, Statistics in medicine.

[10]  Rafael A. Irizarry,et al.  A Model-Based Background Adjustment for Oligonucleotide Expression Arrays , 2004 .

[11]  D. Harrington,et al.  Counting Processes and Survival Analysis , 1991 .

[12]  Yudong D. He,et al.  A Gene-Expression Signature as a Predictor of Survival in Breast Cancer , 2002 .

[13]  M. Tyers,et al.  Molecular profiling of non-small cell lung cancer and correlation with disease-free survival. , 2002, Cancer research.

[14]  Theo Stijnen,et al.  A goodness-of-fit test for Cox's proportional hazards model based on martingale residuals , 1998 .

[15]  H C van Houwelingen,et al.  Testing the fit of a regression model via score tests in random effects models. , 1995, Biometrics.

[16]  Van,et al.  A gene-expression signature as a predictor of survival in breast cancer. , 2002, The New England journal of medicine.

[17]  W. Kuo,et al.  Associations between gene expressions in breast cancer and patient survival , 2002, Human Genetics.

[18]  J P Klein,et al.  Testing for centre effects in multi-centre survival studies: a Monte Carlo comparison of fixed and random effects tests. , 1999, Statistics in medicine.