Exploratory failure time analysis in large scale genomics

In large scale genomic analyses dealing with detecting genotype-phenotype associations, such as genome wide association studies (GWAS), it is desirable to have numerically and statistically robust procedures to test the stochastic independence null hypothesis against certain alternatives. Motivated by a special case in a GWAS, a novel test procedure called correlation profile test (CPT) is developed for testing genomic associations with failure-time phenotypes subject to right censoring and competing risks. Performance and operating characteristics of CPT are investigated and compared to existing approaches, by a simulation study and on a real dataset. Compared to popular choices of semiparametric and nonparametric methods, CPT has three advantages: it is numerically more robust because it solely relies on sample moments; it is more robust against the violation of the proportional hazards condition; and it is more flexible in handling various failure and censoring scenarios. CPT is a general approach to testing the null hypothesis of stochastic independence between a failure event point process and any random variable; thus it is widely applicable beyond genomic studies.

[1]  D Scharfstein,et al.  Methods for Conducting Sensitivity Analysis of Trials with Potentially Nonignorable Competing Causes of Censoring , 2001, Biometrics.

[2]  Robert Gray,et al.  A Proportional Hazards Model for the Subdistribution of a Competing Risk , 1999 .

[3]  J. Kalbfleisch,et al.  The Statistical Analysis of Failure Time Data , 1980 .

[4]  P. Major,et al.  Strong Embedding of the Estimator of the Distribution Function under Random Censorship , 1988 .

[5]  Roderick J. A. Little,et al.  Adjusting for Nonignorable Drop-Out Using Semiparametric Nonresponse Models: Comment , 1999 .

[6]  Daryl J. Daley,et al.  An Introduction to the Theory of Point Processes , 2013 .

[7]  Zhiliang Ying,et al.  Linear regression analysis of censored survival data based on rank tests , 1990 .

[8]  J. Robins,et al.  Adjusting for Nonignorable Drop-Out Using Semiparametric Nonresponse Models , 1999 .

[9]  I. James,et al.  Linear regression with censored data , 1979 .

[10]  Charles J. Geyer,et al.  Computational Methods for Semiparametric Linear Regression with Censored Data , 1992 .

[11]  Yuan Ji,et al.  Applications of beta-mixture models in bioinformatics , 2005, Bioinform..

[12]  E. Laska,et al.  Nonparametric estimation and testing in a cure model. , 1992, Biometrics.

[13]  D. Harrington,et al.  Counting Processes and Survival Analysis , 1991 .

[14]  Cheng Cheng,et al.  Statistical Significance Threshold Criteria For Analysis of Microarray Gene Expression Data , 2004, Statistical applications in genetics and molecular biology.

[15]  Lajos Horváth,et al.  Strong approximations of the quantile process of the product-limit estimator , 1985 .

[16]  R. Gray A Class of $K$-Sample Tests for Comparing the Cumulative Incidence of a Competing Risk , 1988 .

[17]  David R. Cox,et al.  Regression models and life tables (with discussion , 1972 .

[18]  Z. Ying,et al.  Accelerated failure time models for counting processes , 1998 .