Integrated Analysis of Pharmacokinetic, Clinical, and SNP Microarray Data Using Projection onto the Most Interesting Statistical Evidence with Adaptive Permutation Testing

Powerful methods for integrated analysis of multiple biological data sets must be developed to maximize researchers’ ability to interpret them and acquire meaningful knowledge. Projection onto the most interesting statistical evidence (PROMISE) is a powerful statistical procedure that has been recently developed to incorporate a biological paradigm for the relationships among endpoint variables into an integrated analysis of microarray gene expression data with multiple biological and clinical endpoints. Here, PROMISE is adapted to perform an integrated analysis of pharmacokinetic, clinical, and genome-wide genotype data that incorporates a clinically relevant biological paradigm for pharmacokinetic and clinical response data. An efficient permutation-testing algorithm is introduced so that statistical calculations are computationally feasible in this higher-dimension setting. The new method is applied to a pediatric leukemia data set. The results clearly indicate that PROMISE can be a very powerful statistical tool for identifying genomic features that exhibit a pattern of association with multiple endpoint variables which is concordant with a practically useful biological paradigm.

[1]  G. Casella,et al.  Statistical Inference , 2003, Encyclopedia of Social Network Analysis and Mining.

[2]  Cheng Cheng,et al.  Acquired variation outweighs inherited variation in whole genome analysis of methotrexate polyglutamate accumulation in leukemia. , 2009, Blood.

[3]  R. Gelber,et al.  Improved outcome for children with acute lymphoblastic leukemia: results of Dana-Farber Consortium Protocol 91-01. , 2001, Blood.

[4]  Elaine Coustan-Smith,et al.  A simplified flow cytometric assay identifies children with acute lymphoblastic leukemia who have a superior clinical outcome. , 2006, Blood.

[5]  Terence P. Speed,et al.  Genome analysis A genotype calling algorithm for affymetrix SNP arrays , 2005 .

[6]  P. Good,et al.  Permutation Tests: A Practical Guide to Resampling Methods for Testing Hypotheses , 1995 .

[7]  Cheng Cheng,et al.  In Vivo Response to Methotrexate Forecasts Outcome of Acute Lymphoblastic Leukemia and Has a Distinct Gene Expression Profile , 2008, PLoS medicine.

[8]  Dario Campana,et al.  Role of minimal residual disease monitoring in adult and pediatric acute lymphoblastic leukemia. , 2009, Hematology/oncology clinics of North America.

[9]  N. Giri Multivariate Statistical Analysis : Revised And Expanded , 2003 .

[10]  C. Borror Nonparametric Statistical Methods, 2nd, Ed. , 2001 .

[11]  Cheng Cheng,et al.  Treating childhood acute lymphoblastic leukemia without cranial irradiation. , 2009, The New England journal of medicine.

[12]  Zhen Jiang,et al.  Bioconductor Project Bioconductor Project Working Papers Year Paper Extensions to Gene Set Enrichment , 2013 .

[13]  Cheng Cheng,et al.  PROMISE: a tool to identify genomic features with a specific biologically interesting pattern of associations with multiple endpoint variables , 2009, Bioinform..

[14]  Cheng Cheng,et al.  Improved outcome for children with acute lymphoblastic leukemia: results of Total Therapy Study XIIIB at St Jude Children's Research Hospital. , 2004, Blood.

[15]  Douglas A. Wolfe,et al.  Nonparametric Statistical Methods , 1973 .