Integrative analysis for identifying joint modular patterns of gene-expression and drug-response data

MOTIVATION The underlying relationship between genomic factors and the response of diverse cancer drugs still remains unclear. A number of studies showed that the heterogeneous responses to anticancer treatments of patients were partly associated with their specific changes in gene expression and somatic alterations. The emerging large-scale pharmacogenomic data provide us valuable opportunities to improve existing therapies or to guide early-phase clinical trials of compounds under development. However, how to identify the underlying combinatorial patterns among pharmacogenomics data are still a challenging issue. RESULTS In this study, we adopted a sparse network-regularized partial least square (SNPLS) method to identify joint modular patterns using large-scale pairwise gene-expression and drug-response data. We incorporated a molecular network to the (sparse) partial least square model to improve the module accuracy via a network-based penalty. We first demonstrated the effectiveness of SNPLS using a set of simulation data and compared it with two typical methods. Further, we applied it to gene expression profiles for 13 321 genes and pharmacological profiles for 98 anticancer drugs across 641 cancer cell lines consisting of diverse types of human cancers. We identified 20 gene-drug co-modules, each of which consists of 30 cell lines, 137 genes and 2 drugs on average. The majority of identified co-modules have significantly functional implications and coordinated gene-drug associations. The modular analysis here provided us new insights into the molecular mechanisms of how drugs act and suggested new drug targets for therapy of certain types of cancers. AVAILABILITY AND IMPLEMENTATION A matlab package of SNPLS is available at http://page.amss.ac.cn/shihua.zhang/ CONTACT : zsh@amss.ac.cn SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.

[1]  J. Utikal,et al.  Combined BRAF and MEK inhibition versus BRAF inhibition alone in melanoma. , 2014, The New England journal of medicine.

[2]  Stephen J Benkovic,et al.  Interaction of dihydrofolate reductase with methotrexate: Ensemble and single-molecule kinetics , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[3]  Jin Liu,et al.  Incorporating Network Structure in Integrative Analysis of Cancer Prognosis Data , 2013, Genetic epidemiology.

[4]  Andrew M. Gross,et al.  Network-based stratification of tumor mutations , 2013, Nature Methods.

[5]  B. Kowalski,et al.  Partial least-squares regression: a tutorial , 1986 .

[6]  J. Medina-Franco,et al.  Shifting from the single to the multitarget paradigm in drug discovery. , 2013, Drug discovery today.

[7]  David S. Wishart,et al.  DrugBank 4.0: shedding new light on drug metabolism , 2013, Nucleic Acids Res..

[8]  Adam A. Margolin,et al.  The Cancer Cell Line Encyclopedia enables predictive modeling of anticancer drug sensitivity , 2012, Nature.

[9]  Sven Bergmann,et al.  A modular approach for integrative analysis of large-scale gene-expression and drug-response data , 2008, Nature Biotechnology.

[10]  Marko Grobelnik,et al.  Subspace, Latent Structure and Feature Selection, Statistical and Optimization, Perspectives Workshop, SLSFS 2005, Bohinj, Slovenia, February 23-25, 2005, Revised Selected Papers , 2006, SLSFS.

[11]  Philippe Besse,et al.  Statistical Applications in Genetics and Molecular Biology A Sparse PLS for Variable Selection when Integrating Omics Data , 2011 .

[12]  G. V. Paolini,et al.  Global mapping of pharmacological space , 2006, Nature Biotechnology.

[13]  S. Ramaswamy,et al.  Systematic identification of genomic markers of drug sensitivity in cancer cells , 2012, Nature.

[14]  Damien Kee,et al.  Acquired resistance to BRAF inhibitors mediated by a RAF kinase switch in melanoma can be overcome by cotargeting MEK and IGF-1R/PI3K. , 2010, Cancer cell.

[15]  Masato Hareyama,et al.  The combination of olaparib and camptothecin for effective radiosensitization , 2012, Radiation oncology.

[16]  J. Reid,et al.  Fixed dose combination therapy in the treatment of hypertension , 2002, Journal of Human Hypertension.

[17]  David S. Wishart,et al.  DrugBank: a knowledgebase for drugs, drug actions and drug targets , 2007, Nucleic Acids Res..

[18]  R Berger,et al.  All-trans retinoic acid as a differentiation therapy for acute promyelocytic leukemia. I. Clinical results. , 1990, Blood.

[19]  J. Sebolt-Leopold,et al.  CI-1040 (PD184352), a targeted signal transduction inhibitor of MEK (MAPKK). , 2003, Seminars in oncology.

[20]  Anne-Laure Boulesteix,et al.  Partial least squares: a versatile tool for the analysis of high-dimensional genomic data , 2006, Briefings Bioinform..

[21]  Z A Arlin,et al.  Treatment of acute promyelocytic leukemia. , 1992, Blood.

[22]  Tsviya Olender,et al.  GeneCards Version 3: the human gene integrator , 2010, Database J. Biol. Databases Curation.

[23]  Zhen-yi Wang,et al.  Use of all-trans retinoic acid in the treatment of acute promyelocytic leukemia. , 1988, Haematology and blood transfusion.

[24]  K. Flaherty,et al.  Combined BRAF and MEK inhibition in melanoma with BRAF V600 mutations. , 2012, The New England journal of medicine.

[25]  Gary D. Bader,et al.  Pathway Commons, a web resource for biological pathway data , 2010, Nucleic Acids Res..

[26]  Hongzhe Li,et al.  In Response to Comment on "Network-constrained regularization and variable selection for analysis of genomic data" , 2008, Bioinform..

[27]  Juan Liu,et al.  A novel computational framework for simultaneous integration of multiple types of genomic data to identify microRNA-gene regulatory modules , 2011, Bioinform..

[28]  Paul A Clemons,et al.  The Connectivity Map: Using Gene-Expression Signatures to Connect Small Molecules, Genes, and Disease , 2006, Science.

[29]  R. Shoemaker The NCI60 human tumour cell line anticancer drug screen , 2006, Nature Reviews Cancer.

[30]  David S. Wishart,et al.  DrugBank: a comprehensive resource for in silico drug discovery and exploration , 2005, Nucleic Acids Res..

[31]  Shi-Hua Zhang,et al.  Detecting disease associated modules and prioritizing active genes based on high throughput data , 2010, BMC Bioinformatics.

[32]  Gary D. Bader,et al.  The GeneMANIA prediction server: biological network integration for gene prioritization and predicting gene function , 2010, Nucleic Acids Res..

[33]  Do Joong Park,et al.  Combining PARP-1 Inhibition and Radiation in Ewing Sarcoma Results in Lethal DNA Damage , 2013, Molecular Cancer Therapeutics.

[34]  Luo Xiao,et al.  Learning regulatory programs by threshold SVD regression , 2014, Proceedings of the National Academy of Sciences.

[35]  Xing-Ming Zhao,et al.  Prediction of Drug Combinations by Integrating Molecular and Pharmacological Data , 2011, PLoS Comput. Biol..

[36]  J. Drews Drug discovery: a historical perspective. , 2000, Science.

[37]  K. Tsuda,et al.  Mining Significant Substructure Pairs for Interpreting Polypharmacology in Drug-Target Network , 2011, PloS one.

[38]  David S. Wishart,et al.  DrugBank 3.0: a comprehensive resource for ‘Omics’ research on drugs , 2010, Nucleic Acids Res..

[39]  Kung-Sik Chan,et al.  Reduced rank stochastic regression with a sparse singular value decomposition , 2012 .

[40]  David A. Cheresh,et al.  Integrins in cancer: biological implications and therapeutic opportunities , 2010, Nature Reviews Cancer.

[41]  Arul M Chinnaiyan,et al.  PARP-1 inhibition as a targeted strategy to treat Ewing's sarcoma. , 2012, Cancer research.

[42]  Roman Rosipal,et al.  Overview and Recent Advances in Partial Least Squares , 2005, SLSFS.

[43]  Shuxing Zhang,et al.  Polypharmacology: drug discovery for the future , 2013, Expert review of clinical pharmacology.

[44]  Keith T Flaherty,et al.  Targeted molecular therapy in melanoma. , 2010, Seminars in cutaneous medicine and surgery.

[45]  A. Hopkins,et al.  The druggable genome , 2002, Nature Reviews Drug Discovery.

[46]  T. Hubbard,et al.  A census of human cancer genes , 2004, Nature Reviews Cancer.

[47]  Hongzhe Li,et al.  VARIABLE SELECTION AND REGRESSION ANALYSIS FOR GRAPH-STRUCTURED COVARIATES WITH AN APPLICATION TO GENOMICS. , 2010, The annals of applied statistics.

[48]  Péter Csermely,et al.  The efficiency of multi-target drugs: the network approach might help drug design. , 2004, Trends in pharmacological sciences.

[49]  Michael R. Kosorok,et al.  Identification of differential gene pathways with principal component analysis , 2009, Bioinform..

[50]  Luonan Chen,et al.  Discovering functions and revealing mechanisms at molecular level from biological networks , 2007, Proteomics.

[51]  S Hiraki,et al.  [Anticancer drug sensitivity tested by tumor stem cell assay]. , 1984, Gan no rinsho. Japan journal of cancer clinics.

[52]  Jason H. Moore,et al.  Systems genetics for drug target discovery. , 2011, Trends in pharmacological sciences.

[53]  Shi-Hua Zhang,et al.  Identifying multi-layer gene regulatory modules from multi-dimensional genomic data , 2012, Bioinform..

[54]  S. Keleş,et al.  Sparse partial least squares regression for simultaneous dimension reduction and variable selection , 2010, Journal of the Royal Statistical Society. Series B, Statistical methodology.

[55]  X-S Zhang,et al.  Predicting cooperative drug effects through the quantitative cellular profiling of response to individual drugs , 2014, CPT: pharmacometrics & systems pharmacology.

[56]  Jin-jian Lu,et al.  Multi-Target Drugs: The Trend of Drug Research and Development , 2012, PloS one.

[57]  A. Hopkins Network pharmacology: the next paradigm in drug discovery. , 2008, Nature chemical biology.

[58]  Hua Guo,et al.  Combating trastuzumab resistance by targeting SRC, a common node downstream of multiple resistance pathways , 2011, Nature Medicine.

[59]  P. Laird,et al.  Discovery of multi-dimensional modules by integrative analysis of cancer genomic data , 2012, Nucleic acids research.

[60]  Brad T. Sherman,et al.  Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources , 2008, Nature Protocols.

[61]  M. Sanz,et al.  Treatment of acute promyelocytic leukemia. , 2006, Hematology. American Society of Hematology. Education Program.

[62]  Robert W Veltri,et al.  Nuclear Morphometry, Epigenetic Changes, and Clinical Relevance in Prostate Cancer , 2013, Advances in experimental medicine and biology.

[63]  F. Bushman,et al.  Structure-constrained sparse canonical correlation analysis with an application to microbiome data analysis. , 2013, Biostatistics.

[64]  Syed U Bokhari,et al.  Beneficial Effects of a Glyburide/Metformin Combination Preparation in Type 2 Diabetes Mellitus , 2003, The American journal of the medical sciences.