Ranking Breast Cancer Drugs and Biomarkers Identification Using Machine Learning and Pharmacogenomics.

Breast cancer is one of the major causes of death in women worldwide. It is a diverse illness with substantial intersubject heterogeneity, even among individuals with the same type of tumor, and customized therapy has become increasingly important in this sector. Because of the clinical and physical variability of different kinds of breast cancers, multiple staging and classification systems have been developed. As a result, these tumors exhibit a wide range of gene expression and prognostic indicators. To date, no comprehensive investigation of model training procedures on information from numerous cell line screenings has been conducted together with radiation data. We used human breast cancer cell lines and drug sensitivity information from Cancer Cell Line Encyclopedia (CCLE) and Genomics of Drug Sensitivity in Cancer (GDSC) databases to scan for potential drugs using cell line data. The results are further validated through three machine learning approaches: Elastic Net, LASSO, and Ridge. Next, we selected top-ranked biomarkers based on their role in breast cancer and tested them further for their resistance to radiation using the data from the Cleveland database. We have identified six drugs named Palbociclib, Panobinostat, PD-0325901, PLX4720, Selumetinib, and Tanespimycin that significantly perform on breast cancer cell lines. Also, five biomarkers named TNFSF15, DCAF6, KDM6A, PHETA2, and IFNGR1 are sensitive to all six shortlisted drugs and show sensitivity to the radiations. The proposed biomarkers and drug sensitivity analysis are helpful in translational cancer studies and provide valuable insights for clinical trial design.

[1]  C. Huber,et al.  Glycosphingolipid-Glycan Signatures of Acute Myeloid Leukemia Cell Lines Reflect Hematopoietic Differentiation , 2022, Journal of proteome research.

[2]  B. Haibe-Kains,et al.  Evaluation of statistical approaches for association testing in noisy drug screening data , 2021, BMC Bioinformatics.

[3]  M. Gasparini,et al.  Ridge regression and its applications in genetic studies , 2021, PloS one.

[4]  Cheng-Dong Li,et al.  Bringing Structural Implications and Deep Learning-Based Drug Identification for KRAS Mutants , 2021, J. Chem. Inf. Model..

[5]  T. Maurer,et al.  The advantages of describing covalent inhibitor in vitro potencies by IC50 at a fixed time point. IC50 determination of covalent inhibitors provides meaningful data to medicinal chemistry for SAR optimization. , 2020, Bioorganic & medicinal chemistry.

[6]  J. Chiang,et al.  Comprehensive assessments of germline deletion structural variants reveal the association between prognostic MUC4 and CEP72 deletions and immune response gene expression in colorectal cancer patients , 2020, Human genomics.

[7]  Emily M. Coonrod,et al.  Pan-cancer proteogenomic analysis reveals long and circular noncoding RNAs encoding peptides , 2020, NAR cancer.

[8]  Afees A. Salisu,et al.  The COVID-19 global fear index and the predictability of commodity price returns , 2020, Journal of Behavioral and Experimental Finance.

[9]  Dan Zhou,et al.  Circ_DCAF6 potentiates cell stemness and growth in breast cancer through GLI1-Hedgehog pathway. , 2020, Experimental and molecular pathology.

[10]  M. Dolan,et al.  Clinical and genetic risk factors for radiation‐associated ototoxicity: A report from the Childhood Cancer Survivor Study and the St. Jude Lifetime Cohort , 2020, Cancer.

[11]  Rajiv Gandhi Govindaraj,et al.  Extremely-randomized-tree-based Prediction of N6-Methyladenosine Sites in Saccharomyces cerevisiae , 2020, Current genomics.

[12]  Florian Heiss,et al.  Nonparametric estimation of the random coefficients model: An elastic net approach , 2019, Journal of Econometrics.

[13]  T. Aittokallio,et al.  Machine learning and feature selection for drug response prediction in precision oncology applications , 2018, Biophysical Reviews.

[14]  Luis Tobalina,et al.  How to find the right drug for each patient? Advances and challenges in pharmacogenomics , 2018, Current opinion in systems biology.

[15]  D. Alavantic,et al.  The HACD4 haplotype as a risk factor for atherosclerosis in males. , 2018, Gene.

[16]  Xiaofeng Dai,et al.  Breast Cancer Cell Line Classification and Its Relevance with Breast Tumor Subtyping , 2017, Journal of Cancer.

[17]  Mohamed F. Ghalwash,et al.  Minimum redundancy maximum relevance feature selection approach for temporal gene expression data , 2017, BMC Bioinformatics.

[18]  Nancy J. Cox,et al.  Consistency in large pharmacogenomic studies , 2016, Nature.

[19]  Marc R. Birtwistle,et al.  Drug response consistency in CCLE and CGP , 2016, Nature.

[20]  Krister Wennerberg,et al.  Consistency in drug response profiling , 2016, Nature.

[21]  Lodewyk F. A. Wessels,et al.  TANDEM: a two-stage approach to maximize interpretability of drug response models based on multiple molecular data types , 2016, Bioinform..

[22]  Emanuel J. V. Gonçalves,et al.  A Landscape of Pharmacogenomic Interactions in Cancer , 2016, Cell.

[23]  David Robinson,et al.  tidytext: Text Mining and Analysis Using Tidy Data Principles in R , 2016, J. Open Source Softw..

[24]  Tero Aittokallio,et al.  Drug response prediction by inferring pathway-response associations with kernelized Bayesian matrix factorization , 2016, Bioinform..

[25]  Scott E. Martin,et al.  Reproducible pharmacogenomic profiling of cancer cell line panels , 2016, Nature.

[26]  Mechthild Krause,et al.  Radiation oncology in the era of precision medicine , 2016, Nature Reviews Cancer.

[27]  Andrew H. Beck,et al.  PharmacoGx: an R package for analysis of large pharmacogenomic datasets , 2015, Bioinform..

[28]  Michael P. Morrissey,et al.  Pharmacogenomic agreement between two cancer cell line data sets , 2015, Nature.

[29]  Andrew H. Beck,et al.  Revisiting inconsistency in large pharmacogenomic studies , 2015, bioRxiv.

[30]  Chun Xing Li,et al.  Anticancer drug sensitivity prediction in cell lines from baseline gene expression through recursive feature selection , 2015, BMC Cancer.

[31]  Steffen Falgreen,et al.  Predicting response to multidrug regimens in cancer patients using cell line experiments and regularised regression models , 2015, BMC Cancer.

[32]  Xiaoqi Zheng,et al.  DISIS: Prediction of Drug Response through an Iterative Sure Independence Screening , 2015, PloS one.

[33]  Krister Wennerberg,et al.  Integrative and Personalized QSAR Analysis in Cancer by Kernelized Bayesian Matrix Factorization , 2014, J. Chem. Inf. Model..

[34]  N. Cox,et al.  Clinical drug response can be predicted using baseline gene expression levels and in vitro drug sensitivity in cell lines , 2014, Genome Biology.

[35]  Fang Fang,et al.  The H3K27me3 demethylase UTX is a gender-specific tumor suppressor in T-cell acute lymphoblastic leukemia. , 2014, Blood.

[36]  Benjamin Haibe-Kains,et al.  Inconsistency in large pharmacogenomic studies , 2013, Nature.

[37]  Justin Guinney,et al.  Systematic Assessment of Analytical Methods for Drug Sensitivity Prediction from Cancer Cell Line Data , 2013, Pacific Symposium on Biocomputing.

[38]  Benjamin Haibe-Kains,et al.  Research and applications: Comparison and validation of genomic predictors for anticancer drug sensitivity , 2013, J. Am. Medical Informatics Assoc..

[39]  Julio Saez-Rodriguez,et al.  Machine Learning Prediction of Cancer Cell Sensitivity to Drugs Based on Genomic and Chemical Properties , 2012, PloS one.

[40]  Sridhar Ramaswamy,et al.  Genomics of Drug Sensitivity in Cancer (GDSC): a resource for therapeutic biomarker discovery in cancer cells , 2012, Nucleic Acids Res..

[41]  Adam A. Margolin,et al.  The Cancer Cell Line Encyclopedia enables predictive modeling of anticancer drug sensitivity , 2012, Nature.

[42]  X. Gu,et al.  Down-modulation of TNFSF15 in ovarian cancer by VEGF and MCP-1 is a pre-requisite for tumor neovascularization , 2012, Angiogenesis.

[43]  B. Shen,et al.  Growth Factors , Cytokines , and Cell Cycle Molecules Modulation of IFN-Receptor 1 Expression by AP-2 Influences IFN-Sensitivity of Cancer Cells , 2012 .

[44]  H. Gómez,et al.  Chemotherapy resistance in metastatic breast cancer: the evolving role of ixabepilone , 2010, Breast Cancer Research.

[45]  R. Chen,et al.  Comprehensive expression-based isoform biomarkers predictive of drug responses based on isoform co-expression networks and clinical data. , 2019, Genomics.