A transfer learning approach via Procrustes analysis and mean shift for cancer drug sensitivity prediction

Transfer learning (TL) algorithms aim to improve prediction performance in a target task (e.g., predicting cisplatin sensitivity in triple-negative breast cancer patients) by transferring knowledge from auxiliary data of a related task (e.g., predicting docetaxel sensitivity in breast cancer patients), where the distribution and even the feature space of the two tasks' data can differ. In real-world applications, the training set for a target task is sometimes small, while auxiliary data from a related task are available. Supervised learning, however, requires a sufficiently large target training set to predict future test examples of the target task well. In this paper, we propose a TL approach for cancer drug sensitivity prediction that combines three techniques. First, we shift the representation of a subset of examples from the auxiliary data of a related task to a representation closer to the training set of the target task. Second, we align the shifted representation of the selected auxiliary examples to the target training set, obtaining examples whose representation is aligned to the target training set. Third, we train machine learning algorithms on both the target training set and the aligned examples. We evaluate our approach against baseline approaches using the area under the receiver operating characteristic (ROC) curve (AUC) on real clinical trial datasets pertaining to multiple myeloma, non-small cell lung cancer, triple-negative breast cancer, and breast cancer. Experimental results show that our approach outperforms the baseline approaches in terms of AUC, and that the improvements are statistically significant.
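The three steps described in the abstract (shift, align, train) and the AUC evaluation can be illustrated with a minimal NumPy sketch. It is not the authors' implementation, and several details are assumptions made only for illustration: the auxiliary subset is chosen as the rows nearest the target centroid (`select_auxiliary` is a hypothetical rule), the shift is a plain translation of feature means, the alignment is an orthogonal Procrustes rotation with an arbitrary row pairing, both tasks share one feature space, labels are binary (1 = sensitive), and the learner is a closed-form ridge regression standing in for whichever algorithms the paper trains.

```python
import numpy as np

def select_auxiliary(X_aux, y_aux, X_tgt):
    """Hypothetical selection rule: keep as many auxiliary rows as there are
    target rows, choosing those closest to the target centroid."""
    m = X_tgt.shape[0]
    dist = np.linalg.norm(X_aux - X_tgt.mean(axis=0), axis=1)
    idx = np.argsort(dist)[:m]
    return X_aux[idx], y_aux[idx]

def mean_shift(X_sel, X_tgt):
    """Step 1 (assumed form): translate the selected rows so that their
    feature means match those of the target training set."""
    return X_sel - X_sel.mean(axis=0) + X_tgt.mean(axis=0)

def procrustes_align(X_shift, X_tgt):
    """Step 2: orthogonal Procrustes. Find the orthogonal matrix R that
    minimizes ||A R - B||_F for the centered matrices A and B."""
    mu_t = X_tgt.mean(axis=0)
    A = X_shift - X_shift.mean(axis=0)
    B = X_tgt - mu_t
    U, _, Vt = np.linalg.svd(A.T @ B)
    R = U @ Vt
    return A @ R + mu_t

def fit_ridge(X, y, lam=1.0):
    """Step 3 (stand-in learner): ridge regression with a bias column."""
    Xb = np.hstack([X, np.ones((X.shape[0], 1))])
    return np.linalg.solve(Xb.T @ Xb + lam * np.eye(Xb.shape[1]), Xb.T @ y)

def predict(w, X):
    """Score examples with the weights returned by fit_ridge."""
    return np.hstack([X, np.ones((X.shape[0], 1))]) @ w

def auc(scores, labels):
    """AUC as the fraction of (positive, negative) pairs ranked correctly,
    counting ties as half a correct pair."""
    pos, neg = scores[labels == 1], scores[labels == 0]
    wins = (pos[:, None] > neg[None, :]).sum()
    ties = (pos[:, None] == neg[None, :]).sum()
    return (wins + 0.5 * ties) / (len(pos) * len(neg))

# Synthetic stand-in data; the clinical trial datasets are not reproduced here.
rng = np.random.default_rng(0)
X_tgt = rng.normal(size=(20, 5))
y_tgt = (X_tgt[:, 0] > 0).astype(int)
X_aux = rng.normal(loc=1.5, size=(100, 5))
y_aux = (X_aux[:, 0] > 1.5).astype(int)
X_test = rng.normal(size=(50, 5))
y_test = (X_test[:, 0] > 0).astype(int)

X_sel, y_sel = select_auxiliary(X_aux, y_aux, X_tgt)
X_aligned = procrustes_align(mean_shift(X_sel, X_tgt), X_tgt)

# Train on the target training set plus the aligned auxiliary examples.
w = fit_ridge(np.vstack([X_tgt, X_aligned]), np.concatenate([y_tgt, y_sel]))
print("test AUC:", auc(predict(w, X_test), y_test))
```

Because the identity matrix is itself orthogonal, the Procrustes step can never increase the Frobenius distance between the shifted auxiliary examples and the target training set. The sketch pairs auxiliary and target rows by position, which is arbitrary, so a real pipeline would need a principled correspondence between examples.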
