Challenge-Enabled Machine Learning to Drug-Response Prediction

In recent decades, the advancement of computational algorithms and the availability of big data have enabled artificial intelligence (AI) to dramatically improve predictive performance in nearly all research areas. Specifically, machine learning (ML) techniques, a major branch of AI, have been widely used in many tasks of drug discovery and development, including predicting treatment effects, identifying target genes and functional pathways, as well as selecting potential biomarkers. However, in practice, blindly applying ML methods may lead to common pitfalls, including overfitting and lack of generalizability. Therefore, how to improve the robustness and prediction accuracy of ML methods has become a crucial problem for researchers. In this review, we summarize the application of ML models to drug discovery by introducing the top-performing methods developed from large-scale drug-related data challenges in recent years.

[1]  黃崇冀,et al.  Machine learning : an artificial intelligence approach , 1988 .

[2]  Tapio Pahikkala,et al.  Toward more realistic drug^target interaction predictions , 2014 .

[3]  S. Rees,et al.  Principles of early drug discovery , 2011, British journal of pharmacology.

[4]  Tudor I. Oprea,et al.  A comprehensive map of molecular drug targets , 2016, Nature Reviews Drug Discovery.

[5]  John R. Anderson,et al.  MACHINE LEARNING An Artificial Intelligence Approach , 2009 .

[6]  Yuanfang Guan,et al.  Machine Learning for Cancer Drug Combination , 2020, Clinical pharmacology and therapeutics.

[7]  Bin Li,et al.  Applications of machine learning in drug discovery and development , 2019, Nature Reviews Drug Discovery.

[8]  Amir K. Foroushani,et al.  Community assessment to advance computational prediction of cancer drug combinations in a pharmacogenomic screen , 2019, Nature Communications.

[9]  Yuanfang Guan,et al.  TAIJI: approaching experimental replicates-level accuracy for drug synergy prediction , 2018, Bioinform..

[10]  Guigang Zhang,et al.  Deep Learning , 2016, Int. J. Semantic Comput..

[11]  Sean Ekins The Next Era: Deep Learning in Pharmaceutical Research , 2016, Pharmaceutical Research.

[12]  Artem Cherkasov,et al.  SimBoost: a read-across approach for predicting drug–target binding affinities using gradient boosting machines , 2017, Journal of Cheminformatics.

[13]  Richard C. Mohs,et al.  Drug discovery and development: Role of basic biological research , 2017, Alzheimer's & dementia.

[14]  Thomas Blaschke,et al.  The rise of deep learning in drug discovery. , 2018, Drug discovery today.

[15]  Drews Drug discovery today - and tomorrow. , 2000, Drug discovery today.

[16]  Laura M. Heiser,et al.  A community effort to assess and improve drug sensitivity prediction algorithms , 2014, Nature Biotechnology.

[17]  Nci Dream Community A community effort to assess and improve drug sensitivity prediction algorithms , 2014 .

[18]  F. Harrell,et al.  Prognostic/Clinical Prediction Models: Multivariable Prognostic Models: Issues in Developing Models, Evaluating Assumptions and Adequacy, and Measuring and Reducing Errors , 2005 .

[19]  Xuefeng Wang,et al.  Multiple-kernel learning for genomic data mining and prediction , 2018 .

[20]  Radford M. Neal Pattern Recognition and Machine Learning , 2007, Technometrics.

[21]  Nello Cristianini,et al.  Cross‐Validation (K‐Fold Cross‐Validation, Leave‐One‐Out, Jackknife, Bootstrap) , 2014 .

[22]  Ethem Alpaydin,et al.  Machine Learning: The New AI , 2016 .

[23]  Yang Xie,et al.  A community computational challenge to predict the activity of pairs of compounds Citation , 2015 .

[24]  John P. Overington,et al.  Comprehensive characterization of the Published Kinase Inhibitor Set , 2016, Nature Biotechnology.

[25]  Bernard F. Buxton,et al.  Drug Design by Machine Learning: Support Vector Machines for Pharmaceutical Data Analysis , 2001, Comput. Chem..

[26]  Yuanfang Guan,et al.  Network Propagation Predicts Drug Synergy in Cancers. , 2018, Cancer research.

[27]  Daniel B. Mark,et al.  TUTORIAL IN BIOSTATISTICS MULTIVARIABLE PROGNOSTIC MODELS: ISSUES IN DEVELOPING MODELS, EVALUATING ASSUMPTIONS AND ADEQUACY, AND MEASURING AND REDUCING ERRORS , 1996 .

[28]  K. Borgwardt,et al.  Machine Learning in Medicine , 2015, Mach. Learn. under Resour. Constraints Vol. 3.

[29]  Arzucan Özgür,et al.  DeepDTA: deep drug–target binding affinity prediction , 2018, Bioinform..