Predicting successes and failures of clinical trials with an ensemble LS-SVR

For a variety of reasons, most drug candidates cannot eventually pass the drug approval process. Thus, developing reliable methods for predicting clinical trial outcomes of drug candidates is crucial in improving the success rate of drug discovery and development. In this study, we propose an ensemble classifier based on weighted least squares support vector regression (LS-SVR) for predicting successes and failures of clinical trials. The efficacy of the proposed ensemble classifier is demonstrated through an experimental study on PrOCTOR dataset, which consists of informative chemical features of the drugs and target-based features. Comparing random forest and other models, the proposed ensemble classifier obtains the highest value for the area under the receiver operator curve (AUC). The results of this study demonstrate that the proposed ensemble classifier can be used to effectively predict drug approvals.

[1]  Evgeny Putin,et al.  Integrated deep learned transcriptomic and structure-based predictor of clinical trials outcomes , 2016, bioRxiv.

[2]  Kaitlyn M. Gayvert,et al.  A Data-Driven Approach to Predicting Successes and Failures of Clinical Trials. , 2016, Cell chemical biology.

[3]  G. Wahba,et al.  Some results on Tchebycheffian spline functions , 1971 .

[4]  Vladimir N. Vapnik,et al.  The Nature of Statistical Learning Theory , 2000, Statistics for Engineering and Information Science.

[5]  Johan A. K. Suykens,et al.  Least Squares Support Vector Machine Classifiers , 1999, Neural Processing Letters.

[6]  Changha Hwang,et al.  Multiclass Classification via Least Squares Support Vector Machine Regression , 2008 .

[7]  J. Mercer Functions of positive and negative type, and their connection with the theory of integral equations , 1909 .

[8]  A. Graul,et al.  The year's new drugs and biologics 2018: Part II - News that shaped the industry in 2018. , 2019, Drugs of today.

[9]  B. Moor,et al.  Robust Cross-Validation Score Functions with Application to Weighted Least Squares Support Vector Machine Function Estimation ∗ , 2009 .

[10]  J. Mercer Functions of Positive and Negative Type, and their Connection with the Theory of Integral Equations , 1909 .

[11]  Joarder Kamruzzaman,et al.  z-SVM: An SVM for Improved Classification of Imbalanced Data , 2006, Australian Conference on Artificial Intelligence.

[12]  Nitesh V. Chawla,et al.  SMOTE: Synthetic Minority Over-sampling Technique , 2002, J. Artif. Intell. Res..

[13]  Jamie Munro,et al.  Trends in clinical success rates and therapeutic focus , 2019, Nature Reviews Drug Discovery.

[14]  Danilo Alfonso López-Sarmiento,et al.  Least Square Support Vector Machine Classifier vs a Logistic Regression Classifier on the Recognition of Numeric Digits , 2013 .

[15]  R. Harrison,et al.  Phase II and phase III failures: 2013–2015 , 2016, Nature Reviews Drug Discovery.