Reliable prediction of anti-diabetic drug failure using a reject option

The medical care for patients with type 2 diabetes generally involves ingestion of oral hypoglycemic agents in order to lower their glucose level. When predicting the result of the medication using a classification approach, high prediction accuracy of the classifier is essential because of high misclassification costs. The application of a reject option to this approach supports more accurate prediction, allowing for human experts to examine when the classifier is unreliable to predict. In this paper, we propose a reject option framework based on heterogeneous ensemble learning through a two-phase fusion. The first phase is to calculate confidence scores, which are used to determine whether to predict, and the second phase is to derive final prediction results by fusing the outputs from multiple heterogeneous classifiers. We confirm the effectiveness of the proposed method to the anti-diabetic drug failure prediction problem through experiments on actual electronic medical records data of type 2 diabetes. The proposed method yields a better trade-off between accuracy and rejection than other reject options with statistical significance. A lower prediction error is obtained for the same degree of rejection. We obtained desirable accuracy for the anti-diabetic drug failure problem by applying the proposed reject option, which allows using the classification approach in practice. The accurate prediction of drug failure at the moment of prescription can assist clinical decisions for patients. In addition, in-depth analysis can be considered for those prescriptions that are predicted as failure or rejected.

[1]  Sungzoon Cho,et al.  A novel multi-class classification algorithm based on one-class support vector machine , 2015, Intell. Data Anal..

[2]  J. Shaw,et al.  A1C for Screening and Diagnosis of Type 2 Diabetes in Routine Clinical Practice , 2010, Diabetes Care.

[3]  Robert P. W. Duin,et al.  The interaction between classification and reject performance for distance-based reject-option classifiers , 2006, Pattern Recognit. Lett..

[4]  Alessandro Verri,et al.  Pattern Recognition with Support Vector Machines , 2002, Lecture Notes in Computer Science.

[5]  Josef Kittler,et al.  Multiple Classifier Systems , 2004, Lecture Notes in Computer Science.

[6]  Sung-Bae Cho,et al.  Multiple network fusion using fuzzy logic , 1995, IEEE Trans. Neural Networks.

[7]  Radu Herbei,et al.  Classification with reject option , 2006 .

[8]  S. Dharmage,et al.  HbA1c as a screening tool for detection of Type 2 diabetes: a systematic review , 2007 .

[9]  Illhoi Yoo,et al.  Data-Mining Technologies for Diabetes: A Systematic Review , 2011, Journal of diabetes science and technology.

[10]  José Augusto Baranauskas,et al.  How Many Trees in a Random Forest? , 2012, MLDM.

[11]  S. Sathiya Keerthi,et al.  Which Is the Best Multiclass SVM Method? An Empirical Study , 2005, Multiple Classifier Systems.

[12]  Chih-Jen Lin,et al.  A Practical Guide to Support Vector Classication , 2008 .

[13]  Fabio Roli,et al.  Support Vector Machines with Embedded Reject Option , 2002, SVM.

[14]  Senlin Luo,et al.  Rule Extraction From Support Vector Machines Using Ensemble Learning Approach: An Application for Diagnosis of Diabetes , 2015, IEEE Journal of Biomedical and Health Informatics.

[15]  Galina L. Rogova,et al.  Combining the results of several neural network classifiers , 1994, Neural Networks.

[16]  Edwin Lughofer,et al.  Reliable All-Pairs Evolving Fuzzy Classifiers , 2013, IEEE Transactions on Fuzzy Systems.

[17]  Joshua C. Denny,et al.  Type 2 Diabetes Risk Forecasting from EMR Data using Machine Learning , 2012, AMIA.

[18]  Hubert Cecotti,et al.  Rejection Schemes in Multi-class Classification -- Application to Handwritten Character Recognition , 2013, 2013 12th International Conference on Document Analysis and Recognition.

[19]  Sungzoon Cho,et al.  Optimal construction of one-against-one classifier based on meta-learning , 2015, Neurocomputing.

[20]  Edwin Lughofer,et al.  Towards incremental classifier fusion , 2010, Intell. Data Anal..

[21]  Fevzullah Temurtas,et al.  A comparative study on diabetes disease diagnosis using neural networks , 2009, Expert Syst. Appl..

[22]  Bernard Zenko,et al.  Is Combining Classifiers with Stacking Better than Selecting the Best One? , 2004, Machine Learning.

[23]  Sungzoon Cho,et al.  An efficient and effective ensemble of support vector machines for anti-diabetic drug failure prediction , 2015, Expert Syst. Appl..

[24]  Francesco Tortorella,et al.  A ROC-based reject rule for dichotomizers , 2005, Pattern Recognit. Lett..

[25]  James C. Bezdek,et al.  Decision templates for multiple classifier fusion: an experimental comparison , 2001, Pattern Recognit..

[26]  Andrew P. Bradley,et al.  Intelligible Support Vector Machines for Diagnosis of Diabetes Mellitus , 2010, IEEE Transactions on Information Technology in Biomedicine.

[27]  Claudio Marrocco,et al.  Design of reject rules for ECOC classification systems , 2012, Pattern Recognit..

[28]  Francisco Herrera,et al.  An overview of ensemble methods for binary classifiers in multi-class problems: Experimental study on one-vs-one and one-vs-all schemes , 2011, Pattern Recognit..

[29]  Robert P. W. Duin,et al.  Growing a multi-class classifier with a reject option , 2008, Pattern Recognit. Lett..

[30]  Sungzoon Cho,et al.  Multi-class classification via heterogeneous ensemble of one-class classifiers , 2015, Eng. Appl. Artif. Intell..

[31]  Chih-Jen Lin,et al.  A comparison of methods for multiclass support vector machines , 2002, IEEE Trans. Neural Networks.

[32]  F. Wolf Standards of Medical Care in Diabetes—2014 , 2013, Diabetes Care.

[33]  Lior Rokach,et al.  Ensemble-based classifiers , 2010, Artificial Intelligence Review.

[34]  Ian H. Witten,et al.  Issues in Stacked Generalization , 2011, J. Artif. Intell. Res..

[35]  H. Koh,et al.  Data mining applications in healthcare. , 2005, Journal of healthcare information management : JHIM.

[36]  Blaise Hanczar,et al.  Accuracy-Rejection Curves (ARCs) for Comparing Classification Methods with a Reject Option , 2009, MLSB.

[37]  Edwin Lughofer,et al.  Integrating new classes on the fly in evolving fuzzy classifier designs and their application in visual inspection , 2015, Appl. Soft Comput..

[38]  Norman D. Black,et al.  Feature selection and classification model construction on type 2 diabetic patients' data , 2007, Artif. Intell. Medicine.

[39]  C. K. Chow,et al.  On optimum recognition error and reject tradeoff , 1970, IEEE Trans. Inf. Theory.

[40]  Yves Grandvalet,et al.  Support Vector Machines with a Reject Option , 2008, NIPS.

[41]  Harleen Kaur,et al.  Empirical Study on Applications of Data Mining Techniques in Healthcare , 2006 .

[42]  Kush R. Varshney,et al.  A risk bound for ensemble classification with a reject option , 2011, 2011 IEEE Statistical Signal Processing Workshop (SSP).

[43]  Ludmila I. Kuncheva,et al.  Using measures of similarity and inclusion for multiple classifier fusion by decision templates , 2001, Fuzzy Sets Syst..

[44]  Sungzoon Cho,et al.  Constructing a multi-class classifier using one-against-one approach with different binary classifiers , 2015, Neurocomputing.