A preclustering-based ensemble learning technique for acute appendicitis diagnoses

OBJECTIVE Acute appendicitis is a common medical condition, whose effective, timely diagnosis can be difficult. A missed diagnosis not only puts the patient in danger but also requires additional resources for corrective treatments. An acute appendicitis diagnosis constitutes a classification problem, for which a further fundamental challenge pertains to the skewed outcome class distribution of instances in the training sample. A preclustering-based ensemble learning (PEL) technique aims to address the associated imbalanced sample learning problems and thereby support the timely, accurate diagnosis of acute appendicitis. MATERIALS AND METHODS The proposed PEL technique employs undersampling to reduce the number of majority-class instances in a training sample, uses preclustering to group similar majority-class instances into multiple groups, and selects from each group representative instances to create more balanced samples. The PEL technique thereby reduces potential information loss from random undersampling. It also takes advantage of ensemble learning to improve performance. We empirically evaluate this proposed technique with 574 clinical cases obtained from a comprehensive tertiary hospital in southern Taiwan, using several prevalent techniques and a salient scoring system as benchmarks. RESULTS The comparative results show that PEL is more effective and less biased than any benchmarks. The proposed PEL technique seems more sensitive to identifying positive acute appendicitis than the commonly used Alvarado scoring system and exhibits higher specificity in identifying negative acute appendicitis. In addition, the sensitivity and specificity values of PEL appear higher than those of the investigated benchmarks that follow the resampling approach. Our analysis suggests PEL benefits from the more representative majority-class instances in the training sample. According to our overall evaluation results, PEL records the best overall performance, and its area under the curve measure reaches 0.619. CONCLUSION The PEL technique is capable of addressing imbalanced sample learning associated with acute appendicitis diagnosis. Our evaluation results suggest PEL is less biased toward a positive or negative class than the investigated benchmark techniques. In addition, our results indicate the overall effectiveness of the proposed technique, compared with prevalent scoring systems or salient classification techniques that follow the resampling approach.

[1]  Lawrence O Gostin,et al.  Health care reform--a historic moment in US social policy. , 2010, JAMA.

[2]  Y Abdeldaim,et al.  The Alvarado score as a tool for diagnosis of acute appendicitis. , 2006, Irish medical journal.

[3]  Geoffrey E. Hinton,et al.  Learning internal representations by error propagation , 1986 .

[4]  Zhihua Chen,et al.  Using Prior Knowledge and Rule Induction Methods to Discover Molecular Markers of Prognosis in Lung Cancer , 2005, AMIA.

[5]  Haibo He,et al.  Learning from Imbalanced Data , 2009, IEEE Transactions on Knowledge and Data Engineering.

[6]  Juan Liu,et al.  Mixture classification model based on clinical markers for breast cancer prognosis , 2010, Artif. Intell. Medicine.

[7]  Yang Liu,et al.  Combining integrated sampling with SVM ensembles for learning from imbalanced datasets , 2011, Inf. Process. Manag..

[8]  Fei-Shih Yang,et al.  Hyperdense appendix on unenhanced CT: a sign of acute appendicitis , 2007, Abdominal Imaging.

[9]  Salvatore J. Stolfo,et al.  Distributed data mining in credit card fraud detection , 1999, IEEE Intell. Syst..

[10]  H. Sebastian Seung,et al.  Selective Sampling Using the Query by Committee Algorithm , 1997, Machine Learning.

[11]  Zhi-Hua Zhou,et al.  Ieee Transactions on Knowledge and Data Engineering 1 Training Cost-sensitive Neural Networks with Methods Addressing the Class Imbalance Problem , 2022 .

[12]  Daniel Sánchez,et al.  Mining association rules with improved semantics in medical databases , 2001, Artif. Intell. Medicine.

[13]  Stan Matwin,et al.  Machine Learning for the Detection of Oil Spills in Satellite Radar Images , 1998, Machine Learning.

[14]  Peter Clark,et al.  Rule Induction with CN2: Some Recent Improvements , 1991, EWSL.

[15]  Thanos Athanasiou,et al.  Patient-reported outcome measures: the importance of patient satisfaction in surgery. , 2009, Surgery.

[16]  Serguei V. S. Pakhomov,et al.  Technical Brief: Automatic Classification of Foot Examination Findings Using Clinical Notes and Machine Learning , 2008, J. Am. Medical Informatics Assoc..

[17]  P. Rousseeuw Silhouettes: a graphical aid to the interpretation and validation of cluster analysis , 1987 .

[18]  Vladimir N. Vapnik,et al.  The Nature of Statistical Learning Theory , 2000, Statistics for Engineering and Information Science.

[19]  Thomas F. O'Donnell,et al.  Principles of surgery, 6th ed , 1995 .

[20]  David R Flum,et al.  The clinical and economic correlates of misdiagnosed appendicitis: nationwide analysis. , 2002, Archives of surgery.

[21]  J. Ross Quinlan,et al.  C4.5: Programs for Machine Learning , 1992 .

[22]  Chung-Ho Hsieh,et al.  Novel solutions for an old disease: diagnosis of acute appendicitis with random forest, support vector machines, and artificial neural networks. , 2011, Surgery.

[23]  Hardin Dm,et al.  Acute appendicitis: review and update. , 1999 .

[24]  Cem Ergün,et al.  Clustering Based Under-Sampling for Improving Speaker Verification Decisions Using AdaBoost , 2004, SSPR/SPR.

[25]  N. Tzanakis,et al.  A New Approach to Accurate Diagnosis of Acute Appendicitis , 2005, World Journal of Surgery.

[26]  Kouhei Akazawa,et al.  Comparison of the Levels of Accuracy of an Artificial Neural Network Model and a Logistic Regression Model for the Diagnosis of Acute Appendicitis , 2007, Journal of Medical Systems.

[27]  Zhi-Hua Zhou,et al.  Exploratory Under-Sampling for Class-Imbalance Learning , 2006, Sixth International Conference on Data Mining (ICDM'06).

[28]  D W Rattner,et al.  Introduction of appendiceal CT: impact on negative appendectomy and appendiceal perforation rates. , 1999, Annals of surgery.

[29]  Gisbert Schneider,et al.  Support vector machine applications in bioinformatics. , 2003, Applied bioinformatics.

[30]  Nitesh V. Chawla,et al.  SMOTE: Synthetic Minority Over-sampling Technique , 2002, J. Artif. Intell. Res..

[31]  Stan Matwin,et al.  Addressing the Curse of Imbalanced Training Sets: One-Sided Selection , 1997, ICML.

[32]  Aytekin Oto,et al.  Rapid CT diagnosis of acute appendicitis with IV contrast material , 2006, Emergency Radiology.

[33]  Steven Walczak,et al.  Reducing surgical patient costs through use of an artificial neural network to predict transfusion requirements , 2000, Decis. Support Syst..

[34]  Fiona Filewood Improving diagnosis and treatment for appendicitis. , 2005, Nursing times.

[35]  R. McKay,et al.  The use of the clinical scoring system by Alvarado in the decision to perform computed tomography for acute appendicitis in the ED. , 2007, The American journal of emergency medicine.

[36]  Stephen M Cohn,et al.  A prospective randomized study of clinical assessment versus computed tomography for the diagnosis of acute appendicitis. , 2003, Surgical infections.

[37]  P. Ziprin,et al.  Artificial Neural Networks: Useful Aid in Diagnosing Acute Appendicitis , 2008, World Journal of Surgery.

[38]  N. Japkowicz Learning from Imbalanced Data Sets: A Comparison of Various Strategies * , 2000 .

[39]  Mu-Chen Chen,et al.  Prediction model building and feature selection with support vector machines in breast cancer diagnosis , 2008, Expert Syst. Appl..

[40]  James D Dziura,et al.  Is It Safe to Delay Appendectomy in Adults With Acute Appendicitis? , 2006, Annals of surgery.

[41]  Robert C. Holte,et al.  C4.5, Class Imbalance, and Cost Sensitivity: Why Under-Sampling beats Over-Sampling , 2003 .

[42]  Khayal A Al-Khayal,et al.  Computed tomography and ultrasonography in the diagnosis of equivocal acute appendicitis. A meta-analysis. , 2007, Saudi medical journal.

[43]  Gerald J Berry,et al.  Imaging for suspected appendicitis: negative appendectomy and perforation rates. , 2002, Radiology.

[44]  Neil M Rofsky,et al.  Appendicitis: the impact of computed tomography imaging on negative appendectomy and perforation rates , 1998, American Journal of Gastroenterology.

[45]  A Alvarado,et al.  A practical score for the early diagnosis of acute appendicitis. , 1986, Annals of emergency medicine.

[46]  Kevin C. Desouza,et al.  Data mining in healthcare information systems: case study of a veterans' administration spinal cord injury population , 2003, 36th Annual Hawaii International Conference on System Sciences, 2003. Proceedings of the.

[47]  Shlomo Argamon,et al.  Committee-Based Sample Selection for Probabilistic Classifiers , 1999, J. Artif. Intell. Res..

[48]  Paul Jen-Hwa Hu,et al.  A Data-Driven Approach to Manage the Length of Stay for Appendectomy Patients , 2009, IEEE Transactions on Systems, Man, and Cybernetics - Part A: Systems and Humans.

[49]  Ali Kabir,et al.  Predicting negative appendectomy by using demographic, clinical, and laboratory parameters: a cross-sectional study. , 2008, International journal of surgery.

[50]  J. Hiatt,et al.  Negative appendectomy in pregnant women is associated with a substantial risk of fetal loss. , 2007, Journal of the American College of Surgeons.

[51]  Xiaowei Yang,et al.  Several SVM Ensemble Methods Integrated with Under-Sampling for Imbalanced Data Learning , 2009, ADMA.

[52]  J. Tuynman,et al.  Evaluating routine diagnostic imaging in acute appendicitis. , 2009, International journal of surgery.

[53]  D. Gouma,et al.  Implications of Removing a Normal Appendix , 2003, Digestive Surgery.

[54]  Austin H. Chen,et al.  A novel support vector sampling technique to improve classification accuracy and to identify key genes of leukaemia and prostate cancers , 2011, Expert Syst. Appl..

[55]  F Christian,et al.  A simple scoring system to reduce the negative appendicectomy rate. , 1992, Annals of the Royal College of Surgeons of England.

[56]  J. Hanley,et al.  The meaning and use of the area under a receiver operating characteristic (ROC) curve. , 1982, Radiology.

[57]  Mark J. Schreiber,et al.  Decision Tree Algorithms Predict the Diagnosis and Outcome of Dengue Fever in the Early Phase of Illness , 2008, PLoS neglected tropical diseases.

[58]  Foster Provost,et al.  The effect of class distribution on classifier learning: an empirical study , 2001 .

[59]  David Heckerman,et al.  Bayesian Networks for Data Mining , 2004, Data Mining and Knowledge Discovery.

[60]  Joshua Zhexue Huang,et al.  Extensions to the k-Means Algorithm for Clustering Large Data Sets with Categorical Values , 1998, Data Mining and Knowledge Discovery.

[61]  M. Liang,et al.  The art and science of diagnosing acute appendicitis. , 2005, Southern medical journal.

[62]  Jacek M. Zurada,et al.  Training neural network classifiers for medical decision making: The effects of imbalanced datasets on classification performance , 2008, Neural Networks.

[63]  Chien-Lung Chan,et al.  Decision Model for Acute Appendicitis Treatment With Decision Tree Technology—A Modification of the Alvarado Scoring System , 2010, Journal of the Chinese Medical Association : JCMA.

[64]  Dirk J Gouma,et al.  Scoring and diagnostic laparoscopy for suspected appendicitis. , 2003, The European journal of surgery = Acta chirurgica.

[65]  B. Birnbaum,et al.  Appendicitis at the millennium. , 2000, Radiology.

[66]  J. Ross Quinlan,et al.  Induction of Decision Trees , 1986, Machine Learning.

[67]  Teruhiko Terasawa,et al.  Systematic Review: Computed Tomography and Ultrasonography To Detect Acute Appendicitis in Adults and Adolescents , 2004, Annals of Internal Medicine.

[68]  David D. Lewis,et al.  Heterogeneous Uncertainty Sampling for Supervised Learning , 1994, ICML.

[69]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[70]  Yu-Chuan Li,et al.  Predicting hypotensive episodes during spinal anesthesia with the application of artificial neural networks , 2008, Comput. Methods Programs Biomed..

[71]  Taeho Jo,et al.  A Multiple Resampling Method for Learning from Imbalanced Data Sets , 2004, Comput. Intell..

[72]  P. Näsman,et al.  Acute appendicitis: a clinical study of 1018 cases of emergency appendectomy. , 1982, Acta chirurgica Scandinavica.

[73]  L. Enochsson,et al.  Diagnostic decision support in suspected acute appendicitis: validation of a simplified scoring system. , 1997, The European journal of surgery = Acta chirurgica.

[74]  Belur V. Dasarathy,et al.  Nearest neighbor (NN) norms: NN pattern classification techniques , 1991 .

[75]  William F. Punch,et al.  Knowledge discovery in medical and biological datasets using a hybrid Bayes classifier/evolutionary algorithm , 2003, IEEE Trans. Syst. Man Cybern. Part B.

[76]  C. Ohmann,et al.  Diagnostic scores for acute appendicitis. Abdominal Pain Study Group. , 1995, The European journal of surgery = Acta chirurgica.

[77]  Tom Fahey,et al.  The Alvarado score for predicting acute appendicitis: a systematic review , 2011, BMC medicine.

[78]  T. Warren Liao,et al.  Classification of weld flaws with imbalanced class data , 2008, Expert Syst. Appl..

[79]  M Rioux,et al.  Sonographic detection of the normal and abnormal appendix. , 1992, AJR. American journal of roentgenology.

[80]  R B Jeffrey,et al.  CT and sonographic evaluation of acute right lower quadrant abdominal pain. , 1998, AJR. American journal of roentgenology.

[81]  Jerry L. Old,et al.  Imaging for suspected appendicitis. , 2005, American family physician.

[82]  Allard E Dembe,et al.  Management of acute appendicitis: the impact of CT scanning on the bottom line. , 2010, Journal of the American College of Surgeons.