ILIOU machine learning preprocessing method for depression type prediction

The main objective of this study was to find a data preprocessing method to boost the prediction performance of the machine learning algorithms in datasets of mental patients. Specifically, the machine learning methods must have almost excellent classification results in patients with depression, in order to achieve the sooner the possible the appropriate treatment. In this paper, we establish ILIOU data preprocessing method for Depression type detection. The performance of ILIOU data preprocessing method and principal component analysis preprocessing method was evaluated using the tenfold cross validation method assessing seven machine learning classification algorithms, nearest-neighbour classifier (IB1), C4.5 algorithm implementation (J48), random forest, multilayer perceptron (MLP), support vector machine (SMO), JRIP and fuzzy logic (FURIA), respectively. The classification results are presented and compared analytically. The experimental results reveal that the transformed dataset with new features after ILIOU preprocessing method implementation to the original dataset achieved 100% classification–prediction performance of the classification algorithms. So ILIOU data preprocessing method can be used for significantly boost classification algorithms performance in similar datasets and can be used for depression type prediction.

[1]  Alexander Gammerman,et al.  Machine learning classification with confidence: Application of transductive conformal predictors to MRI-based diagnostic and prognostic markers in depression , 2011, NeuroImage.

[2]  J. D. Haberman,et al.  Evaluation of new imaging procedures for breast cancer: proper process. , 1983, AJR. American journal of roentgenology.

[3]  Waldemar Wójcik,et al.  Assessment of significance of features acquired from thyroid ultrasonograms in Hashimoto's disease , 2012, Biomedical engineering online.

[4]  David E. Rumelhart,et al.  Generalization by Weight-Elimination with Application to Forecasting , 1990, NIPS.

[5]  Ioannis M. Stephanakis,et al.  A novel data preprocessing method for boosting neural network performance: A case study in osteoporosis prediction , 2017, Inf. Sci..

[6]  Haleh Vafaie,et al.  Feature Selection Methods: Genetic Algorithms vs. Greedy-like Search , 2009 .

[7]  Ewout W Steyerberg,et al.  Internal and external validation of predictive models: a simulation study of bias and precision in small samples. , 2003, Journal of clinical epidemiology.

[8]  Marcin Michalak,et al.  Support Vector Machines in Biomedical and Biometrical Applications , 2013 .

[9]  J. Reilly,et al.  Using pre-treatment EEG data to predict response to SSRI treatment for MDD , 2010, 2010 Annual International Conference of the IEEE Engineering in Medicine and Biology.

[10]  Filip Smit,et al.  Preventing the onset of depressive disorders: a meta-analytic review of psychological interventions. , 2008, The American journal of psychiatry.

[11]  S. Hollon,et al.  Cognitive and cognitive-behavioral therapies. , 1994 .

[12]  Guoqiang Peter Zhang,et al.  Neural networks for classification: a survey , 2000, IEEE Trans. Syst. Man Cybern. Part C.

[13]  Ron Kohavi,et al.  A Study of Cross-Validation and Bootstrap for Accuracy Estimation and Model Selection , 1995, IJCAI.

[14]  Dorian Pyle,et al.  Data Preparation for Data Mining , 1999 .

[15]  Mark A. Hall,et al.  Correlation-based Feature Selection for Machine Learning , 2003 .

[16]  Theodoros Iliou,et al.  A Novel Machine Learning Data Preprocessing Method for Enhancing Classification Algorithms Performance , 2015, EANN '15.

[17]  J. Concato,et al.  A simulation study of the number of events per variable in logistic regression analysis. , 1996, Journal of clinical epidemiology.

[18]  B. Matthews Comparison of the predicted and observed secondary structure of T4 phage lysozyme. , 1975, Biochimica et biophysica acta.

[19]  Mukund Balasubramanian,et al.  The Isomap Algorithm and Topological Stability , 2002, Science.

[20]  H. Aizenstein,et al.  Studying depression using imaging and machine learning methods , 2015, NeuroImage: Clinical.

[21]  Raja Srinivasa Reddy Boddu,et al.  Waikato Environment for Knowledge Analysis , 2019 .

[22]  Dmitrij Frishman,et al.  Pitfalls of supervised feature selection , 2009, Bioinform..

[23]  Janet B W Williams,et al.  Diagnostic and Statistical Manual of Mental Disorders , 2013 .

[24]  Massimiliano Grassi,et al.  Artificial Neural Network Model for the Prediction of Obsessive-Compulsive Disorder Treatment Response , 2009, Journal of clinical psychopharmacology.

[25]  G. Dunteman Principal Components Analysis , 1989 .

[26]  D. Segal Diagnostic and Statistical Manual of Mental Disorders (DSM-IV-TR) , 2010 .