Iliou Machine Learning Data Preprocessing Method for Stress Level Prediction

Data pre-processing is an important step in the data mining process. Data preparation and filtering steps can take considerable amount of processing time. Data pre-processing includes cleaning, normalization, transformation, feature extraction and selection. In this paper, Iliou and PCA data preprocessing methods evaluated in a data set of 103 students, aged 18–25, who were experiencing anxiety problems. The performance of Iliou and PCA data preprocessing methods was evaluated using the 10-fold cross validation method assessing seven classification algorithms, IB1, J48, Random Forest, MLP, SMO, JRip and FURIA, respectively. The classification results indicate that Iliou data preprocessing algorithm consistently and substantially outperforms PCA data preprocessing method, achieving 98.6% against 92.2% classification performance, respectively.

[1]  David M. W. Powers,et al.  Evaluation: from precision, recall and F-measure to ROC, informedness, markedness and correlation , 2011, ArXiv.

[2]  A. Bandura Self-efficacy: toward a unifying theory of behavioral change. , 1977, Psychological review.

[3]  Dorian Pyle,et al.  Data Preparation for Data Mining , 1999 .

[4]  A. Beck,et al.  An inventory for measuring clinical anxiety: psychometric properties. , 1988, Journal of consulting and clinical psychology.

[5]  Theodoros Iliou,et al.  A Novel Machine Learning Data Preprocessing Method for Enhancing Classification Algorithms Performance , 2015, EANN '15.

[6]  Oscar Mayora-Ibarra,et al.  Stress modelling and prediction in presence of scarce data , 2016, J. Biomed. Informatics.

[7]  B. Matthews Comparison of the predicted and observed secondary structure of T4 phage lysozyme. , 1975, Biochimica et biophysica acta.

[8]  P. Kendall,et al.  Self-referent speech and psychopathology: The balance of positive and negative thinking , 1989, Cognitive Therapy and Research.

[9]  Haleh Vafaie,et al.  Feature Selection Methods: Genetic Algorithms vs. Greedy-like Search , 2009 .

[10]  R A Steer,et al.  Use of the Beck Anxiety Inventory with Adolescent Psychiatric Outpatients , 1995, Psychological reports.

[11]  Ron Kohavi,et al.  A Study of Cross-Validation and Bootstrap for Accuracy Estimation and Model Selection , 1995, IJCAI.

[12]  Purnendu Shekhar Pandey,et al.  Machine Learning and IoT for prediction and detection of stress , 2017, 2017 17th International Conference on Computational Science and Its Applications (ICCSA).

[13]  Mark A. Hall,et al.  Correlation-based Feature Selection for Machine Learning , 2003 .

[14]  Nigar G. Khawaja,et al.  GENDER DIFFERENCES IN ANXIETY: AN INVESTIGATION OF THE SYMPTOMS, COGNITIONS, AND SENSITIVITY TOWARDS ANXIETY IN A NONCLINICAL POPULATION , 2002, Behavioural and Cognitive Psychotherapy.

[15]  I. Blackburn,et al.  Cognitive Therapy for Depression and Anxiety , 1990 .

[16]  S. Raj,et al.  The Postural Tachycardia Syndrome (POTS): Pathophysiology, Diagnosis & Management , 2006, Indian pacing and electrophysiology journal.

[17]  Ovsanna Leyfer,et al.  Examination of the utility of the Beck Anxiety Inventory and its factors as a screener for anxiety disorders. , 2006, Journal of anxiety disorders.

[18]  G. Dunteman Principal Components Analysis , 1989 .

[19]  A. Statnikov,et al.  Quantitative forecasting of PTSD from early trauma responses: a Machine Learning application. , 2014, Journal of psychiatric research.

[20]  Philip C. Kendall,et al.  Anxious self-talk: Development of the Anxious Self-Statements Questionnaire (ASSQ) , 1989, Cognitive Therapy and Research.

[21]  Philip C. Kendall,et al.  The future for cognitive assessment of anxiety: Let's get specific. , 1987 .

[22]  Oscar Mayora-Ibarra,et al.  Stress Modelling Using Transfer Learning in Presence of Scarce Data , 2015, AmIHEALTH.

[23]  Mukund Balasubramanian,et al.  The Isomap Algorithm and Topological Stability , 2002, Science.

[24]  Gavin Andrews,et al.  Panic and generalized anxiety disorders , 1993 .

[25]  R. Bell,et al.  The Beck Anxiety Inventory in a non-clinical sample. , 1995, Behaviour research and therapy.

[26]  Ioannis M. Stephanakis,et al.  A novel data preprocessing method for boosting neural network performance: A case study in osteoporosis prediction , 2017, Inf. Sci..

[27]  Thompson E. Davis,et al.  Anxiety Disorders and Phobias , 2010 .