Efficient sleep stage recognition system based on EEG signal using k-means clustering based feature weighting

Sleep scoring is one of the most important methods for diagnosis in psychiatry and neurology. Sleep staging is a time consuming and difficult task conducted by sleep specialists. The purposes of this work are to automatic score the sleep stages and to help to sleep physicians on sleep stage scoring. In this work, a novel data preprocessing method called k-means clustering based feature weighting (KMCFW) has been proposed and combined with k-NN (k-nearest neighbor) and decision tree classifiers to classify the EEG (electroencephalogram) sleep into six sleep stages including awake, N-REM (non-rapid eye movement) stage 1, N-REM stage 2, N-REM stage 3, REM, and non-sleep (movement time). First of all, frequency domain features belonging to sleep EEG signal have been extracted using Welch spectral analysis method and composed 129 features from EEG signal relating each sleep stages. In order to decrease the features, the statistical features comprising minimum value, maximum value, standard deviation, and mean value have been used and then reduced from 129 to 4 features. In the second phase, the sleep stages dataset with four features has been weighted by means of k-means clustering based feature weighting. Finally, the weighted sleep stages have been automatically classified into six sleep stages using k-NN and C4.5 decision tree classifier. In the classification of sleep stages, the k values of 10, 20, 30, 40, 50, and 60 in k-NN classifier have been used and compared with each other. In the experimental results, while sleep stages has been classified with 55.88% success rate using k-NN classifier (for k value of 40), the weighted sleep stages with KMCFW has been recognized with 82.15% success rate k-NN classifier (for k value of 40). And also, we have investigated the relevance between sleep stages and frequency domain features belonging to EEG signal. These results have demonstrated that proposed weighting method have a considerable impact on automatic determining of sleep stages. This system could be used as an online system in the automatic scoring of sleep stages and helps to sleep physicians in the sleep scoring process.

[1]  J. Ross Quinlan,et al.  Induction of Decision Trees , 1986, Machine Learning.

[2]  Trevor Darrell,et al.  Nearest-Neighbor Methods in Learning and Vision , 2008, IEEE Trans. Neural Networks.

[3]  Kemal Polat,et al.  Medical diagnosis of atherosclerosis from Carotid Artery Doppler Signals using principal component analysis (PCA), k-NN based weighting pre-processing and Artificial Immune Recognition System (AIRS) , 2008, J. Biomed. Informatics.

[4]  Kemal Polat,et al.  Comparison of Different Classifier Algorithms on the Automated Detection of Obstructive Sleep Apnea Syndrome , 2008, Journal of Medical Systems.

[5]  D H Evans Doppler signal analysis. , 2000, Ultrasound in medicine & biology.

[6]  Belur V. Dasarathy,et al.  Nearest neighbor (NN) norms: NN pattern classification techniques , 1991 .

[7]  Suzanne Lesecq,et al.  Feature selection for sleep/wake stages classification using data driven methods , 2007, Biomed. Signal Process. Control..

[8]  Kemal Polat,et al.  Classification of epileptiform EEG using a hybrid system based on decision tree classifier and fast Fourier transform , 2007, Appl. Math. Comput..

[9]  Rui Xu,et al.  Survey of clustering algorithms , 2005, IEEE Transactions on Neural Networks.

[10]  Thomas G. Dietterich What is machine learning? , 2020, Archives of Disease in Childhood.

[11]  Westgate Road,et al.  Comparison of manual sleep staging with automated neural network-based analysis in clinical practice. , 2006 .

[12]  Daniel J Buysse,et al.  Sleep–Related Breathing Disorders in Adults: Recommendations for Syndrome Definition and Measurement Techniques in Clinical Research , 2000 .

[13]  Dimitar Filev,et al.  Generation of Fuzzy Rules by Mountain Clustering , 1994, J. Intell. Fuzzy Syst..

[14]  M. Müller,et al.  Comparison of color-flow Doppler scanning, power Doppler scanning, and frequency shift for assessment of carotid artery stenosis. , 2001, Journal of vascular surgery.

[15]  F. Gianfelici,et al.  Nearest-Neighbor Methods in Learning and Vision (Shakhnarovich, G. et al., Eds.; 2006) [Book review] , 2008 .

[16]  A. Rechtschaffen,et al.  A manual of standardized terminology, technique and scoring system for sleep stages of human subjects , 1968 .

[17]  J. MacQueen Some methods for classification and analysis of multivariate observations , 1967 .

[18]  B. Sigel,et al.  A brief history of Doppler ultrasound in the diagnosis of peripheral vascular disease. , 1998, Ultrasound in medicine & biology.

[19]  D. Evans Doppler Ultrasound: Physics Instrumentation and Clinical Applications , 1989 .

[20]  P. Estévez,et al.  Polysomnographic pattern recognition for automated classification of sleep-waking states in infants , 2006, Medical and Biological Engineering and Computing.

[21]  James C. Bezdek,et al.  Pattern Recognition with Fuzzy Objective Function Algorithms , 1981, Advanced Applications in Pattern Recognition.

[22]  Abdulkadir Sengur,et al.  Comparison of clustering algorithms for analog modulation classification , 2006 .

[23]  Abdulkadir Sengür,et al.  Comparison of clustering algorithms for analog modulation classification , 2006, Expert Syst. Appl..

[24]  Kristína Susmáková,et al.  Discrimination ability of individual measures used in sleep stages classification , 2008, Artif. Intell. Medicine.

[25]  Stephen L. Chiu,et al.  Fuzzy Model Identification Based on Cluster Estimation , 1994, J. Intell. Fuzzy Syst..

[26]  R S Cobbold,et al.  A comparative study and assessment of Doppler ultrasound spectral estimation techniques. Part II: Methods and results. , 1988, Ultrasound in medicine & biology.