Automatic Human Sleep Stage Scoring Using Deep Neural Networks

The classification of sleep stages is the first and an important step in the quantitative analysis of polysomnographic recordings. Sleep stage scoring relies heavily on visual pattern recognition by a human expert and is time consuming and subjective. Thus, there is a need for automatic classification. In this work we developed machine learning algorithms for sleep classification: random forest (RF) classification based on features and artificial neural networks (ANNs) working both with features and raw data. We tested our methods in healthy subjects and in patients. Most algorithms yielded good results comparable to human interrater agreement. Our study revealed that deep neural networks (DNNs) working with raw data performed better than feature-based methods. We also demonstrated that taking the local temporal structure of sleep into account a priori is important. Our results demonstrate the utility of neural network architectures for the classification of sleep.

[1]  L. R. Dice Measures of the Amount of Ecologic Association Between Species , 1945 .

[2]  T. Sørensen,et al.  A method of establishing group of equal amplitude in plant sociobiology based on similarity of species content and its application to analyses of the vegetation on Danish commons , 1948 .

[3]  W. A. Clark,et al.  Simulation of self-organizing systems by digital computer , 1954, Trans. IRE Prof. Group Inf. Theory.

[4]  John H. Holland,et al.  Tests on a cell assembly theory of the action of the brain, using a large digital computer , 1956, IRE Trans. Inf. Theory.

[5]  F ROSENBLATT,et al.  The perceptron: a probabilistic model for information storage and organization in the brain. , 1958, Psychological review.

[6]  M. Jouvet,et al.  [Comparative electroencephalographic analysis of physiological sleep in the cat and in man]. , 1960, Revue neurologique.

[7]  R. L. Stratonovich CONDITIONAL MARKOV PROCESSES , 1960 .

[8]  Jacob Cohen A Coefficient of Agreement for Nominal Scales , 1960 .

[9]  J. Morgan,et al.  Problems in the Analysis of Survey Data, and a Proposal , 1963 .

[10]  Philip J. Stone,et al.  Experiments in induction , 1966 .

[11]  Alekseĭ Grigorʹevich Ivakhnenko,et al.  Cybernetics and forecasting techniques , 1967 .

[12]  P. Welch The use of fast Fourier transform for the estimation of power spectra: A method based on time averaging over short, modified periodograms , 1967 .

[13]  T M Itil,et al.  Digital computer classifications of EEG sleep stages. , 1969, Electroencephalography and clinical neurophysiology.

[14]  E. Wolpert A Manual of Standardized Terminology, Techniques and Scoring System for Sleep Stages of Human Subjects. , 1969 .

[15]  D. O. Walter,et al.  On automatic methods of sleep staging by EEG spectra. , 1970, Electroencephalography and clinical neurophysiology.

[16]  J R Smith,et al.  EEG sleep stage scoring by an automatic hybrid system. , 1971, Electroencephalography and clinical neurophysiology.

[17]  R. D. Joseph,et al.  Pattern recognition of EEG-EOG as a technique for all-night sleep stage scoring. , 1972, Electroencephalography and clinical neurophysiology.

[18]  J M Gaillard,et al.  Principles of automatic analysis of sleep records with a hybrid system. , 1973, Computers and biomedical research, an international journal.

[19]  Kunihiko Fukushima,et al.  Neocognitron: A Self-Organizing Neural Network Model for a Mechanism of Visual Pattern Recognition , 1982 .

[20]  M. Kerkhofs,et al.  Automated sleep scoring: a comparative reliability study of two algorithms. , 1987, Electroencephalography and clinical neurophysiology.

[21]  Antoine Rémond,et al.  Methods of Analysis of Brain Electrical and Magnetic Signals , 1987 .

[22]  Lawrence D. Jackel,et al.  Backpropagation Applied to Handwritten Zip Code Recognition , 1989, Neural Computation.

[23]  Geoffrey E. Hinton,et al.  Phoneme recognition using time-delay neural networks , 1989, IEEE Trans. Acoust. Speech Signal Process..

[24]  Isak Gath,et al.  Unsupervised Optimal Fuzzy Clustering , 1989, IEEE Trans. Pattern Anal. Mach. Intell..

[25]  David A. Landgrebe,et al.  A survey of decision tree classifier methodology , 1991, IEEE Trans. Syst. Man Cybern..

[26]  J P Macher,et al.  Neural network model: application to automatic analysis of human sleep. , 1993, Computers and biomedical research, an international journal.

[27]  L. Tarassenko,et al.  A new approach to the analysis of the human sleep/wakefulness continuum , 1996, Journal of sleep research.

[28]  Thomas G. Dietterich What is machine learning? , 2020, Archives of Disease in Childhood.

[29]  J. Röschke,et al.  Discrimination of sleep stages: a comparison between spectral and nonlinear EEG measures. , 1996, Electroencephalography and clinical neurophysiology.

[30]  Jürgen Schmidhuber,et al.  Long Short-Term Memory , 1997, Neural Computation.

[31]  Hae-Jeong Park,et al.  Automated Sleep Stage Scoring Using Hybrid Rule- and Case-Based Reasoning , 2000, Comput. Biomed. Res..

[32]  Jean Gotman,et al.  Computer-assisted sleep staging , 2001, IEEE Trans. Biomed. Eng..

[33]  A. Varri,et al.  The SIESTA project polygraphic and clinical database , 2001, IEEE Engineering in Medicine and Biology Magazine.

[34]  Kazuhiko Fukuda,et al.  Proposed supplements and amendments to ‘A Manual of Standardized Terminology, Techniques and Scoring System for Sleep Stages of Human Subjects’, the Rechtschaffen & Kales (1968) standard , 2001, Psychiatry and clinical neurosciences.

[35]  A Flexer,et al.  Unsupervised continuous sleep analysis. , 2002, Methods and findings in experimental and clinical pharmacology.

[36]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[37]  Richard Stephenson,et al.  Design and validation of a computer-based sleep-scoring algorithm , 2004, Journal of Neuroscience Methods.

[38]  A. Schlögl,et al.  Interrater reliability between scorers from eight European sleep laboratories in subjects with different sleep disorders , 2004, Journal of sleep research.

[39]  P. Cowen,et al.  The effects of paroxetine and nefazodone on sleep: a placebo controlled trial , 1996, Psychopharmacology.

[40]  P. R. Davidson,et al.  Detecting Behavioral Microsleeps using EEG and LSTM Recurrent Neural Networks , 2005, 2005 IEEE Engineering in Medicine and Biology 27th Annual Conference.

[41]  D. Baldwin,et al.  Antidepressants and their effect on sleep , 2005, Human psychopharmacology.

[42]  A. Schlögl,et al.  An E-Health Solution for Automatic Sleep Classification according to Rechtschaffen and Kales: Validation Study of the Somnolyzer 24 × 7 Utilizing the Siesta Database , 2005, Neuropsychobiology.

[43]  Shie Mannor,et al.  A Tutorial on the Cross-Entropy Method , 2005, Ann. Oper. Res..

[44]  A. Chesson,et al.  The AASM Manual for the Scoring of Sleep and Associated Events: Rules, Terminology, and Techinical Specifications , 2007 .

[45]  Radford M. Neal Pattern Recognition and Machine Learning , 2007, Technometrics.

[46]  Robert W McCarley,et al.  Neurobiology of REM and NREM sleep. , 2007, Sleep medicine.

[47]  S V Selishchev,et al.  Classification of human sleep stages based on EEG processing using hidden Markov models , 2007, Meditsinskaia tekhnika.

[48]  Yann LeCun,et al.  Comparing SVM and convolutional networks for epileptic seizure prediction from intracranial EEG , 2008, 2008 IEEE Workshop on Machine Learning for Signal Processing.

[49]  Hubert Cecotti,et al.  Convolutional Neural Network with embedded Fourier Transform for EEG classification , 2008, 2008 19th International Conference on Pattern Recognition.

[50]  P. Anderer,et al.  Interrater reliability for sleep scoring according to the Rechtschaffen & Kales and the new AASM standard , 2009, Journal of sleep research.

[51]  Co-occurrence of Sawtooth Waves and Rapid Eye Movements during REM Sleep , 2009 .

[52]  Lukás Burget,et al.  Recurrent neural network based language model , 2010, INTERSPEECH.

[53]  Gaël Varoquaux,et al.  Scikit-learn: Machine Learning in Python , 2011, J. Mach. Learn. Res..

[54]  Wei-Yin Loh,et al.  Classification and regression trees , 2011, WIREs Data Mining Knowl. Discov..

[55]  Shing-Tai Pan,et al.  A transition-constrained discrete hidden Markov model for automatic sleep staging , 2012, Biomedical engineering online.

[56]  M. McHugh Interrater reliability: the kappa statistic , 2012, Biochemia medica.

[57]  Amy Loutfi,et al.  Sleep Stage Classification Using Unsupervised Feature Learning , 2012, Adv. Artif. Neural Syst..

[58]  Geoffrey E. Hinton,et al.  Speech recognition with deep recurrent neural networks , 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.

[59]  Thomas Penzel,et al.  Inter-scorer reliability between sleep centers can teach us what to improve in the scoring rules. , 2013, Journal of clinical sleep medicine : JCSM : official publication of the American Academy of Sleep Medicine.

[60]  R. Rosenberg,et al.  The American Academy of Sleep Medicine inter-scorer reliability program: sleep stage scoring. , 2013, Journal of clinical sleep medicine : JCSM : official publication of the American Academy of Sleep Medicine.

[61]  Genshiro A. Sunagawa,et al.  FASTER: an unsupervised fully automated sleep staging method for mice , 2013, Genes to cells : devoted to molecular & cellular mechanisms.

[62]  Emery N. Brown,et al.  A Review of Multitaper Spectral Analysis , 2014, IEEE Transactions on Biomedical Engineering.

[63]  P. Achermann,et al.  Human sleep and its regulation , 2014 .

[64]  H. Landolt,et al.  Schlafgewohnheiten, Schlafqualität und Schlafmittelkonsum der Schweizer Bevölkerung – Ergebnisse aus einer neuen Umfrage bei einer repräsentativen Stichprobe , 2014 .

[65]  Joachim M. Buhmann,et al.  Convolutional Decision Trees for Feature Learning and Segmentation , 2014, GCPR.

[66]  Gerald Penn,et al.  Convolutional Neural Networks for Speech Recognition , 2014, IEEE/ACM Transactions on Audio, Speech, and Language Processing.

[67]  Cícero Nogueira dos Santos,et al.  Deep Convolutional Neural Networks for Sentiment Analysis of Short Texts , 2014, COLING.

[68]  Sergey Ioffe,et al.  Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift , 2015, ICML.

[69]  Damien Gervasoni,et al.  Unsupervised online classifier in sleep scoring for sleep deprivation studies. , 2015, Sleep.

[70]  David M. W. Powers,et al.  What the F-measure doesn't measure: Features, Flaws, Fallacies and Fixes , 2015, ArXiv.

[71]  Joachim M. Buhmann,et al.  Transformation-Invariant Convolutional Jungles , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[72]  Fei-Fei Li,et al.  Deep visual-semantic alignments for generating image descriptions , 2015, CVPR.

[73]  Tianqi Chen,et al.  XGBoost: A Scalable Tree Boosting System , 2016, KDD.

[74]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[75]  Yuan Yu,et al.  TensorFlow: A system for large-scale machine learning , 2016, OSDI.

[76]  Guigang Zhang,et al.  Deep Learning , 2016, Int. J. Semantic Comput..

[77]  P. Hanly,et al.  Staging Sleep in Polysomnograms: Analysis of Inter-Scorer Variability. , 2016, Journal of clinical sleep medicine : JCSM : official publication of the American Academy of Sleep Medicine.

[78]  John Salvatier,et al.  Theano: A Python framework for fast computation of mathematical expressions , 2016, ArXiv.

[79]  Yike Guo,et al.  Automatic Sleep Stage Scoring with Single-Channel EEG Using Convolutional Neural Networks , 2016, ArXiv.

[80]  F. Vollenweider,et al.  Neuronal oscillations and synchronicity associated with gamma-hydroxybutyrate during resting-state in healthy male volunteers , 2017, Psychopharmacology.

[81]  Fei-Fei Li,et al.  Deep visual-semantic alignments for generating image descriptions , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[82]  Olga Sourina,et al.  Large-Scale Automated Sleep Staging , 2017, Sleep.

[83]  Chao Wu,et al.  DeepSleepNet: A Model for Automatic Sleep Stage Scoring Based on Raw Single-Channel EEG , 2017, IEEE Transactions on Neural Systems and Rehabilitation Engineering.

[84]  Stanislas Chambon,et al.  A Deep Learning Architecture for Temporal Sleep Stage Classification Using Multivariate and Multimodal Time Series , 2017, IEEE Transactions on Neural Systems and Rehabilitation Engineering.

[85]  Robert Riener,et al.  The Effect of a Slowly Rocking Bed on Sleep , 2018, Scientific Reports.

[86]  Allan I Pack,et al.  Reliability of the American Academy of Sleep Medicine Rules for Assessing Sleep Depth in Clinical Practice. , 2018, Journal of clinical sleep medicine : JCSM : official publication of the American Academy of Sleep Medicine.

[87]  Robert Riener,et al.  Automatic artefact detection in single‐channel sleep EEG recordings , 2019, Journal of sleep research.