U-Sleep: resilient to AASM guidelines

AASM guidelines are the results of decades of efforts aiming at standardizing sleep scoring procedure, in order to have a commonly used methodology. The guidelines cover several aspects from the technical/digital specifications, e.g., recommended EEG derivations, to detailed sleep scoring rules accordingly to age. In the context of sleep scoring automation, deep learning has demonstrated better performance compared to many other techniques. Usually, clinical expertise and official guidelines are fundamental to support automated sleep scoring algorithms in solving the task. In this paper we show that a deep learning based sleep scoring algorithm may not need to fully exploit the clinical knowledge or 1 ar X iv :2 20 9. 11 17 3v 2 [ ee ss .S P] 2 3 Se p 20 22 Springer Nature 2021 LTEX template 2 U-Sleep: resilient to AASM guidelines to strictly follow the AASM guidelines. Specifically, we demonstrate that U-Sleep, a state-of-the-art sleep scoring algorithm, can be strong enough to solve the scoring task even using clinically non-recommended or nonconventional derivations, and with no need to exploit information about the chronological age of the subjects. We finally strengthen a well-known finding that using data from multiple data centers always results in a better performing model compared with training on a single cohort. Indeed, we show that this latter statement is still valid even by increasing the size and the heterogeneity of the single data cohort. In all our experiments we used 28528 polysomnography studies from 13 different clinical studies.

[1]  F. Faraci,et al.  DeepSleepNet-Lite: A Simplified Automatic Sleep Stage Scoring Model With Uncertainty Estimates , 2021, IEEE Transactions on Neural Systems and Rehabilitation Engineering.

[2]  C. Igel,et al.  U-Sleep: resilient high-frequency sleep staging , 2021, npj Digital Medicine.

[3]  Martin Brandt,et al.  An unexpectedly large count of trees in the West African Sahara and Sahel , 2020, Nature.

[4]  Aditya Krishna Menon,et al.  Does label smoothing mitigate label noise? , 2020, ICML.

[5]  Tuan Leng Tay,et al.  U-Net: deep learning for cell counting, detection, and morphometry , 2018, Nature Methods.

[6]  Qiao Li,et al.  You Snooze, You Win: the PhysioNet/Computing in Cardiology Challenge 2018 , 2018, 2018 Computing in Cardiology Conference (CinC).

[7]  Guo-Qiang Zhang,et al.  The National Sleep Research Resource: towards a sleep data commons , 2018, BCB.

[8]  Jacques Prado,et al.  0315 Inter- And Intra-expert Variability In Sleep Scoring: Comparison Between Visual And Automatic Analysis , 2018 .

[9]  Wei Wang,et al.  Gastric Banding Surgery versus Continuous Positive Airway Pressure for Obstructive Sleep Apnea: A Randomized Controlled Trial. , 2017, American journal of respiratory and critical care medicine.

[10]  Jonathon Shlens,et al.  A Learned Representation For Artistic Style , 2016, ICLR.

[11]  P. Hanly,et al.  Staging Sleep in Polysomnograms: Analysis of Inter-Scorer Variability. , 2016, Journal of clinical sleep medicine : JCSM : official publication of the American Academy of Sleep Medicine.

[12]  M. Grigg‐Damberger,et al.  The Visual Scoring of Sleep in Infants 0 to 2 Months of Age. , 2016, Journal of clinical sleep medicine : JCSM : official publication of the American Academy of Sleep Medicine.

[13]  Sergey Ioffe,et al.  Rethinking the Inception Architecture for Computer Vision , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[14]  S. Redline,et al.  Racial/Ethnic Differences in Sleep Disturbances: The Multi-Ethnic Study of Atherosclerosis (MESA). , 2015, Sleep.

[15]  Thomas Brox,et al.  U-Net: Convolutional Networks for Biomedical Image Segmentation , 2015, MICCAI.

[16]  S. Redline,et al.  Relationships between sleep stages and changes in cognitive function in older men: the MrOS Sleep Study. , 2015, Sleep.

[17]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[18]  Raanan Arens,et al.  A randomized trial of adenotonsillectomy for childhood sleep apnea. , 2013, The New England journal of medicine.

[19]  R. Rosenberg,et al.  The American Academy of Sleep Medicine inter-scorer reliability program: sleep stage scoring. , 2013, Journal of clinical sleep medicine : JCSM : official publication of the American Academy of Sleep Medicine.

[20]  C. Iber,et al.  A multisite randomized trial of portable sleep studies and positive airway pressure autotitration versus laboratory-based polysomnography for the diagnosis and treatment of obstructive sleep apnea: the HomePAP study. , 2012, Sleep.

[21]  S. Ancoli-Israel,et al.  Associations Between Sleep Architecture and Sleep‐Disordered Breathing and Cognition in Older Community‐Dwelling Men: The Osteoporotic Fractures in Men Sleep Study , 2011, Journal of the American Geriatrics Society.

[22]  Raanan Arens,et al.  The Childhood Adenotonsillectomy Trial (CHAT): rationale, design, and challenges of a randomized controlled trial evaluating a standard surgical procedure in a pediatric population. , 2011, Sleep.

[23]  Qiang Yang,et al.  A Survey on Transfer Learning , 2010, IEEE Transactions on Knowledge and Data Engineering.

[24]  P. Anderer,et al.  Interrater reliability for sleep scoring according to the Rechtschaffen & Kales and the new AASM standard , 2009, Journal of sleep research.

[25]  Susan Redline,et al.  Sleep‐Disordered Breathing and Cognition in Older Women , 2008, Journal of the American Geriatrics Society.

[26]  C. Guilleminault,et al.  Meta-analysis of quantitative sleep parameters from childhood to old age in healthy individuals: developing normative sleep values across the human lifespan. , 2004, Sleep.

[27]  S. Redline,et al.  Prevalence and risk factors for sleep-disordered breathing in 8- to 11-year-old children: association with race and prematurity. , 2003, The Journal of pediatrics.

[28]  Aeilko H. Zwinderman,et al.  Analysis of a sleep-dependent neuronal feedback loop: the slow-wave microcontinuity of the EEG , 2000, IEEE Transactions on Biomedical Engineering.

[29]  J. Samet,et al.  The Sleep Heart Health Study: design, rationale, and methods. , 1997, Sleep.

[30]  Harry K. Genant,et al.  Appendicular Bone Density and Age Predict Hip Fracture in Women , 1990 .

[31]  Jeffrey M. Hausdorff,et al.  Physionet: Components of a New Research Resource for Complex Physiologic Signals". Circu-lation Vol , 2000 .

[32]  Lutz Prechelt,et al.  Early Stopping-But When? , 1996, Neural Networks: Tricks of the Trade.

[33]  S. Redline,et al.  The familial aggregation of obstructive sleep apnea. , 1995, American journal of respiratory and critical care medicine.

[34]  Michael McCloskey,et al.  Catastrophic Interference in Connectionist Networks: The Sequential Learning Problem , 1989 .