Reliability of the American Academy of Sleep Medicine Rules for Assessing Sleep Depth in Clinical Practice.

STUDY OBJECTIVES The American Academy of Sleep Medicine has published manuals for scoring polysomnograms that recommend time spent in non-rapid eye movement sleep stages (stage N1, N2, and N3 sleep) be reported. Given the well-established large interrater variability in scoring stage N1 and N3 sleep, we determined the range of time in stage N1 and N3 sleep scored by a large number of technologists when compared to reasonably estimated true values. METHODS Polysomnograms of 70 females were scored by 10 highly trained sleep technologists, two each from five different academic sleep laboratories. Range and confidence interval (CI = difference between the 5th and 95th percentiles) of the 10 times spent in stage N1 and N3 sleep assigned in each polysomnogram were determined. Average values of times spent in stage N1 and N3 sleep generated by the 10 technologists in each polysomnogram were considered representative of the true values for the individual polysomnogram. Accuracy of different technologists in estimating delta wave duration was determined by comparing their scores to digitally determined durations. RESULTS The CI range of the ten N1 scores was 4 to 39 percent of total sleep time (% TST) in different polysomnograms (mean CI ± standard deviation = 11.1 ± 7.1 % TST). Corresponding range for N3 was 1 to 28 % TST (14.4 ± 6.1 % TST). For stage N1 and N3 sleep, very low or very high values were reported for virtually all polysomnograms by different technologists. Technologists varied widely in their assignment of stage N3 sleep, scoring that stage when the digitally determined time of delta waves ranged from 3 to 17 seconds. CONCLUSIONS Manual scoring of non-rapid eye movement sleep stages is highly unreliable among highly trained, experienced technologists. Measures of sleep continuity and depth that are reliable and clinically relevant should be a focus of clinical research.

[1]  C. K. Mahutte,et al.  Effect of hypercapnia on the arousal response to airway occlusion during sleep in normal subjects. , 1993, Journal of applied physiology.

[2]  P. Hanly,et al.  Odds ratio product of sleep EEG as a continuous measure of sleep state. , 2015, Sleep.

[3]  Pietro Perona,et al.  Sleep spindle detection: crowdsourcing and evaluating performance of experts, non-experts, and automated methods , 2014, Nature Methods.

[4]  D. Rapoport,et al.  Interobserver agreement among sleep scorers from different centers in a large dataset. , 2000, Sleep.

[5]  R. Berry,et al.  Within-night variation in respiratory effort preceding apnea termination and EEG delta power in sleep apnea. , 1998, Journal of applied physiology.

[6]  Thomas Penzel,et al.  Agreement in the scoring of respiratory events and sleep among international sleep centers. , 2013, Sleep.

[7]  P. Hanly,et al.  Immediate postarousal sleep dynamics: an important determinant of sleep stability in obstructive sleep apnea. , 2016, Journal of applied physiology.

[8]  J. Röschke,et al.  Discrimination of sleep stages: a comparison between spectral and nonlinear EEG measures. , 1996, Electroencephalography and clinical neurophysiology.

[9]  Poul Jennum,et al.  Inter-expert and intra-expert reliability in sleep spindle scoring , 2015, Clinical Neurophysiology.

[10]  Koby Todros,et al.  Assessment of automated scoring of polysomnographic recordings in a population with suspected sleep-disordered breathing. , 2004, Sleep.

[11]  Jacques Martinerie,et al.  Entropy maps characterize drug effects on brain dynamics in Alzheimer's disease , 1998, Neuroscience Letters.

[12]  Eiji Shimizu,et al.  Approximate entropy of human respiratory movement during eye-closed waking and different sleep stages. , 2003, Chest.

[13]  P. Hanly,et al.  Staging Sleep in Polysomnograms: Analysis of Inter-Scorer Variability. , 2016, Journal of clinical sleep medicine : JCSM : official publication of the American Academy of Sleep Medicine.

[14]  R Ferri,et al.  Comparison between the results of an automatic and a visual scoring of sleep EEG recordings. , 1989, Sleep.

[15]  M. Younes,et al.  Accuracy of Automatic Polysomnography Scoring Using Frontal Electrodes. , 2016, Journal of clinical sleep medicine : JCSM : official publication of the American Academy of Sleep Medicine.

[16]  P. Hanly,et al.  Minimizing Interrater Variability in Staging Sleep by Use of Computer-Derived Features. , 2016, Journal of clinical sleep medicine : JCSM : official publication of the American Academy of Sleep Medicine.

[17]  A. Krystal,et al.  Measuring sleep quality. , 2008, Sleep medicine.

[18]  E. Bruce,et al.  Sample Entropy Tracks Changes in Electroencephalogram Power Spectrum With Sleep State and Aging , 2009, Journal of clinical neurophysiology : official publication of the American Electroencephalographic Society.

[19]  Michael C. K. Khoo,et al.  Determining a continuous marker for sleep depth , 2007, Comput. Biol. Medicine.

[20]  M. Younes,et al.  Enhancements to the multiple sleep latency test , 2016, Nature and science of sleep.

[21]  M. Younes,et al.  Utility of Technologist Editing of Polysomnography Scoring Performed by a Validated Automatic System. , 2015, Annals of the American Thoracic Society.

[22]  Harold L. Williams,et al.  RESPONSES TO AUDITORY STIMULATION, SLEEP LOSS AND THE EEG STAGES OF SLEEP. , 1964, Electroencephalography and clinical neurophysiology.

[23]  Eleni Giannouli,et al.  Performance of a New Portable Wireless Sleep Monitor. , 2017, Journal of clinical sleep medicine : JCSM : official publication of the American Academy of Sleep Medicine.

[24]  Kazuhiko Fukuda,et al.  Proposed supplements and amendments to ‘A Manual of Standardized Terminology, Techniques and Scoring System for Sleep Stages of Human Subjects’, the Rechtschaffen & Kales (1968) standard , 2001, Psychiatry and clinical neurosciences.

[25]  A. Pack,et al.  Performance of an automated polysomnography scoring system versus computer-assisted manual scoring. , 2013, Sleep.

[26]  M. Younes,et al.  Assessment of intervention-related changes in non-rapid-eye-movement sleep depth: importance of sleep depth changes within stage 2. , 2017, Sleep medicine.

[27]  Thomas Penzel,et al.  Process and outcome for international reliability in sleep scoring , 2015, Sleep and Breathing.

[28]  N. Collop Scoring variability between polysomnography technologists in different sleep laboratories. , 2002, Sleep medicine.

[29]  Georg Dorffner,et al.  Computer-Assisted Automated Scoring of Polysomnograms Using the Somnolyzer System. , 2015, Sleep.

[30]  A. Schlögl,et al.  Interrater reliability between scorers from eight European sleep laboratories in subjects with different sleep disorders , 2004, Journal of sleep research.

[31]  Atul Malhotra,et al.  Agreement in computer-assisted manual scoring of polysomnograms across sleep centers. , 2013, Sleep.

[32]  Roberto Hornero,et al.  Analysis of regularity in the EEG background activity of Alzheimer's disease patients with Approximate Entropy , 2005, Clinical Neurophysiology.

[33]  S. Redline,et al.  Reliability of scoring respiratory disturbance indices and sleep staging. , 1998, Sleep.

[34]  C. Shapiro,et al.  Ventilatory and arousal responses to added inspiratory resistance during sleep. , 1989, The American review of respiratory disease.

[35]  P. F. Meier,et al.  Dimensional complexity and spectral properties of the human sleep EEG , 2003, Clinical Neurophysiology.

[36]  A. Schlögl,et al.  An E-Health Solution for Automatic Sleep Classification according to Rechtschaffen and Kales: Validation Study of the Somnolyzer 24 × 7 Utilizing the Siesta Database , 2005, Neuropsychobiology.