Sleep Apnea Detection via Depth Video and Audio Feature Learning

Obstructive sleep apnea, characterized by repetitive obstruction in the upper airway during sleep, is a common sleep disorder that could significantly compromise sleep quality and quality of life in general. The obstructive respiratory events can be detected by attended in-laboratory or unattended ambulatory sleep studies. Such studies require many attachments to a patient's body to track respiratory and physiological changes, which can be uncomfortable and compromise the patient's sleep quality. In this paper, we propose to record depth video and audio of a patient using a Microsoft Kinect camera during his/her sleep, and extract relevant features to correlate with obstructive respiratory events scored manually by a scientific officer based on data collected by Philips system Alice6 LDxS that is commonly used in sleep clinics. Specifically, we first propose an alternating-frame H.264 video encoding scheme and bit recovery scheme at the decoder. Next, we perform depth video temporal denoising using a motion vector graph smoothness prior. Then, we build a dual-ellipse model and track a patient's chest and abdominal movements in the denoised videos. Finally, we extract features from both depth video and audio for classifier training and respiratory event detection. Experimental results show 1) that our depth video compression scheme outperforms a competitor that records only the 8 most significant bits, 2) our graph-based temporal denoising scheme reduces the flickering effect without over-smoothing, and 3) our trained classifiers can deduce respiratory events scored manually based on data collected by system Alice6 LDxS with high accuracy.

[1]  R. Stiefelhagen,et al.  Breath rate monitoring during sleep using near-ir imagery and PCA , 2012, Proceedings of the 21st International Conference on Pattern Recognition (ICPR2012).

[2]  John A. Nelder,et al.  A Simplex Method for Function Minimization , 1965, Comput. J..

[3]  Sergio Escalera,et al.  Automatic Sleep System Recommendation by Multi-modal RBG-Depth-Pressure Anthropometric Analysis , 2017, International Journal of Computer Vision.

[4]  Sneha Kumar Kasera,et al.  Monitoring Breathing via Signal Strength in Wireless Networks , 2011, IEEE Transactions on Mobile Computing.

[5]  Nuria Oliver,et al.  HealthGear: Automatic Sleep Apnea Detection and Monitoring with a Mobile Phone , 2007, J. Commun..

[6]  Ram Nevatia,et al.  Body Part Detection for Human Pose Estimation and Tracking , 2007, 2007 IEEE Workshop on Motion and Video Computing (WMVC'07).

[7]  Liang-Gee Chen,et al.  Survey on Block Matching Motion Estimation Algorithms and Architectures with New Results , 2006, J. VLSI Signal Process..

[8]  Pascal Frossard,et al.  The emerging field of signal processing on graphs: Extending high-dimensional data analysis to networks and other irregular domains , 2012, IEEE Signal Processing Magazine.

[9]  William H. Press,et al.  Numerical Recipes 3rd Edition: The Art of Scientific Computing , 2007 .

[10]  Min Hong,et al.  Sleep Monitoring System Using Kinect Sensor , 2015, Int. J. Distributed Sens. Networks.

[11]  Andrew Hunter,et al.  Unconstrained Video Monitoring of Breathing Behavior and Application to Diagnosis of Sleep Apnea , 2014, IEEE Transactions on Biomedical Engineering.

[12]  Gene Cheung,et al.  Sleep monitoring via depth video compression & analysis , 2014, 2014 IEEE International Conference on Multimedia and Expo Workshops (ICMEW).

[13]  Vangelis Metsis,et al.  Non-invasive analysis of sleep patterns via multimodal sensor input , 2012, Personal and Ubiquitous Computing.

[14]  Chih-Jen Lin,et al.  A comparison of methods for multiclass support vector machines , 2002, IEEE Trans. Neural Networks.

[15]  Oscar C. Au,et al.  Redefining self-similarity in natural images for denoising using graph signal gradient , 2014, Signal and Information Processing Association Annual Summit and Conference (APSIPA), 2014 Asia-Pacific.

[16]  Niclas Palmius,et al.  SleepAp: An automated obstructive sleep apnoea screening application for smartphones , 2013, Computing in Cardiology 2013.

[17]  Theodoros Giannakopoulos,et al.  Introduction to Audio Analysis: A MATLAB® Approach , 2014 .

[18]  Jeffrey C. Lagarias,et al.  Convergence Properties of the Nelder-Mead Simplex Method in Low Dimensions , 1998, SIAM J. Optim..

[19]  Ajay Luthra,et al.  Overview of the H.264/AVC video coding standard , 2003, IEEE Trans. Circuits Syst. Video Technol..

[20]  Chih-Jen Lin,et al.  A Practical Guide to Support Vector Classication , 2008 .

[21]  Ronald R. Coifman,et al.  Wavelet analysis and signal processing , 1990 .

[22]  Isabelle Guyon,et al.  An Introduction to Variable and Feature Selection , 2003, J. Mach. Learn. Res..

[23]  S. Mallat A wavelet tour of signal processing , 1998 .

[24]  Sergio Escalera,et al.  Multi-part body segmentation based on depth maps for soft biometry analysis , 2015, Pattern Recognit. Lett..

[25]  Gamini Dissanayake,et al.  Driver Drowsiness Classification Using Fuzzy Wavelet-Packet-Based Feature-Extraction Algorithm , 2011, IEEE Transactions on Biomedical Engineering.

[26]  S. Quan,et al.  Rules for scoring respiratory events in sleep: update of the 2007 AASM Manual for the Scoring of Sleep and Associated Events. Deliberations of the Sleep Apnea Definitions Task Force of the American Academy of Sleep Medicine. , 2012, Journal of clinical sleep medicine : JCSM : official publication of the American Academy of Sleep Medicine.

[27]  Thomas Penzel,et al.  Sleep Detection Using De-identified Depth Data , 2014, J. Mobile Multimedia.

[28]  Jun Yu,et al.  Semantic preserving distance metric learning and applications , 2014, Inf. Sci..

[29]  Alexandra Branzan Albu,et al.  Towards an Intelligent Bed Sensor: Non-intrusive Monitoring of Sleep Irregularities with Computer Vision Techniques , 2010, 2010 20th International Conference on Pattern Recognition.

[30]  Hannu Toivonen,et al.  Unobtrusive online monitoring of sleep at home , 2012, 2012 Annual International Conference of the IEEE Engineering in Medicine and Biology Society.

[31]  W. Gander,et al.  Least-squares fitting of circles and ellipses , 1994 .

[32]  Ingrid Daubechies,et al.  Ten Lectures on Wavelets , 1992 .

[33]  D. Falie,et al.  Respiratory motion visualization and the sleep apnea diagnosis with the time of flight (ToF) camera , 2008 .

[34]  Paul L. Rosin Analysing Error of Fit Functions for Ellipses , 1996, BMVC.

[35]  Linlin Jiang,et al.  Automatic sleep monitoring system for home healthcare , 2012, Proceedings of 2012 IEEE-EMBS International Conference on Biomedical and Health Informatics.

[36]  A. Chesson,et al.  The AASM Manual for the Scoring of Sleep and Associated Events: Rules, Terminology, and Techinical Specifications , 2007 .

[37]  Oscar C. Au,et al.  Multiresolution Graph Fourier Transform for Compression of Piecewise Smooth Images , 2015, IEEE Transactions on Image Processing.

[38]  Zhengyou Zhang,et al.  A Flexible New Technique for Camera Calibration , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[39]  Stphane Mallat,et al.  A Wavelet Tour of Signal Processing, Third Edition: The Sparse Way , 2008 .

[40]  Ales Procházka,et al.  Feature extraction using MS Kinect and data fusion in analysis of sleep disorders , 2015, 2015 International Workshop on Computational Intelligence for Multimedia Understanding (IWCIM).

[41]  Sander Oude Elberink,et al.  Accuracy and Resolution of Kinect Depth Data for Indoor Mapping Applications , 2012, Sensors.

[42]  Truong Q. Nguyen,et al.  An Augmented Lagrangian Method for Total Variation Video Restoration , 2011, IEEE Transactions on Image Processing.

[43]  Ronald R. Coifman,et al.  Entropy-based algorithms for best basis selection , 1992, IEEE Trans. Inf. Theory.

[44]  D. Eberly Distance from a Point to an Ellipse, an Ellipsoid, or a Hyperellipsoid , 2006 .

[45]  Bernhard P. Wrobel,et al.  Multiple View Geometry in Computer Vision , 2001 .

[46]  Fanglin Chen,et al.  Unobtrusive sleep monitoring using smartphones , 2013, 2013 7th International Conference on Pervasive Computing Technologies for Healthcare and Workshops.

[47]  Erry,et al.  Prospective study of the association between sleep-disordered breathing and hypertension. , 2000, The New England journal of medicine.

[48]  Oscar C. Au,et al.  Depth map denoising using graph-based transform and group sparsity , 2013, 2013 IEEE 15th International Workshop on Multimedia Signal Processing (MMSP).

[49]  Minh N. Do,et al.  Depth Video Enhancement Based on Weighted Mode Filtering , 2012, IEEE Transactions on Image Processing.

[50]  Beno Benhabib,et al.  Accurate parameter estimation of quadratic curves from grey-level images , 1991, CVGIP Image Underst..

[51]  A. Loblaw,et al.  Remote respiratory sensing with an infrared camera using the Kinect TM infrared projector , 2013 .

[52]  Michael W. Berry,et al.  Algorithms and applications for approximate nonnegative matrix factorization , 2007, Comput. Stat. Data Anal..

[53]  P. Paatero,et al.  Positive matrix factorization: A non-negative factor model with optimal utilization of error estimates of data values† , 1994 .

[54]  Gene Cheung,et al.  Graph-based depth video denoising and event detection for sleep monitoring , 2014, 2014 IEEE 16th International Workshop on Multimedia Signal Processing (MMSP).

[55]  Marco Aiello,et al.  Human Breath Detection using a Microphone , 2013 .

[56]  David H. Eberly,et al.  3D Game Engine Design, Second Edition: A Practical Approach to Real-Time Computer Graphics (The Morgan Kaufmann Series in Interactive 3D Technology) , 2006 .

[57]  Andrew Hunter,et al.  Erratum to: Robust Pose Recognition of the Obscured Human Body , 2011, International Journal of Computer Vision.

[58]  Kenneth C. Pohlmann,et al.  Principles of Digital Audio , 1986 .

[59]  Dmitriy Vatolin,et al.  Temporal filtering for depth maps generated by Kinect depth camera , 2011, 2011 3DTV Conference: The True Vision - Capture, Transmission and Display of 3D Video (3DTV-CON).

[60]  Ming-Sui Lee,et al.  Multiparameter Sleep Monitoring Using a Depth Camera , 2012, BIOSTEC.

[61]  Andrew W. Fitzgibbon,et al.  Real-time human pose recognition in parts from single depth images , 2011, CVPR 2011.

[62]  Roberto Manduchi,et al.  Bilateral filtering for gray and color images , 1998, Sixth International Conference on Computer Vision (IEEE Cat. No.98CH36271).

[63]  H. Sebastian Seung,et al.  Learning the parts of objects by non-negative matrix factorization , 1999, Nature.

[64]  Yuan Yan Tang,et al.  High-Order Distance-Based Multiview Stochastic Learning in Image Classification , 2014, IEEE Transactions on Cybernetics.

[65]  Atul Malhotra,et al.  Obstructive sleep apnoea , 2002, The Lancet.

[66]  James T. Patrie,et al.  Development and Preliminary Validation of Heart Rate and Breathing Rate Detection Using a Passive, Ballistocardiography-Based Sleep Monitoring System , 2009, IEEE Transactions on Information Technology in Biomedicine.

[67]  Yi-Ping Hung,et al.  A sleep monitoring system based on audio, video and depth information for detecting sleep events , 2014, 2014 IEEE International Conference on Multimedia and Expo (ICME).

[68]  Ron Kohavi,et al.  Wrappers for Feature Subset Selection , 1997, Artif. Intell..

[69]  Oscar C. Au,et al.  Optimal graph laplacian regularization for natural image denoising , 2015, 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[70]  Oscar C. Au,et al.  Depth map compression using multi-resolution graph-based transform for depth-image-based rendering , 2012, 2012 19th IEEE International Conference on Image Processing.

[71]  Stephen P. Boyd,et al.  Convex Optimization , 2004, Algorithms and Theory of Computation Handbook.

[72]  João S. Domingos,et al.  A review of current sleep screening applications for smartphones , 2013, Physiological measurement.

[73]  H. Sebastian Seung,et al.  Algorithms for Non-negative Matrix Factorization , 2000, NIPS.