Assessment of Feature Selection and Classification Approaches to Enhance Information from overnight oximetry in the Context of Apnea Diagnosis

This study is aimed at assessing the usefulness of different feature selection and classification methodologies in the context of sleep apnea hypopnea syndrome (SAHS) detection. Feature extraction, selection and classification stages were applied to analyze blood oxygen saturation (SaO2) recordings in order to simplify polysomnography (PSG), the gold standard diagnostic methodology for SAHS. Statistical, spectral and nonlinear measures were computed to compose the initial feature set. Principal component analysis (PCA), forward stepwise feature selection (FSFS) and genetic algorithms (GAs) were applied to select feature subsets. Fisher's linear discriminant (FLD), logistic regression (LR) and support vector machines (SVMs) were applied in the classification stage. Optimum classification algorithms from each combination of these feature selection and classification approaches were prospectively validated on datasets from two independent sleep units. FSFS + LR achieved the highest diagnostic performance using a small feature subset (4 features), reaching 83.2% accuracy in the validation set and 88.7% accuracy in the test set. Similarly, GAs + SVM also achieved high generalization capability using a small number of input features (7 features), with 84.2% accuracy on the validation set and 84.5% accuracy in the test set. Our results suggest that reduced subsets of complementary features (25% to 50% of total features) and classifiers with high generalization ability could provide high-performance screening tools in the context of SAHS.

[1]  Simon Haykin,et al.  Neural Networks: A Comprehensive Foundation , 1998 .

[2]  V. Kapur,et al.  Obstructive sleep apnea devices for out-of-center (OOC) testing: technology evaluation. , 2011, Journal of clinical sleep medicine : JCSM : official publication of the American Academy of Sleep Medicine.

[3]  Roberto Hornero,et al.  Radial basis function classifiers to help in the diagnosis of the obstructive sleep apnoea syndrome from nocturnal oximetry , 2008, Medical & Biological Engineering & Computing.

[4]  Enrico Zio,et al.  Genetic algorithm-based wrapper approach for grouping condition monitoring signals of nuclear power plant components , 2011, Integr. Comput. Aided Eng..

[5]  Eric R. Ziegel,et al.  Applied Multivariate Data Analysis , 2002, Technometrics.

[6]  D. L. Hudson,et al.  Applying continuous chaotic modeling to cardiac signal analysis , 1996 .

[7]  B. Gersh,et al.  Obstructive sleep apnea: implications for cardiac and vascular disease. , 2003, JAMA.

[8]  J. Victor Marcos,et al.  Feature selection from nocturnal oximetry using genetic algorithms to assist in obstructive sleep apnoea diagnosis. , 2012, Medical engineering & physics.

[9]  C. Stam,et al.  Nonlinear dynamical analysis of EEG and MEG: Review of an emerging field , 2005, Clinical Neurophysiology.

[10]  B. Grant,et al.  Prediction of the apnea-hypopnea index from overnight pulse oximetry. , 2003, Chest.

[11]  L. Olson,et al.  Prediction of sleep‐disordered breathing by unattended overnight oximetry , 1999, Journal of sleep research.

[12]  Robert F. Ling,et al.  Applied Multivariate Data Analysis, Vol. I: Regression and Experimental Design (J. D. Jobson) , 1992, SIAM Rev..

[13]  Kezhi Mao,et al.  Fast orthogonal forward selection algorithm for feature subset selection , 2002, IEEE Trans. Neural Networks.

[14]  Arkady Borisov,et al.  Ranking-Based Kernels in Applied Biomedical Diagnostics Using a Support Vector Machine , 2011, Int. J. Neural Syst..

[15]  U. Rajendra Acharya,et al.  Application of Recurrence Quantification Analysis for the Automated Identification of Epileptic EEG Signals , 2011, Int. J. Neural Syst..

[16]  D. Kristo,et al.  Overnight pulse oximetry for sleep-disordered breathing in adults: a review. , 2001, Chest.

[17]  Sam Kwong,et al.  Genetic algorithms and their applications , 1996, IEEE Signal Process. Mag..

[18]  Ron Kohavi,et al.  Irrelevant Features and the Subset Selection Problem , 1994, ICML.

[19]  Erik W. Jensen,et al.  EEG complexity as a measure of depth of anesthesia for patients , 2001, IEEE Trans. Biomed. Eng..

[20]  Marimuthu Palaniswami,et al.  Support Vector Machines for Automated Recognition of Obstructive Sleep Apnea Syndrome From ECG Recordings , 2009, IEEE Transactions on Information Technology in Biomedicine.

[21]  John H. Kalivas,et al.  Comparison of Forward Selection, Backward Elimination, and Generalized Simulated Annealing for Variable Selection , 1993 .

[22]  Steven M. Pincus Assessing Serial Irregularity and Its Implications for Health , 2001, Annals of the New York Academy of Sciences.

[23]  J. Victor Marcos,et al.  Assessment of four statistical pattern recognition techniques to assist in obstructive sleep apnoea diagnosis from nocturnal oximetry. , 2009, Medical engineering & physics.

[24]  Inés María Galván,et al.  Recursive Discriminant Regression Analysis to Find Homogeneous Groups , 2011, Int. J. Neural Syst..

[25]  D. L. Hudson,et al.  New chaotic methods for biomedical signal analysis , 2000, Proceedings 2000 IEEE EMBS International Conference on Information Technology Applications in Biomedicine. ITAB-ITIS 2000. Joint Meeting Third IEEE EMBS International Conference on Information Technol.

[26]  Sami F. Masri,et al.  Finite Element Model Updating Using Evolutionary Strategy for Damage Detection , 2011, Comput. Aided Civ. Infrastructure Eng..

[27]  Roberto Hornero,et al.  Automated detection of obstructive sleep apnoea syndrome from oxygen saturation recordings using linear discriminant analysis , 2010, Medical & Biological Engineering & Computing.

[28]  U. Rajendra Acharya,et al.  Automatic Detection of Epileptic EEG Signals Using Higher Order cumulant Features , 2011, Int. J. Neural Syst..

[29]  Christophe Croux,et al.  An Information Criterion for Variable Selection in Support Vector Machines , 2008, J. Mach. Learn. Res..

[30]  Jiebo Luo,et al.  Learning multi-label scene classification , 2004, Pattern Recognit..

[31]  David W. Hosmer,et al.  Applied Logistic Regression , 1991 .

[32]  W. Flemons,et al.  Clinical usefulness of home oximetry compared with polysomnography for assessment of sleep apnea. , 2005, American journal of respiratory and critical care medicine.

[33]  D. Abásolo,et al.  Extraction of spectral based measures from MEG background oscillations in Alzheimer's disease. , 2007, Medical engineering & physics.

[34]  Roberto Hornero,et al.  Improving diagnostic ability of blood oxygen saturation from overnight pulse oximetry in obstructive sleep apnea detection by means of central tendency measure , 2007, Artif. Intell. Medicine.

[35]  F. Gudé,et al.  Oximetry spectral analysis in the diagnosis of obstructive sleep apnoea. , 1999, Clinical science.

[36]  Vladimir Vapnik,et al.  An overview of statistical learning theory , 1999, IEEE Trans. Neural Networks.

[37]  W. Flemons,et al.  Home diagnosis of sleep apnea: a systematic review of the literature. An evidence review cosponsored by the American Academy of Sleep Medicine, the American College of Chest Physicians, and the American Thoracic Society. , 2003, Chest.

[38]  A. Schwartz,et al.  Adult obstructive sleep apnea: pathophysiology and diagnosis. , 2007, Chest.

[39]  W. M. Anderson,et al.  Clinical guidelines for the use of unattended portable monitors in the diagnosis of obstructive sleep apnea in adult patients. Portable Monitoring Task Force of the American Academy of Sleep Medicine. , 2007, Journal of clinical sleep medicine : JCSM : official publication of the American Academy of Sleep Medicine.

[40]  Giuseppe Quaranta,et al.  Modified Genetic Algorithm for the Dynamic Identification of Structural Systems Using Incomplete Measurements , 2011, Comput. Aided Civ. Infrastructure Eng..

[41]  D. Shucard,et al.  Validity of neural network in sleep apnea. , 1999, Sleep.

[42]  Christophe Croux,et al.  An Information Criterion for Variable Selection in Support Vector Machines , 2007 .

[43]  Panagiotis Patrinos,et al.  Variable Selection in Nonlinear Modeling Based on RBF Networks and Evolutionary Computation , 2010, Int. J. Neural Syst..

[44]  Jack Sklansky,et al.  A note on genetic algorithms for large-scale feature selection , 1989, Pattern Recognition Letters.

[45]  T. Young,et al.  Risk factors for obstructive sleep apnea in adults. , 2004, JAMA.

[46]  D. Abásolo,et al.  Nonlinear characteristics of blood oxygen saturation from nocturnal oximetry for obstructive sleep apnoea detection , 2006, Physiological measurement.

[47]  Ying Lee,et al.  A Computerized Feature Selection Method Using Genetic Algorithms to Forecast Freeway Accident Duration Times , 2010, Comput. Aided Civ. Infrastructure Eng..

[48]  W. D. Ray Applied Multivariate Data Analysis: Vol. 1, Regression and Experimental Design , 1992 .

[49]  J. Richman,et al.  Physiological time-series analysis using approximate entropy and sample entropy. , 2000, American journal of physiology. Heart and circulatory physiology.

[50]  Thomas W. Rauber,et al.  Diagnosing multiple faults in oil rig motor pumps using support vector machine classifier ensembles , 2011, Integr. Comput. Aided Eng..

[51]  Nasser M. Nasrabadi,et al.  Pattern Recognition and Machine Learning , 2006, Technometrics.

[52]  Ludmila I Kuncheva,et al.  Classifier ensembles for fMRI data analysis: an experiment. , 2010, Magnetic resonance imaging.

[53]  Ron Kohavi,et al.  Wrappers for Feature Subset Selection , 1997, Artif. Intell..

[54]  G.F. Inbar,et al.  Feature selection for the classification of movements from single movement-related potentials , 2002, IEEE Transactions on Neural Systems and Rehabilitation Engineering.

[55]  Isabelle Guyon,et al.  An Introduction to Variable and Feature Selection , 2003, J. Mach. Learn. Res..

[56]  W. Ward Flemons,et al.  Home diagnosis of sleep apnoeas: A systematic review of the literature , 2003 .

[57]  Chung-Chieh Yu,et al.  Smoothed periodogram of oxyhemoglobin saturation by pulse oximetry in sleep apnea syndrome: an automated analysis. , 2007, Chest.

[58]  Ron Kohavi,et al.  Wrappers for feature selection , 1997 .

[59]  S. Cerutti,et al.  Automatic screening of obstructive sleep apnea from the ECG based on empirical mode decomposition and wavelet analysis , 2008, 2008 30th Annual International Conference of the IEEE Engineering in Medicine and Biology Society.

[60]  Ulrik Söderström,et al.  Reconstruction of occluded facial images using asymmetrical Principal Component Analysis , 2011, 2011 18th International Conference on Systems, Signals and Image Processing.

[61]  U. Rajendra Acharya,et al.  Analysis and Automatic Identification of Sleep Stages Using Higher Order Spectra , 2010, Int. J. Neural Syst..

[62]  Asoke K. Nandi,et al.  Evolution of superFeatures through genetic programming , 2011, Expert Syst. J. Knowl. Eng..

[63]  Roberto Hornero,et al.  Multivariate Analysis of Blood Oxygen Saturation Recordings in Obstructive Sleep Apnea Diagnosis , 2010, IEEE Transactions on Biomedical Engineering.

[64]  J. Victor Marcos,et al.  Linear and nonlinear analysis of airflow recordings to help in sleep apnoea–hypopnoea syndrome diagnosis , 2012, Physiological measurement.

[65]  Conor Heneghan,et al.  Automated processing of the single-lead electrocardiogram for the detection of obstructive sleep apnoea , 2003, IEEE Transactions on Biomedical Engineering.

[66]  J. Jobson,et al.  Applied Multivariate Data Analysis: Regression and Experimental Design , 1999 .

[67]  A. Gelman Scaling regression inputs by dividing by two standard deviations , 2008, Statistics in medicine.

[68]  U. Rajendra Acharya,et al.  Application of Non-Linear and Wavelet Based Features for the Automated Identification of Epileptic EEG signals , 2012, Int. J. Neural Syst..

[69]  Thomas Penzel,et al.  Comparison of detrended fluctuation analysis and spectral analysis for heart rate variability in sleep and sleep apnea , 2003, IEEE Transactions on Biomedical Engineering.

[70]  Marimuthu Palaniswami,et al.  Automated recognition of patients with obstructive sleep apnoea using wavelet-based features of electrocardiogram recordings , 2009, Comput. Biol. Medicine.

[71]  George Forman,et al.  An Extensive Empirical Study of Feature Selection Metrics for Text Classification , 2003, J. Mach. Learn. Res..