Automatic Feature Selection for Sleep/Wake Classification with Small Data Sets

This paper describes an automatic feature selection algorithm integrated into a classification framework developed to discriminate between sleep and wake states during the night. The feature selection algorithm proposed in this paper uses the Mahalanobis distance and the Spearman’s ranked-order correlation as selection criteria to restrict search in a large feature space. The algorithm was tested using a leave-one-subject-out cross-validation procedure on 15 single-night PSG recordings of healthy sleepers and then compared to the results of a standard Sequential Forward Search (SFS) algorithm. It achieved comparable performance in terms of Cohen’s kappa (k = 0.62) and the Area under the Precision-Recall curve (AUCPR = 0.59), but gave a significant computational time improvement by a factor of nearly 10. The feature selection procedure, applied on each iteration of the cross-validation, was found to be stable, consistently selecting a similar list of features. It selected an average of 10.33 features per iteration, nearly half of the 21 features selected by SFS. In addition, learning curves show that the training and testing performances converge faster than for SFS and that the final training-testing performance difference is smaller, suggesting that the new algorithm is more adequate for data sets with a small number of subjects.

[1]  Haibo He,et al.  Learning from Imbalanced Data , 2009, IEEE Transactions on Knowledge and Data Engineering.

[2]  Suzanne Lesecq,et al.  Feature selection for sleep/wake stages classification using data driven methods , 2007, Biomed. Signal Process. Control..

[3]  A. Wayne Whitney,et al.  A Direct Method of Nonparametric Measurement Selection , 1971, IEEE Transactions on Computers.

[4]  J. Friedman Regularized Discriminant Analysis , 1989 .

[5]  A. Rechtschaffen,et al.  Sleep deprivation in the rat by the disk-over-water method , 1995, Behavioural Brain Research.

[6]  C. Heneghan,et al.  Sleep staging using cardiorespiratory signals , 2007 .

[7]  Elke Naujokat,et al.  Sleep/wake detection based on cardiorespiratory signals and actigraphy , 2010, 2010 Annual International Conference of the IEEE Engineering in Medicine and Biology.

[8]  Jacob Cohen A Coefficient of Agreement for Nominal Scales , 1960 .

[9]  Xi Long,et al.  Using dynamic time warping for sleep and wake discrimination , 2012, Proceedings of 2012 IEEE-EMBS International Conference on Biomedical and Health Informatics.

[10]  Mark Goadrich,et al.  The relationship between Precision-Recall and ROC curves , 2006, ICML.

[11]  Mokhtar Abdullah,et al.  On a Robust Correlation Coefficient , 1990 .

[12]  Tom Fawcett,et al.  ROC Graphs: Notes and Practical Considerations for Researchers , 2007 .

[13]  Ron Kohavi,et al.  The Case against Accuracy Estimation for Comparing Induction Algorithms , 1998, ICML.

[14]  Shigeo Abe DrEng Pattern Classification , 2001, Springer London.

[15]  E. Naujokat,et al.  Sleep Monitoring Through a Textile Recording System , 2007, 2007 29th Annual International Conference of the IEEE Engineering in Medicine and Biology Society.