Dual-CNN based multi-modal sleep scoring with temporal correlation driven fine-tuning

Abstract Examination of dysfunctional brain dynamics often needs to tackle the difficulties of analyzing neural data of multiple modalities as recordings from a single source (such as EEG) do not always suffice in characterizing the brain states of interest. A typical example is sleep scoring from Polysomnography (PSG) for evaluation of sleep disorder, which has long been an onerous task and constantly results in unstable performance. Despite numerous successes achieved by the deep learning technologies recently booming along this direction, a grand challenge still remains: how to enable a stable solution to characterize the sleep stages (brain states) with temporal correlations embedded in the multi-modal recordings mainly from the brain activities without the support of sufficient a priori knowledge of the subjects. Aiming at the challenge, this study develops a deep learning framework for multi-modal sleep scoring. The framework uses a “dual-CNN” to simultaneously process inputs of two forms, i.e., temporal or time-frequency. The inputs of each branch may either consist of organized epochs of multiple modalities or their time-frequency representations. The fused deep features of the original multi-modal dataset will then be derived. The correlation of the original consecutive epochs embedded in the deep features can be reinforced through an RNN layer to aid classification with the final results fine-tuned by a customized Markov chain model, which considers the temporal correlations embedded in the deep features to amortize the potential bias resulted from the imbalance of brain states represented by the features. A case study of sleep scoring has been performed against a public Sleep-EDF dataset. Experimental results indicate that 1) the multi-modal features generated by the dual-CNN model help in significantly improve the accuracy of neural data classification; 2) the overall framework outperforms the state-of-the-art counterparts with an accuracy of 84.9% achieved in sleep scoring.

[1]  Xiaoqiang Lu,et al.  Exploiting Embedding Manifold of Autoencoders for Hyperspectral Anomaly Detection , 2020, IEEE Transactions on Geoscience and Remote Sensing.

[2]  Hao Sun,et al.  A Feature Aggregation Convolutional Neural Network for Remote Sensing Scene Classification , 2019, IEEE Transactions on Geoscience and Remote Sensing.

[3]  Jasmin Kevric,et al.  Performance evaluation of empirical mode decomposition, discrete wavelet transform, and wavelet packed decomposition for automated epileptic seizure detection and prediction , 2018, Biomed. Signal Process. Control..

[4]  Vaibhava Goel,et al.  Deep multimodal learning for Audio-Visual Speech Recognition , 2015, 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[5]  Maarten De Vos,et al.  Multichannel Sleep Stage Classification and Transfer Learning using Convolutional Neural Networks , 2018, 2018 40th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC).

[6]  Xiaoli Li,et al.  Cloud‐aided online EEG classification system for brain healthcare: A case study of depression evaluation with a lightweight CNN , 2020, Softw. Pract. Exp..

[7]  Yan Guozheng,et al.  EEG feature extraction based on wavelet packet decomposition for brain computer interface , 2008 .

[8]  Fali Li,et al.  Neuroscience Information Toolbox: An Open Source Toolbox for EEG–fMRI Multimodal Fusion Analysis , 2018, Front. Neuroinform..

[9]  Yijun Wang,et al.  Amplitude and phase coupling measures for feature extraction in an EEG-based brain–computer interface , 2007, Journal of neural engineering.

[10]  Peng Xu,et al.  Inter-subject P300 variability relates to the efficiency of brain networks reconfigured from resting- to task-state: Evidence from a simultaneous event-related EEG-fMRI study , 2020, NeuroImage.

[11]  Gyanendra K. Verma,et al.  Multimodal fusion framework: A multiresolution approach for emotion classification and recognition from physiological signals , 2014, NeuroImage.

[12]  Jérémie Mattout,et al.  Symmetrical event-related EEG/fMRI information fusion in a variational Bayesian framework , 2007, NeuroImage.

[13]  Joon Son Chung,et al.  Deep Audio-Visual Speech Recognition , 2018, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[14]  Xiangtao Zheng,et al.  Multisource Compensation Network for Remote Sensing Cross-Domain Scene Classification , 2020, IEEE Transactions on Geoscience and Remote Sensing.

[15]  Aeilko H. Zwinderman,et al.  Analysis of a sleep-dependent neuronal feedback loop: the slow-wave microcontinuity of the EEG , 2000, IEEE Transactions on Biomedical Engineering.

[16]  C. O’Reilly,et al.  Montreal Archive of Sleep Studies: an open‐access resource for instrument benchmarking and exploratory research , 2014, Journal of sleep research.

[17]  Stanislas Chambon,et al.  A Deep Learning Architecture for Temporal Sleep Stage Classification Using Multivariate and Multimodal Time Series , 2017, IEEE Transactions on Neural Systems and Rehabilitation Engineering.

[18]  Reza Sameni,et al.  A Distributed Classification Procedure for Automatic Sleep Stage Scoring Based on Instantaneous Electroencephalogram Phase and Envelope Features , 2018, IEEE Transactions on Neural Systems and Rehabilitation Engineering.

[19]  Esther Rodríguez-Villegas,et al.  An open-source toolbox for standardized use of PhysioNet Sleep EDF Expanded Database , 2015, 2015 37th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC).

[20]  Hao Zhang,et al.  Incremental Factorization of Big Time Series Data with Blind Factor Approximation , 2019, IEEE Transactions on Knowledge and Data Engineering.

[21]  Jin Jiang,et al.  Time-frequency feature representation using energy concentration: An overview of recent advances , 2009, Digit. Signal Process..

[22]  Yike Guo,et al.  Automatic Sleep Stage Scoring Using Time-Frequency Analysis and Stacked Sparse Autoencoders , 2015, Annals of Biomedical Engineering.

[23]  Benyun Shi,et al.  Improving Brain E-Health Services via High-Performance EEG Classification With Grouping Bayesian Optimization , 2020, IEEE Transactions on Services Computing.

[24]  P. Brown,et al.  EEG–EMG, MEG–EMG and EMG–EMG frequency analysis: physiological principles and clinical applications , 2002, Clinical Neurophysiology.

[25]  Albert Y. Zomaya,et al.  Bayesian tensor factorization for multi-way analysis of multi-dimensional EEG , 2018, Neurocomputing.

[26]  Saeid Sanei,et al.  Deep learning for epileptic intracranial EEG data , 2016, 2016 IEEE 26th International Workshop on Machine Learning for Signal Processing (MLSP).

[27]  Tzyy-Ping Jung,et al.  Spatial Filtering for EEG-Based Regression Problems in Brain–Computer Interface (BCI) , 2017, IEEE Transactions on Fuzzy Systems.

[28]  Xiangtao Zheng,et al.  Sound Active Attention Framework for Remote Sensing Image Captioning , 2020, IEEE Transactions on Geoscience and Remote Sensing.

[29]  Oliver Y. Chén,et al.  SeqSleepNet: End-to-End Hierarchical Recurrent Neural Network for Sequence-to-Sequence Automatic Sleep Staging , 2018, IEEE Transactions on Neural Systems and Rehabilitation Engineering.

[30]  L. Garnero,et al.  Combined MEG and EEG source imaging by minimization of mutual information , 1999, IEEE Transactions on Biomedical Engineering.

[31]  Chao Wu,et al.  DeepSleepNet: A Model for Automatic Sleep Stage Scoring Based on Raw Single-Channel EEG , 2017, IEEE Transactions on Neural Systems and Rehabilitation Engineering.