Multi-modality Sensor Data Classification with Selective Attention

Multimodal wearable sensor data classification plays an important role in ubiquitous computing and has a wide range of applications in scenarios from healthcare to entertainment. However, most existing work in this field employs domain-specific approaches and is thus ineffective in complex sit- uations where multi-modality sensor data are col- lected. Moreover, the wearable sensor data are less informative than the conventional data such as texts or images. In this paper, to improve the adapt- ability of such classification methods across differ- ent application domains, we turn this classification task into a game and apply a deep reinforcement learning scheme to deal with complex situations dynamically. Additionally, we introduce a selective attention mechanism into the reinforcement learn- ing scheme to focus on the crucial dimensions of the data. This mechanism helps to capture extra information from the signal and thus it is able to significantly improve the discriminative power of the classifier. We carry out several experiments on three wearable sensor datasets and demonstrate the competitive performance of the proposed approach compared to several state-of-the-art baselines.

[1]  Michel Tokic,et al.  Adaptive epsilon-Greedy Exploration in Reinforcement Learning Based on Value Difference , 2010, KI.

[2]  Ieee Xplore,et al.  IEEE Transactions on Pattern Analysis and Machine Intelligence Information for Authors , 2022, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[3]  統計数理研究所 Annals of the institute of statistical mathematics , 1949 .

[4]  A. Syvänen,et al.  Silhouette scores for assessment of SNP genotype clusters , 2005, BMC Genomics.

[5]  Vinod Chandran,et al.  Physical Activity Recognition Using Posterior-Adapted Class-Based Fusion of Multiaccelerometer Data , 2017, IEEE Journal of Biomedical and Health Informatics.

[6]  Bowen Zhou,et al.  LSTM-based Deep Learning Models for non-factoid answer selection , 2015, ArXiv.

[7]  Lina Yao,et al.  EEG-based Intention Recognition from Spatio-Temporal Representations via Cascade and Parallel Convolutional Recurrent Neural Networks , 2017, ArXiv.

[8]  Kyungmin Su,et al.  The PREP pipeline: standardized preprocessing for large-scale EEG analysis , 2015, Front. Neuroinform..

[9]  A. Laurentini,et al.  The Visual Hull Concept for Silhouette-Based Image Understanding , 1994, IEEE Trans. Pattern Anal. Mach. Intell..

[10]  Yoshua Bengio,et al.  Empirical Evaluation of Gated Recurrent Neural Networks on Sequence Modeling , 2014, ArXiv.

[11]  Jürgen Schmidhuber,et al.  LSTM recurrent networks learn simple context-free and context-sensitive languages , 2001, IEEE Trans. Neural Networks.

[12]  Lina Yao,et al.  Intent Recognition in Smart Living Through Deep Recurrent Neural Networks , 2017, ICONIP.

[13]  H. Akaike Fitting autoregressive models for prediction , 1969 .

[14]  Shane Legg,et al.  Human-level control through deep reinforcement learning , 2015, Nature.

[15]  Lina Yao,et al.  Converting Your Thoughts to Texts: Enabling Brain Typing via Deep Feature Learning of EEG Signals , 2017, 2018 IEEE International Conference on Pervasive Computing and Communications (PerCom).

[16]  Michel Tokic Adaptive ε-greedy Exploration in Reinforcement Learning Based on Value Differences , 2010 .

[17]  James Bailey,et al.  From Shared Subspaces to Shared Landmarks: A Robust Multi-Source Classification Approach , 2017, AAAI.

[18]  Michael J. Watts,et al.  IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS Publication Information , 2020, IEEE Transactions on Neural Networks and Learning Systems.

[19]  P Cavanagh,et al.  Attention-based motion perception. , 1992, Science.

[20]  Hermann Ney,et al.  LSTM Neural Networks for Language Modeling , 2012, INTERSPEECH.

[21]  Lina Yao,et al.  Compressive Representation for Device-Free Activity Recognition with Passive RFID Signal Strength , 2018, IEEE Transactions on Mobile Computing.

[22]  Silvia Conforto,et al.  Real time event-based segmentation to classify locomotion activities through a single inertial sensor , 2015, EAI Endorsed Trans. Mob. Commun. Appl..

[23]  Tom Schaul,et al.  Dueling Network Architectures for Deep Reinforcement Learning , 2015, ICML.

[24]  J. Townshend,et al.  Land cover classification accuracy as a function of sensor spatial resolution , 1981 .

[25]  Geoffrey C. Fox,et al.  A Framework for Real Time Processing of Sensor Data in the Cloud , 2015, J. Sensors.

[26]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[27]  Charles Elkan,et al.  Learning to Diagnose with LSTM Recurrent Neural Networks , 2015, ICLR.

[28]  Nitin H. Vaidya,et al.  IEEE Transactions on Mobile Computing: Editorial , 2005 .