Dilated causal convolution with multi-head self attention for sensor human activity recognition

Systems of sensor human activity recognition are becoming increasingly popular in diverse fields such as healthcare and security. Yet, developing such systems poses inherent challenges due to the variations and complexity of human behaviors during the performance of physical activities. Recurrent neural networks, particularly long short-term memory have achieved promising results on numerous sequential learning problems, including sensor human activity recognition. However, parallelization is inhibited in recurrent networks due to sequential operation and computation that lead to slow training, occupying more memory and hard convergence. One-dimensional convolutional neural network processes input temporal sequential batches independently that lead to effectively executed operations in parallel. Despite that, a one-dimensional Convolutional Neural Network is not sensitive to the order of the time steps which is crucial for accurate and robust systems of sensor human activity recognition. To address this problem, we propose a network architecture based on dilated causal convolution and multi-head self-attention mechanisms that entirely dispense recurrent architectures to make efficient computation and maintain the ordering of the time steps. The proposed method is evaluated for human activities using smart home binary sensors data and wearable sensor data. Results of conducted extensive experiments on eight public and benchmark HAR data sets show that the proposed network outperforms the state-of-the-art models based on recurrent settings and temporal models.

[1]  Xiaohui Peng,et al.  Deep Learning for Sensor-based Activity Recognition: A Survey , 2017, Pattern Recognit. Lett..

[2]  Tara N. Sainath,et al.  Deep Neural Networks for Acoustic Modeling in Speech Recognition: The Shared Views of Four Research Groups , 2012, IEEE Signal Processing Magazine.

[3]  Jens Lundström,et al.  Efficacy of Imbalanced Data Handling Methods on Deep Learning for Smart Homes Environments , 2020, SN Computer Science.

[4]  Oh-Wook Kwon,et al.  Integrating Dilated Convolution into DenseLSTM for Audio Source Separation , 2021 .

[5]  Jürgen Schmidhuber,et al.  Long Short-Term Memory , 1997, Neural Computation.

[6]  Davide Anguita,et al.  A Public Domain Dataset for Human Activity Recognition using Smartphones , 2013, ESANN.

[7]  Abdelsalam Helal,et al.  From Activity Recognition to Situation Recognition , 2013, ICOST.

[8]  Long Chen,et al.  Recurrent Neural Network for Human Activity Recognition in Smart Home , 2013 .

[9]  Honglak Lee,et al.  Convolutional deep belief networks for scalable unsupervised learning of hierarchical representations , 2009, ICML '09.

[10]  Wei Wang,et al.  Understanding and Modeling of WiFi Signal Based Human Activity Recognition , 2015, MobiCom.

[11]  Matthieu Geist,et al.  Convolutional and Recurrent Neural Networks for Activity Recognition in Smart Environment , 2015, BIRS-IMLKE.

[12]  Qinfeng Shi,et al.  Sensor enabled wearable RFID technology for mitigating the risk of falls near beds , 2013, 2013 IEEE International Conference on RFID (RFID).

[13]  Honglak Lee,et al.  Unsupervised feature learning for audio classification using convolutional deep belief networks , 2009, NIPS.

[14]  Chris D. Nugent,et al.  Ensemble classifier of long short-term memory with fuzzy temporal windows on binary sensors for activity recognition , 2018, Expert Syst. Appl..

[15]  Thomas Plötz,et al.  On attention models for human activity recognition , 2018, UbiComp.

[16]  Amani Ali Ahmed Ali,et al.  Survey on Segmentation and Recognition of Handwritten Arabic Script , 2020, SN Computer Science.

[17]  Gwenn Englebienne,et al.  Human activity recognition from wireless sensor network data: benchmark and software , 2011 .

[18]  Kwok-Wai Hung,et al.  Real-time video super resolution network using recurrent multi-branch dilated convolutions , 2021, Signal Process. Image Commun..

[19]  Takeshi Nishida,et al.  Deep recurrent neural network for mobile human activity recognition with high throughput , 2017, Artificial Life and Robotics.

[20]  Daniel Roggen,et al.  Deep Convolutional and LSTM Recurrent Neural Networks for Multimodal Wearable Activity Recognition , 2016, Sensors.

[21]  Jürgen Schmidhuber,et al.  Framewise phoneme classification with bidirectional LSTM and other neural network architectures , 2005, Neural Networks.

[22]  Faouzi Alaya Cheikh,et al.  Stacked Lstm Network for Human Activity Recognition Using Smartphone Data , 2019, 2019 8th European Workshop on Visual Information Processing (EUVIP).

[23]  Hanyu Wang,et al.  LSTM-CNN Architecture for Human Activity Recognition , 2020, IEEE Access.

[24]  Young-Koo Lee,et al.  Analysis and effects of smart home dataset characteristics for daily life activity recognition , 2013, The Journal of Supercomputing.

[25]  Jens Lundström,et al.  Stability Analysis of the t-SNE Algorithm for Human Activity Pattern Data , 2018, 2018 IEEE International Conference on Systems, Man, and Cybernetics (SMC).

[26]  Nitish Srivastava,et al.  Dropout: a simple way to prevent neural networks from overfitting , 2014, J. Mach. Learn. Res..

[27]  Thomas Plötz,et al.  Ensembles of Deep LSTM Learners for Activity Recognition using Wearables , 2017, Proc. ACM Interact. Mob. Wearable Ubiquitous Technol..

[28]  Jianning Wu,et al.  A Novel Multichannel Dilated Convolution Neural Network for Human Activity Recognition , 2020, Mathematical Problems in Engineering.

[29]  Wai Lok Woo,et al.  Joint Learning of Temporal Models to Handle Imbalanced Data for Human Activity Recognition , 2020, Applied Sciences.

[30]  Araceli Sanchis,et al.  Activity Recognition Using Hybrid Generative/Discriminative Models on Home Environments Using Binary Sensors , 2013, Sensors.

[31]  Kuldip K. Paliwal,et al.  Bidirectional recurrent neural networks , 1997, IEEE Trans. Signal Process..

[32]  Wen-Hui Chen,et al.  Self-Attention Networks for Human Activity Recognition Using Wearable Devices , 2020, 2020 IEEE International Conference on Systems, Man, and Cybernetics (SMC).

[33]  Tara N. Sainath,et al.  Temporal Modeling Using Dilated Convolution and Gating for Voice-Activity-Detection , 2018, 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[34]  Athanasios V. Vasilakos,et al.  GCHAR: An efficient Group-based Context - aware human activity recognition on smartphone , 2017, J. Parallel Distributed Comput..

[35]  Miguel Altuve,et al.  Human Activity Recognition on Smartphones Using a Bidirectional LSTM Network , 2019, 2019 XXII Symposium on Image, Signal Processing and Artificial Vision (STSIVA).

[36]  Chris D. Nugent,et al.  Real-time Recognition of Interleaved Activities Based on Ensemble Classifier of Long Short-Term Memory with Fuzzy Temporal Windows , 2018, UCAmI.

[37]  Ruqiang Yan,et al.  Machine health monitoring with LSTM networks , 2016, 2016 10th International Conference on Sensing Technology (ICST).

[38]  Damith Chinthana Ranasinghe,et al.  Evaluation of Wearable Sensor Tag Data Segmentation Approaches for Real Time Activity Classification in Elderly , 2013, MobiQuitous.

[39]  Ying Wah Teh,et al.  Deep learning algorithms for human activity recognition using mobile and wearable sensor networks: State of the art and research challenges , 2018, Expert Syst. Appl..

[40]  Godwin Ogbuabor,et al.  Human Activity Recognition for Healthcare using Smartphones , 2018, ICMLC.

[41]  Mohamed-Rafik Bouguelia,et al.  Efficient Activity Recognition in Smart Homes Using Delayed Fuzzy Temporal Windows on Binary Sensors , 2020, IEEE Journal of Biomedical and Health Informatics.

[42]  Vladlen Koltun,et al.  Multi-Scale Context Aggregation by Dilated Convolutions , 2015, ICLR.

[43]  Houqiang Li,et al.  Dilated Convolutional Network with Iterative Optimization for Continuous Sign Language Recognition , 2018, IJCAI.

[44]  Damith Chinthana Ranasinghe,et al.  Recognising Activities in Real Time Using Body Worn Passive Sensors With Sparse Data Streams: To Interpolate or Not To Interpolate? , 2015, MobiQuitous.

[45]  Vladlen Koltun,et al.  An Empirical Evaluation of Generic Convolutional and Recurrent Networks for Sequence Modeling , 2018, ArXiv.

[46]  Ling Pei,et al.  Weakly Supervised Human Activity Recognition From Wearable Sensors by Recurrent Attention Learning , 2019, IEEE Sensors Journal.

[47]  Davide Anguita,et al.  Transition-Aware Human Activity Recognition Using Smartphones , 2016, Neurocomputing.

[48]  David Ha,et al.  long short term memory , 2015 .

[49]  Yoshua Bengio,et al.  Deep Learning of Representations: Looking Forward , 2013, SLSP.