Towards multimodal deep learning for activity recognition on mobile devices

Current smartphones and smartwatches come equipped with a variety of sensors, from light and inertial sensors to radio interfaces, enabling applications running on these devices to make sense of their surrounding environment. Rather than using each sensor independently, combining their sensing capabilities allows more interesting and complex applications to emerge (e.g., user activity recognition). However, differences between sensors, ranging from sampling rate to data-generation model (event-triggered or continuous sampling), make the integration of sensor streams challenging. Here we investigate the opportunity to use deep learning to integrate sensor data from multiple sensors. The intuition is that neural networks can identify non-intuitive features, largely from cross-sensor correlations, which can result in more accurate estimation. Initial results with a variant of a Restricted Boltzmann Machine (RBM) show better performance for this new approach compared to classic solutions.
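
To make the idea of RBM-based multimodal fusion concrete, the sketch below shows one possible setup, not the exact model used in this work: per-window features from two sensor streams are concatenated ("early fusion") so that a scikit-learn BernoulliRBM can learn hidden features capturing cross-sensor correlations, which are then fed to a simple classifier. The sensor names, feature dimensions, number of windows, and activity classes are placeholder assumptions for illustration only.

```python
# Illustrative sketch only: fuse features from two sensor streams with an RBM
# before classification. This is NOT the model from the paper; dimensions,
# window counts, and labels below are placeholder assumptions.
import numpy as np
from sklearn.neural_network import BernoulliRBM
from sklearn.linear_model import LogisticRegression
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import MinMaxScaler

rng = np.random.default_rng(0)

# Placeholder per-window features: 1000 windows, 30 accelerometer features
# and 30 gyroscope features per window (e.g., statistics over a time window).
acc_features = rng.normal(size=(1000, 30))
gyro_features = rng.normal(size=(1000, 30))
labels = rng.integers(0, 5, size=1000)  # 5 hypothetical activity classes

# Early fusion: concatenate the per-sensor feature vectors so the RBM's
# hidden units can model correlations across sensors, not just within one.
X = np.hstack([acc_features, gyro_features])

model = Pipeline([
    ("scale", MinMaxScaler()),          # BernoulliRBM expects values in [0, 1]
    ("rbm", BernoulliRBM(n_components=64, learning_rate=0.05,
                         n_iter=20, random_state=0)),
    ("clf", LogisticRegression(max_iter=1000)),
])
model.fit(X, labels)
print("training accuracy:", model.score(X, labels))
```

In this kind of pipeline the RBM layer acts as an unsupervised feature extractor over the fused input, which is one way the cross-sensor correlations mentioned above could be captured before a conventional classifier is applied.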