Heterogeneous Sensor Data Fusion By Deep Multimodal Encoding

Heterogeneous sensor data fusion is a challenging field that has gathered significant interest in recent years. Two of these challenges are learning from data with missing values, and finding shared representations for multimodal data to improve inference and prediction. In this paper, we propose a multimodal data fusion framework, the deep multimodal encoder (DME), based on deep learning techniques for sensor data compression, missing data imputation, and new modality prediction under multimodal scenarios. While traditional methods capture only the intramodal correlations, DME is able to mine both the intramodal correlations in the initial layers and the enhanced intermodal correlations in the deeper layers. In this way, the statistical structure of sensor data may be better exploited for data compression. By incorporating our new objective function, DME shows remarkable ability for missing data imputation tasks in sensor data. The shared multimodal representation learned by DME may be used directly for predicting new modalities. In experiments with a real-world dataset collected from a 40-node agriculture sensor network which contains three modalities, DME can achieve a root mean square error (RMSE) of missing data imputation which is only 20% of the traditional methods like K-nearest neighbors and sparse principal component analysis and the performance is robust to different missing rates. It can also reconstruct temperature modality from humidity and illuminance with an RMSE of $7\; {}^{\circ }$C, directly from a highly compressed (2.1%) shared representation that was learned from incomplete (80% missing) data.

[1]  Pramod K. Varshney,et al.  A New Framework for Distributed Detection With Conditionally Dependent Observations , 2012, IEEE Transactions on Signal Processing.

[2]  Francis R. Bach,et al.  Structured Sparse Principal Component Analysis , 2009, AISTATS.

[3]  Venkatesh Saligrama,et al.  Distributed Detection in Sensor Networks With Limited Range Multimodal Sensors , 2007, IEEE Transactions on Signal Processing.

[4]  Hwee Pink Tan,et al.  Machine Learning in Wireless Sensor Networks: Algorithms, Strategies, and Applications , 2014, IEEE Communications Surveys & Tutorials.

[5]  S. Frick,et al.  Compressed Sensing , 2014, Computer Vision, A Reference Guide.

[6]  Emmanuel J. Candès,et al.  Robust uncertainty principles: exact signal reconstruction from highly incomplete frequency information , 2004, IEEE Transactions on Information Theory.

[7]  Venkatesh Saligrama,et al.  One-Bit Distributed Sensing and Coding for Field Estimation in Sensor Networks , 2007, IEEE Transactions on Signal Processing.

[8]  Vince D. Calhoun,et al.  Canonical Correlation Analysis for Data Fusion and Group Inferences , 2010, IEEE Signal Processing Magazine.

[9]  Xue Liu,et al.  Data loss and reconstruction in sensor networks , 2013, 2013 Proceedings IEEE INFOCOM.

[10]  Ian F. Akyildiz,et al.  Wireless sensor networks: a survey , 2002, Comput. Networks.

[11]  Geoffrey E. Hinton,et al.  Reducing the Dimensionality of Data with Neural Networks , 2006, Science.

[12]  Peter E. Hart,et al.  Nearest neighbor pattern classification , 1967, IEEE Trans. Inf. Theory.

[13]  Juhan Nam,et al.  Multimodal Deep Learning , 2011, ICML.

[14]  Tianrui Li,et al.  ST-MVL: Filling Missing Values in Geo-Sensory Time Series Data , 2016, IJCAI.

[15]  Guigang Zhang,et al.  Deep Learning , 2016, Int. J. Semantic Comput..

[16]  Nitish Srivastava,et al.  Dropout: a simple way to prevent neural networks from overfitting , 2014, J. Mach. Learn. Res..

[17]  Fakhri Karray,et al.  Multisensor data fusion: A review of the state-of-the-art , 2013, Inf. Fusion.

[18]  Huiling Chen,et al.  Imputing missing values in sensor networks using sparse data representations , 2014, MSWiM '14.

[19]  Ji Wan,et al.  Deep Learning for Content-Based Image Retrieval: A Comprehensive Study , 2014, ACM Multimedia.

[20]  Yoshua Bengio,et al.  Random Search for Hyper-Parameter Optimization , 2012, J. Mach. Learn. Res..

[21]  Pascal Bianchi,et al.  Linear Precoders for the Detection of a Gaussian Process in Wireless Sensors Networks , 2011, IEEE Transactions on Signal Processing.

[22]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[23]  Pascal Vasseur,et al.  Introduction to Multisensor Data Fusion , 2005, The Industrial Information Technology Handbook.

[24]  Linghe Kong,et al.  Optimizing the Spatio-temporal Distribution of Cyber-Physical Systems for Environment Abstraction , 2010, 2010 IEEE 30th International Conference on Distributed Computing Systems.

[25]  Beng Chin Ooi,et al.  Effective deep learning-based multi-modal retrieval , 2015, The VLDB Journal.

[26]  Aylin Yener,et al.  Maximizing Quality of Information From Multiple Sensor Devices: The Exploration vs Exploitation Tradeoff , 2013, IEEE Journal of Selected Topics in Signal Processing.

[27]  Honglak Lee,et al.  An Analysis of Single-Layer Networks in Unsupervised Feature Learning , 2011, AISTATS.

[28]  Daniel Roggen,et al.  Deep Convolutional and LSTM Recurrent Neural Networks for Multimodal Wearable Activity Recognition , 2016, Sensors.