CODE: Compact IoT Data Collection with Precise Matrix Sampling and Efficient Inference

It is unpractical to conduct full-size data collection in ubiquitous IoT data systems due to the energy constraints of IoT sensors and large system scales. Although sparse sensing technologies have been proposed to infer missing data based on partial sampled data, they usually focus on data inference while neglecting the sampling process, restraining the inference efficiency. In addition, their inferring methods highly depend on data linearity correlations, which become less effective when data are not linearly correlated. In this paper, we propose, Compact IOT Data CollEction, namely CODE, to conduct precise data matrix sampling and efficient inference. Particularly, CODE integrates two major components, i.e., cluster-based matrix sampling and Generative Adversarial Networks (GAN)-based matrix inference, to reduce the data collection cost and guarantee the data benefits, respectively. In the sampling component, a cluster-based sampling approach is devised, in which data clustering is first conducted and then a two-step sampling is performed in accordance with the number of clusters and clustering errors. For the inference component, a GAN-based model is developed to estimate the full matrix, which consists of a generator network that learns to generate a fake matrix, and a discriminator network that learns to discriminate the fake matrix from the real one. A reference implementation of CODE is conducted under three operational large-scale IoT systems, and extensive data-driven experiment results are provided to demonstrate its efficiency and robustness.

[1]  Jianqiang Huang,et al.  Large-scale vehicle trajectory reconstruction with camera sensing network , 2021, MobiCom.

[2]  Desheng Zhang,et al.  Nationwide Deployment and Operation of a Virtual Arrival Detection System in the Wild , 2021, IEEE/ACM Transactions on Networking.

[3]  Ting-Zhu Huang,et al.  Fully-Connected Tensor Network Decomposition and Its Application to Higher-Order Tensor Completion , 2021, AAAI.

[4]  Yingying Chen,et al.  WiFi-Enabled User Authentication through Deep Learning in Daily Activities , 2021, ACM Trans. Internet Things.

[5]  Christophe Osswald,et al.  All-in-one: Toward hybrid data collection and energy saving mechanism in sensing-based IoT applications , 2021, Peer-to-Peer Networking and Applications.

[6]  I. Mezić,et al.  Data-driven analysis and forecasting of highway traffic dynamics , 2020, Nature Communications.

[7]  Desheng Zhang,et al.  Mobile Cyber-Physical Systems for Smart Cities , 2020, WWW.

[8]  Yang Wang,et al.  VeMo: Enabling Transparent Vehicular Mobility Modeling at Individual Levels with Full Penetration , 2018, MobiCom.

[9]  Ivan Markovsky,et al.  Low Rank Approximation - Algorithms, Implementation, Applications , 2018, Communications and Control Engineering.

[10]  Xiao Han,et al.  SPACE-TA , 2017, ACM Trans. Intell. Syst. Technol..

[11]  Fahim Arif,et al.  Smart urban planning using big data analytics based internet of things , 2017, UbiComp/ISWC Adjunct.

[12]  Tat-Seng Chua,et al.  Neural Collaborative Filtering , 2017, WWW.

[13]  Yu-Bin Yang,et al.  Image Restoration Using Very Deep Convolutional Encoder-Decoder Networks with Symmetric Skip Connections , 2016, NIPS.

[14]  Soumith Chintala,et al.  Unsupervised Representation Learning with Deep Convolutional Generative Adversarial Networks , 2015, ICLR.

[15]  Daqing Zhang,et al.  CCS-TA: quality-guaranteed online task allocation in compressive crowdsensing , 2015, UbiComp.

[16]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[17]  Yin Zhang,et al.  Robust network compressive sensing , 2014, MobiCom.

[18]  Aaron C. Courville,et al.  Generative Adversarial Nets , 2014, NIPS.

[19]  Barnabás Póczos,et al.  Active learning and search on low-rank matrices , 2013, KDD.

[20]  Lawrence Carin,et al.  Active learning for online bayesian matrix factorization , 2012, KDD.

[21]  Geoffrey E. Hinton,et al.  Rectified Linear Units Improve Restricted Boltzmann Machines , 2010, ICML.

[22]  Yoshua Bengio,et al.  Understanding the difficulty of training deep feedforward neural networks , 2010, AISTATS.

[23]  Emmanuel J. Candès,et al.  Exact Matrix Completion via Convex Optimization , 2008, Found. Comput. Math..

[24]  Andrew L. Maas Rectifier Nonlinearities Improve Neural Network Acoustic Models , 2013 .

[25]  E.J. Candes Compressive Sampling , 2022 .