CORE-lens: simultaneous communication and object recognition with disentangled-GAN cameras

Optical camera communication (OCC) enabled by LED and embedded cameras has attracted extensive attention, thanks to its rich spectrum availability and ready deployability. However, the close interactions between OCC and the indoor spaces have created two major challenges. On one hand, the stripe pattern incurred by OCC may greatly damage the accuracy of image-based object recognition. On the other hand, the patterns inherent to indoor spaces can significantly degrade the decoding performance of reflected OCC. To this end, we propose CORE-Lens as a pipeline to make the mutual interference transparent to existing OR and OCC algorithms. Essentially, CORE-Lens treats the two challenges as two sides of a signal mixture issue: the signals transmitted by OCC get mixed with background images so well that their features become entangled. Consequently, CORE-Lens exploits the idea of disentangled representation learning to separate the mixed signals in the feature space: while the GAN-reconstructed clean background images are used to perform object recognition, OCC decoding is conducted on the residual of the original image after subtracting the reconstructed background. Our extensive experiments on evaluating the real-life performance of CORE-Lens evidently demonstrate its superiority over conventional approaches.

[1]  Sanglu Lu,et al.  MoiréPose: ultra high precision camera-to-screen pose estimation based on Moiré pattern , 2022, MobiCom.

[2]  Xuanzhe Liu,et al.  The Case for FPGA-Based Edge Computing , 2022, IEEE Transactions on Mobile Computing.

[3]  Soheil Feizi,et al.  A Comprehensive Study of Image Classification Model Sensitivity to Foregrounds, Backgrounds, and Visual Attributes , 2022, 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[4]  Shujie Zhang,et al.  MoRe-Fi: Motion-robust and Fine-grained Respiration Monitoring via Deep-Learning UWB Radar , 2021, SenSys.

[5]  Jun Luo,et al.  RF-Based Human Activity Recognition Using Signal Adapted Convolutional Neural Network , 2021, IEEE Transactions on Mobile Computing.

[6]  Jun Luo,et al.  MoVi-Fi: motion-robust vital signs waveform recovery via deep interpreted RF sensing , 2021, MobiCom.

[7]  Liangyin Chen,et al.  Improved optical camera communication systems using a freeform lens. , 2021, Optics express.

[8]  Vicente Matus,et al.  Convolutional autoencoder for exposure effects equalization and noise mitigation in optical camera communication. , 2021, Optics express.

[9]  Xuefen Chi,et al.  Non-line-of-sight optical camera communication aided by a pilot. , 2021, Optics letters.

[10]  Chi-Wai Chow,et al.  Display Light Panel and Rolling Shutter Image Sensor Based Optical Camera Communication (OCC) Using Frame-Averaging Background Removal and Neural Network , 2021, Journal of Lightwave Technology.

[11]  Lian-Kuan Chen,et al.  Li-poster: Real-time Non-line-of-sight Optical Camera Communication for Hand-held Smartphone Applications , 2021, 2021 Optical Fiber Communications Conference and Exhibition (OFC).

[12]  Chi-Wai Chow,et al.  Z-Score Averaging Neural Network and Background Content Removal for High Performance Rolling Shutter based Optical Camera Communication (OCC) , 2021, 2021 Optical Fiber Communications Conference and Exhibition (OFC).

[13]  Yanbing Yang,et al.  Pushing the Data Rate of Practical VLC via Combinatorial Light Emission , 2021, IEEE Transactions on Mobile Computing.

[14]  Xiaojun Jing,et al.  Integrating Sensing and Communications for Ubiquitous IoT: Applications, Trends, and Challenges , 2021, IEEE Network.

[15]  Amruta Pai,et al.  HRVCam: robust camera-based measurement of heart rate variability , 2021, Journal of biomedical optics.

[16]  Jun Luo,et al.  RF-net: a unified meta-learning framework for RF-enabled one-shot human activity recognition , 2020, SenSys.

[17]  Chenren Xu,et al.  Turboboosting Visible Light Backscatter Communication , 2020, SIGCOMM.

[18]  Yunxin Liu,et al.  SCYLLA: QoE-aware Continuous Mobile Vision with FPGA-based Dynamic Deep Neural Network Reconfiguration , 2020, IEEE INFOCOM 2020 - IEEE Conference on Computer Communications.

[19]  Ben Y. Zhao,et al.  Backdoor Attacks Against Deep Learning Systems in the Physical World , 2020, 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[20]  Aleksander Madry,et al.  Noise or Signal: The Role of Image Backgrounds in Object Recognition , 2020, ICLR.

[21]  Xuanzhe Liu,et al.  Renovating road signs for infrastructure-to-vehicle networking: a visible light backscatter communication and networking approach , 2020, MobiCom.

[22]  Alberto Quattrini Li,et al.  PolarTag: Invisible Data with Light Polarization , 2020, HotMobile.

[23]  Chi-Wai Chow,et al.  CMOS camera based visible light communication (VLC) using grayscale value distribution and machine learning algorithm. , 2020, Optics express.

[24]  Natalia Gimelshein,et al.  PyTorch: An Imperative Style, High-Performance Deep Learning Library , 2019, NeurIPS.

[25]  B. S. Manjunath,et al.  Caesar: cross-camera complex activity recognition , 2019, SenSys.

[26]  Guoying Zhao,et al.  Remote Heart Rate Measurement From Highly Compressed Facial Videos: An End-to-End Deep Learning Solution With Video Enhancement , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[27]  Mauro Biagi,et al.  Non-Line-of-Sight MIMO Space-Time Division Multiplexing Visible Light Optical Camera Communications , 2019, Journal of Lightwave Technology.

[28]  Chao Cai,et al.  Ubiquitous Acoustic Sensing on Commodity IoT Devices: A Survey , 2019, IEEE Communications Surveys & Tutorials.

[29]  Nasir Saeed,et al.  Optical camera communications: Survey, use cases, challenges, and future trends , 2018, Phys. Commun..

[30]  Liqiong Liu,et al.  A Frame Averaging based Signal Tracing (FAST) Algorithm for Optical Camera Communications , 2018, 2018 Asia Communications and Photonics Conference (ACP).

[31]  Daniel McDuff,et al.  DeepPhys: Video-Based Physiological Measurement Using Convolutional Attention Networks , 2018, ECCV.

[32]  Jun Luo,et al.  Boosting the Throughput of LED-Camera VLC via Composite Light Emission , 2018, IEEE INFOCOM 2018 - IEEE Conference on Computer Communications.

[33]  Jun Luo,et al.  CeilingTalk: Lightweight Indoor Broadcast Through LED-Camera Communication , 2017, IEEE Transactions on Mobile Computing.

[34]  Abhishek Kumar,et al.  Variational Inference of Disentangled Latent Concepts from Unlabeled Observations , 2017, ICLR.

[35]  Jun Luo,et al.  ReflexCode: Coding with Superposed Reflection Light for LED-Camera Communication , 2017, MobiCom.

[36]  Guobin Shen,et al.  PassiveVLC: Enabling Practical Visible Light Backscatter Communication for Battery-free IoT Applications , 2017, MobiCom.

[37]  Chi Zhang,et al.  Automating Visual Privacy Protection Using a Smart LED , 2017, MobiCom.

[38]  Yingying Chen,et al.  Secret key distribution leveraging color shift over visible light channel , 2017, 2017 IEEE Conference on Communications and Network Security (CNS).

[39]  Lukasz Kaiser,et al.  Attention is All you Need , 2017, NIPS.

[40]  Siddharth Krishna Kumar,et al.  On weight initialization in deep neural networks , 2017, ArXiv.

[41]  Jun Luo,et al.  CeilingSee: Device-free occupancy inference through lighting infrastructure based LED sensing , 2017, 2017 IEEE International Conference on Pervasive Computing and Communications (PerCom).

[42]  Alexei A. Efros,et al.  Image-to-Image Translation with Conditional Adversarial Networks , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[43]  Christopher Burgess,et al.  beta-VAE: Learning Basic Visual Concepts with a Constrained Variational Framework , 2016, ICLR 2016.

[44]  Kilian Q. Weinberger,et al.  Densely Connected Convolutional Networks , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[45]  Kris M. Kitani,et al.  Going Deeper into First-Person Activity Recognition , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[46]  Jun Luo,et al.  CeilingCast: Energy efficient and location-bound broadcast through LED-camera communication , 2016, IEEE INFOCOM 2016 - The 35th Annual IEEE International Conference on Computer Communications.

[47]  Archan Misra,et al.  LightSense: Exploiting smart bulbs for practical multimodal localization , 2016, 2016 IEEE International Conference on Pervasive Computing and Communication Workshops (PerCom Workshops).

[48]  David M. Blei,et al.  Variational Inference: A Review for Statisticians , 2016, ArXiv.

[49]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[50]  Kate Ching-Ju Lin,et al.  RollingLight: Enabling Line-of-Sight Light-to-Camera Communications , 2015, MobiSys.

[51]  Dumitru Erhan,et al.  Going deeper with convolutions , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[52]  Andrew Zisserman,et al.  Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.

[53]  Aaron C. Courville,et al.  Generative Adversarial Nets , 2014, NIPS.

[54]  Andrew Zisserman,et al.  Deep Inside Convolutional Networks: Visualising Image Classification Models and Saliency Maps , 2013, ICLR.

[55]  Max Welling,et al.  Auto-Encoding Variational Bayes , 2013, ICLR.

[56]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[57]  Harald Haas,et al.  Using a CMOS camera sensor for visible light communication , 2012, 2012 IEEE Globecom Workshops.

[58]  Pascal Vincent,et al.  Representation Learning: A Review and New Perspectives , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[59]  Peter Harremoës,et al.  Rényi Divergence and Kullback-Leibler Divergence , 2012, IEEE Transactions on Information Theory.

[60]  Léon Bottou,et al.  Stochastic Gradient Descent Tricks , 2012, Neural Networks: Tricks of the Trade.