RF-net: a unified meta-learning framework for RF-enabled one-shot human activity recognition

Radio-Frequency (RF) based device-free Human Activity Recognition (HAR) rises as a promising solution for many applications. However, device-free (or contactless) sensing is often more sensitive to environment changes than device-based (or wearable) sensing. Also, RF datasets strictly require on-line labeling during collection, starkly different from image and text data collections where human interpretations can be leveraged to perform off-line labeling. Therefore, existing solutions to RF-HAR entail a laborious data collection process for adapting to new environments. To this end, we propose RF-Net as a meta-learning based approach to one-shot RF-HAR; it reduces the labeling efforts for environment adaptation to the minimum level. In particular, we first examine three representative RF sensing techniques and two major meta-learning approaches. The results motivate us to innovate in two designs: i) a dual-path base HAR network, where both time and frequency domains are dedicated to learning powerful RF features including spatial and attention-based temporal ones, and ii) a metric-based meta-learning framework to enhance the fast adaption capability of the base network, including an RF-specific metric module along with a residual classification module. We conduct extensive experiments based on all three RF sensing techniques in multiple real-world indoor environments; all results strongly demonstrate the efficacy of RF-Net compared with state-of-the-art baselines.

[1]  Tae-Seong Kim,et al.  Depth video-based human activity recognition system using translation and scaling invariant features for life logging at smart home , 2012, IEEE Transactions on Consumer Electronics.

[2]  Chuang Gan,et al.  TSM: Temporal Shift Module for Efficient Video Understanding , 2018, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[3]  David Wetherall,et al.  Tool release: gathering 802.11n traces with channel state information , 2011, CCRV.

[4]  Sung Ho Cho,et al.  IR-UWB Radar Sensor for Human Gesture Recognition by Using Machine Learning , 2016, 2016 IEEE 18th International Conference on High Performance Computing and Communications; IEEE 14th International Conference on Smart City; IEEE 2nd International Conference on Data Science and Systems (HPCC/SmartCity/DSS).

[5]  Muhammad Shahzad,et al.  Position and Orientation Agnostic Gesture Recognition Using WiFi , 2017, MobiSys.

[6]  Lina Yao,et al.  Multi-modality Sensor Data Classification with Selective Attention , 2018, IJCAI.

[7]  Wei Wang,et al.  Understanding and Modeling of WiFi Signal Based Human Activity Recognition , 2015, MobiCom.

[8]  Chenglin Miao,et al.  Towards Environment Independent Device Free Human Activity Recognition , 2018, MobiCom.

[9]  Byoung-Tak Zhang,et al.  Bilinear Attention Networks , 2018, NeurIPS.

[10]  Xiangyu Zhang,et al.  ShuffleNet: An Extremely Efficient Convolutional Neural Network for Mobile Devices , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[11]  Muhammad Shahzad,et al.  Multi-User Gesture Recognition Using WiFi , 2018, MobiSys.

[12]  J. Schulman,et al.  Reptile: a Scalable Metalearning Algorithm , 2018 .

[13]  Fadel Adib,et al.  Multi-Person Localization via RF Body Reflections , 2015, NSDI.

[14]  Shuangquan Wang,et al.  SignFi , 2018, Proc. ACM Interact. Mob. Wearable Ubiquitous Technol..

[15]  Rong Zheng,et al.  SST: Software Sonic Thermometer on Acoustic-enabled IoT Devices , 2020 .

[16]  Jian-Huang Lai,et al.  Jointly Learning Heterogeneous Features for RGB-D Activity Recognition , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[17]  Khaled A. Harras,et al.  A Ubiquitous WiFi-Based Fine-Grained Gesture Recognition System , 2019, IEEE Transactions on Mobile Computing.

[18]  Kris M. Kitani,et al.  Going Deeper into First-Person Activity Recognition , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[19]  Sergey Levine,et al.  Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks , 2017, ICML.

[20]  Yunhao Liu,et al.  Pervasive Floorplan Generation Based on Only Inertial Sensing: Feasibility, Design, and Implementation , 2017, IEEE Journal on Selected Areas in Communications.

[21]  Antonio Torralba,et al.  RF-based 3D skeletons , 2018, SIGCOMM.

[22]  Yichen Wei,et al.  Towards 3D Human Pose Estimation in the Wild: A Weakly-Supervised Approach , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[23]  Antonio Torralba,et al.  Through-Wall Human Pose Estimation Using Radio Signals , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[24]  Kamin Whitehouse,et al.  The hitchhiker's guide to successful residential sensing deployments , 2011, SenSys.

[25]  Hang Li,et al.  Meta-SGD: Learning to Learn Quickly for Few Shot Learning , 2017, ArXiv.

[26]  Wei Cui,et al.  WiFi CSI Based Passive Human Activity Recognition Using Attention Based BLSTM , 2019, IEEE Transactions on Mobile Computing.

[27]  Sangki Yun,et al.  Indoor Follow Me Drone , 2017, MobiSys.

[28]  Andrew Zisserman,et al.  Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.

[29]  Shyamnath Gollakota,et al.  MilliSonic: Pushing the Limits of Acoustic Motion Tracking , 2019, CHI.

[30]  Paolo Napoletano,et al.  Benchmark Analysis of Representative Deep Neural Network Architectures , 2018, IEEE Access.

[31]  Yusheng Ji,et al.  RF-Sensing of Activities from Non-Cooperative Subjects in Device-Free Recognition Systems Using Ambient and Local Signals , 2014, IEEE Transactions on Mobile Computing.

[32]  Jun Luo,et al.  ReflexCode: Coding with Superposed Reflection Light for LED-Camera Communication , 2017, MobiCom.

[33]  Guoliang Xing,et al.  iSleep: unobtrusive sleep quality monitoring using smartphones , 2013, SenSys '13.

[34]  Natalia Gimelshein,et al.  PyTorch: An Imperative Style, High-Performance Deep Learning Library , 2019, NeurIPS.

[35]  Xiang Li,et al.  Device-Free WiFi Human Sensing: From Pattern-Based to Model-Based Approaches , 2017, IEEE Communications Magazine.

[36]  Mubarak Shah,et al.  Task Agnostic Meta-Learning for Few-Shot Learning , 2018, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[37]  Xu Zhang,et al.  V2iFi , 2020, Proc. ACM Interact. Mob. Wearable Ubiquitous Technol..

[38]  Sergey Ioffe,et al.  Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift , 2015, ICML.

[39]  Song Han,et al.  Temporal Shift Module for Efficient Video Understanding , 2018, ArXiv.

[40]  Richard S. Zemel,et al.  Prototypical Networks for Few-shot Learning , 2017, NIPS.

[41]  Jun Luo,et al.  Counting via LED sensing: Inferring occupancy using lighting infrastructure , 2018, Pervasive Mob. Comput..

[42]  Volker Kempe,et al.  Inertial MEMS: Principles and Practice , 2011 .

[43]  Jinwoo Shin,et al.  MetaSense: few-shot adaptation to untrained conditions in deep mobile sensing , 2019, SenSys.

[44]  Xin Liu,et al.  EAR: Exploiting Uncontrollable Ambient RF Signals in Heterogeneous Networks for Gesture Recognition , 2018, SenSys.

[45]  Marco Gruteser,et al.  Capacitive Touch Communication: A Technique to Input Data through Devices' Touch Screen , 2014, IEEE Transactions on Mobile Computing.

[46]  Jin Woo Kim,et al.  A Hand Gesture Recognition Sensor Using Reflected Impulses , 2017, IEEE Sensors Journal.

[47]  Bernt Schiele,et al.  Meta-Transfer Learning for Few-Shot Learning , 2018, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[48]  Feng Li,et al.  iLocScan: harnessing multipath for simultaneous indoor source localization and space scanning , 2014, SenSys.

[49]  Razvan Pascanu,et al.  Meta-Learning with Latent Embedding Optimization , 2018, ICLR.

[50]  Stephan Schraml,et al.  Embedded fall detection with a neural network and bio-inspired stereo vision , 2012, 2012 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops.

[51]  Yu-Chiang Frank Wang,et al.  A Closer Look at Few-shot Classification , 2019, ICLR.

[52]  Jun Luo,et al.  CeilingSee: Device-free occupancy inference through lighting infrastructure based LED sensing , 2017, 2017 IEEE International Conference on Pervasive Computing and Communications (PerCom).

[53]  Oriol Vinyals,et al.  Matching Networks for One Shot Learning , 2016, NIPS.

[54]  Zhe Chen,et al.  AWL: Turning Spatial Aliasing From Foe to Friend for Accurate WiFi Localization , 2017, CoNEXT.

[55]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[56]  Qin Lv,et al.  CapBand: Battery-free Successive Capacitance Sensing Wristband for Hand Gesture Recognition , 2018, SenSys.

[57]  Yunhao Liu,et al.  Zero-Effort Cross-Domain Gesture Recognition with Wi-Fi , 2019, MobiSys.

[58]  Guoliang Xing,et al.  MindfulWatch: A Smartwatch-Based System For Real-Time Respiration Monitoring During Meditation , 2017, Proc. ACM Interact. Mob. Wearable Ubiquitous Technol..

[59]  Kaishun Wu,et al.  WiFall: Device-free fall detection by wireless networks , 2017, IEEE INFOCOM 2014 - IEEE Conference on Computer Communications.

[60]  Bassem Mahafza,et al.  Radar Systems Analysis and Design Using MATLAB , 2000 .

[61]  Mani B. Srivastava,et al.  UWHear: through-wall extraction and separation of audio vibrations using wireless signals , 2020, SenSys.

[62]  Daan Wierstra,et al.  Meta-Learning with Memory-Augmented Neural Networks , 2016, ICML.

[63]  B. S. Manjunath,et al.  Caesar: cross-camera complex activity recognition , 2019, SenSys.

[64]  Guoliang Xing,et al.  FamilyLog: Monitoring Family Mealtime Activities by Mobile Devices , 2020, IEEE Transactions on Mobile Computing.

[65]  Guoliang Xing,et al.  PBN: towards practical activity recognition using smartphone-based body sensor networks , 2011, SenSys.

[66]  Hao He,et al.  RF-Based Fall Monitoring Using Convolutional Neural Networks , 2018, Proc. ACM Interact. Mob. Wearable Ubiquitous Technol..

[67]  Jun Luo,et al.  CeilingCast: Energy efficient and location-bound broadcast through LED-camera communication , 2016, IEEE INFOCOM 2016 - The 35th Annual IEEE International Conference on Computer Communications.

[68]  Gregory R. Koch,et al.  Siamese Neural Networks for One-Shot Image Recognition , 2015 .

[69]  Zheng Wang,et al.  CrossSense: Towards Cross-Site and Large-Scale WiFi Sensing , 2018, MobiCom.

[70]  Kingshuk Chakravarty,et al.  Unobtrusive indoor surveillance of patients at home using multiple Kinect sensors , 2013, SenSys '13.

[71]  Minglu Li,et al.  BreathListener: Fine-grained Breathing Monitoring in Driving Environments Utilizing Acoustic Signals , 2019, MobiSys.

[72]  Mike Fraser,et al.  EchoFlex: Hand Gesture Recognition using Ultrasound Imaging , 2017, CHI.

[73]  Wei Wang,et al.  Keystroke Recognition Using WiFi Signals , 2015, MobiCom.

[74]  Jun Luo,et al.  AcuTe: acoustic thermometer empowered by a single smartphone , 2020, SenSys.

[75]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[76]  Yoshua Bengio,et al.  Gradient-based learning applied to document recognition , 1998, Proc. IEEE.