TinyRadarNN: Combining Spatial and Temporal Convolutional Neural Networks for Embedded Gesture Recognition with Short Range Radars

This work proposes a low-power high-accuracy embedded hand-gesture recognition algorithm targeting battery-operated wearable devices using low power short-range RADAR sensors. A 2D Convolutional Neural Network (CNN) using range frequency Doppler features is combined with a Temporal Convolutional Neural Network (TCN) for time sequence prediction. The final algorithm has a model size of only 46 thousand parameters, yielding a memory footprint of only 92 KB. Two datasets containing 11 challenging hand gestures performed by 26 different people have been recorded containing a total of 20,210 gesture instances. On the 11 hand gesture dataset, accuracies of 86.6% (26 users) and 92.4% (single user) have been achieved, which are comparable to the state-of-the-art, which achieves 87% (10 users) and 94% (single user), while using a TCN-based network that is 7500x smaller than the state-of-the-art. Furthermore, the gesture recognition classifier has been implemented on Parallel Ultra-Low Power Processor, demonstrating that real-time prediction is feasible with only 21 mW of power consumption for the full TCN sequence prediction network.

[1]  Michele Magno,et al.  DeepEmote: Towards multi-layer neural networks in a low power wearable multi-sensors bracelet , 2017, 2017 7th IEEE International Workshop on Advances in Sensors and Interfaces (IWASI).

[2]  E. Sasikala,et al.  A Survey on Hand Gesture Recognition Systems , 2018 .

[3]  Jan M. Rabaey,et al.  A Robust and Energy-Efficient Classifier Using Brain-Inspired Hyperdimensional Computing , 2016, ISLPED.

[4]  Hanchao Yu,et al.  Learning Effective Spatial–Temporal Features for sEMG Armband-Based Gesture Recognition , 2020, IEEE Internet of Things Journal.

[5]  Ivan Poupyrev,et al.  Interacting with Soli: Exploring Fine-Grained Dynamic Gesture Recognition in the Radio-Frequency Spectrum , 2016, UIST.

[6]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[7]  Alina Delia Calin,et al.  Gesture Recognition on Kinect Time Series Data Using Dynamic Time Warping and Hidden Markov Models , 2016, 2016 18th International Symposium on Symbolic and Numeric Algorithms for Scientific Computing (SYNASC).

[8]  Giancarlo Fortino,et al.  A framework for collaborative computing and multi-sensor data fusion in body sensor networks , 2015, Inf. Fusion.

[9]  Vikas Chandra,et al.  CMSIS-NN: Efficient Neural Network Kernels for Arm Cortex-M CPUs , 2018, ArXiv.

[10]  Alexandra Neukum,et al.  A Human-Machine Interface for Cooperative Highly Automated Driving , 2017 .

[11]  Austin Reiter,et al.  Interpretable 3D Human Action Analysis with Temporal Convolutional Networks , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[12]  Stefan Poslad,et al.  Temporal Convolutional Networks for Multiperson Activity Recognition Using a 2-D LIDAR , 2020, IEEE Internet of Things Journal.

[13]  Luca Benini,et al.  CBinfer: Exploiting Frame-to-Frame Locality for Faster Convolutional Network Inference on Video Streams , 2018, IEEE Transactions on Circuits and Systems for Video Technology.

[14]  Sebastian Böck,et al.  Temporal convolutional networks for musical audio beat tracking , 2019, 2019 27th European Signal Processing Conference (EUSIPCO).

[15]  Yuliang Sun,et al.  Automatic Radar-based Gesture Detection and Classification via a Region-based Deep Convolutional Neural Network , 2019, ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[16]  Richard Hans Robert Hahnloser,et al.  Digital selection and analogue amplification coexist in a cortex-inspired silicon circuit , 2000, Nature.

[17]  Jürgen Schmidhuber,et al.  Long Short-Term Memory , 1997, Neural Computation.

[18]  Khaled A. Harras,et al.  WiGest: A ubiquitous WiFi-based gesture recognition system , 2014, 2015 IEEE Conference on Computer Communications (INFOCOM).

[19]  Shwetak N. Patel,et al.  Whole-home gesture recognition using wireless signals , 2013, MobiCom.

[20]  Jin Woo Kim,et al.  A Hand Gesture Recognition Sensor Using Reflected Impulses , 2017, IEEE Sensors Journal.

[21]  Liang Zhang,et al.  Device-Free Human Gesture Recognition With Generative Adversarial Networks , 2020, IEEE Internet of Things Journal.

[22]  Avik Santra,et al.  Radar Gesture Recognition System in Presence of Interference using Self-Attention Neural Network , 2019, 2019 18th IEEE International Conference On Machine Learning And Applications (ICMLA).

[23]  Jinsung Yoon,et al.  Dynamic Prediction in Clinical Survival Analysis Using Temporal Convolutional Networks , 2020, IEEE Journal of Biomedical and Health Informatics.

[24]  Albert Y. Zomaya,et al.  Edge Intelligence: The Confluence of Edge Computing and Artificial Intelligence , 2019, IEEE Internet of Things Journal.

[25]  Jun Wan,et al.  Explore Efficient Local Features from RGB-D Data for One-Shot Learning Gesture Recognition , 2016, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[26]  DeLiang Wang,et al.  TCNN: Temporal Convolutional Neural Network for Real-time Speech Enhancement in the Time Domain , 2019, ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[27]  Gregory D. Hager,et al.  Temporal Convolutional Networks for Action Segmentation and Detection , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[28]  Michele Magno,et al.  A RISC-V Based Open Hardware Platform for Always-On Wearable Smart Sensing , 2019, 2019 IEEE 8th International Workshop on Advances in Sensors and Interfaces (IWASI).

[29]  Lars Bauer,et al.  From Cloud Down to Things: An Overview of Machine Learning in Internet of Things , 2019, IEEE Internet of Things Journal.

[30]  Ming Yang,et al.  3D Convolutional Neural Networks for Human Action Recognition , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[31]  Giancarlo Fortino,et al.  Multi-user activity recognition: Challenges and opportunities , 2020, Inf. Fusion.

[32]  Giancarlo Fortino,et al.  Data Mining at the IoT Edge , 2019, 2019 28th International Conference on Computer Communication and Networks (ICCCN).

[33]  Jong-Hwan Kim,et al.  Short-Range Radar Based Real-Time Hand Gesture Recognition Using LSTM Encoder , 2019, IEEE Access.

[34]  Roland Siegwart,et al.  Conference Presentation Slides on Kinect v2 for Mobile Robot Navigation: Evaluation and Modeling , 2015 .

[35]  Ruize Xu,et al.  MEMS Accelerometer Based Nonspecific-User Hand Gesture Recognition , 2012, IEEE Sensors Journal.

[36]  Ling Shao,et al.  Deep Dynamic Neural Networks for Multimodal Gesture Segmentation and Recognition , 2016, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[37]  Rob Miller,et al.  3D Tracking via Body Radio Reflections , 2014, NSDI.

[38]  Michele Magno,et al.  Low Power Embedded Gesture Recognition Using Novel Short-Range Radar Sensors , 2019, 2019 IEEE SENSORS.

[39]  Yuliang Sun,et al.  Gesture Classification with Handcrafted Micro-Doppler Features using a FMCW Radar , 2018, 2018 IEEE MTT-S International Conference on Microwaves for Intelligent Mobility (ICMIM).

[40]  Vladlen Koltun,et al.  An Empirical Evaluation of Generic Convolutional and Recurrent Networks for Sequence Modeling , 2018, ArXiv.

[41]  Analysis of the hands in egocentric vision: A survey , 2019, IEEE transactions on pattern analysis and machine intelligence.

[42]  Luca Benini,et al.  Near-Threshold RISC-V Core With DSP Extensions for Scalable IoT Endpoint Devices , 2016, IEEE Transactions on Very Large Scale Integration (VLSI) Systems.

[43]  Xukan Ran,et al.  Deep Learning With Edge Computing: A Review , 2019, Proceedings of the IEEE.

[44]  Roozbeh Jafari,et al.  Orientation Independent Activity/Gesture Recognition Using Wearable Motion Sensors , 2019, IEEE Internet of Things Journal.

[45]  Bharti Bansal,et al.  Gesture Recognition: A Survey , 2016 .

[46]  Antonis A. Argyros,et al.  Vision-based Hand Gesture Recognition for Human-Computer Interaction , 2008 .

[47]  Luca Benini,et al.  Origami: A Convolutional Network Accelerator , 2015, ACM Great Lakes Symposium on VLSI.

[48]  Yang Zhang,et al.  Tomo: Wearable, Low-Cost Electrical Impedance Tomography for Hand Gesture Recognition , 2015, UIST.

[49]  Ivan Poupyrev,et al.  Soli , 2016, ACM Trans. Graph..

[50]  Alex Bateman,et al.  An introduction to hidden Markov models. , 2007, Current protocols in bioinformatics.

[51]  Cordelia Schmid,et al.  Long-Term Temporal Convolutions for Action Recognition , 2016, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[52]  Heiga Zen,et al.  WaveNet: A Generative Model for Raw Audio , 2016, SSW.

[53]  Shyamnath Gollakota,et al.  Bringing Gesture Recognition to All Devices , 2014, NSDI.

[54]  Karam Dad Kallu,et al.  Hand Gestures Recognition Using Radar Sensors for Human-Computer-Interaction: A Review , 2021, Remote. Sens..

[55]  Lihua Xie,et al.  Learning Gestures From WiFi: A Siamese Recurrent Convolutional Architecture , 2019, IEEE Internet of Things Journal.

[56]  Joseph A. Paradiso,et al.  WristFlex: low-power gesture input with wrist-worn pressure sensors , 2014, UIST.

[57]  Hermann Ney,et al.  Weakly Supervised Learning with Multi-Stream CNN-LSTM-HMMs to Discover Sequential Parallelism in Sign Language Videos , 2020, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[58]  Luca Benini,et al.  Gesture Recognition in Ego-centric Videos Using Dense Trajectories and Hand Segmentation , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition Workshops.