Highly-Optimized Radar-Based Gesture Recognition System with Depthwise Expansion Module

The increasing integration of technology in our daily lives demands the development of more convenient human–computer interaction (HCI) methods. Most of the current hand-based HCI strategies exhibit various limitations, e.g., sensibility to variable lighting conditions and limitations on the operating environment. Further, the deployment of such systems is often not performed in resource-constrained contexts. Inspired by the MobileNetV1 deep learning network, this paper presents a novel hand gesture recognition system based on frequency-modulated continuous wave (FMCW) radar, exhibiting a higher recognition accuracy in comparison to the state-of-the-art systems. First of all, the paper introduces a method to simplify radar preprocessing while preserving the main information of the performed gestures. Then, a deep neural classifier with the novel Depthwise Expansion Module based on the depthwise separable convolutions is presented. The introduced classifier is optimized and deployed on the Coral Edge TPU board. The system defines and adopts eight different hand gestures performed by five users, offering a classification accuracy of 98.13% while operating in a low-power and resource-constrained environment.

[1]  Shaidah Jusoh,et al.  A systematic review on hand gesture recognition techniques, challenges and applications , 2019, PeerJ Comput. Sci..

[2]  Rana Ali Amjad,et al.  A White Paper on Neural Network Quantization , 2021, ArXiv.

[3]  Forrest N. Iandola,et al.  SqueezeNet: AlexNet-level accuracy with 50x fewer parameters and <1MB model size , 2016, ArXiv.

[4]  Chih-Yang Lin,et al.  Dynamic Hand Gesture Recognition Using 3DCNN and LSTM with FSM Context-Aware Model , 2019, Sensors.

[5]  Guoming Tang,et al.  A Survey on Edge Computing Systems and Tools , 2019, Proceedings of the IEEE.

[6]  Bo Chen,et al.  Quantization and Training of Neural Networks for Efficient Integer-Arithmetic-Only Inference , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[7]  Mario Vestias,et al.  Deep Learning on Edge , 2020, Smart Systems Design, Applications, and Challenges.

[8]  G. Ram Mohana Reddy,et al.  Hand Gesture-Based Stable PowerPoint Presentation Using Kinect , 2018 .

[9]  A. V. Bychkov,et al.  DOA Estimation in Radar Sensors with Colocated Antennas , 2020, 2020 Systems of Signal Synchronization, Generating and Processing in Telecommunications (SYNCHROINFO).

[10]  Meng Zhang,et al.  Efficient Resource-Aware Convolutional Neural Architecture Search for Edge Computing with Pareto-Bayesian Optimization , 2021, Sensors.

[11]  Yiwen Sun,et al.  Survey of Edge Computing Based on a Generalized Framework and Some Recommendation , 2020, EDGE.

[12]  Pavlo Molchanov,et al.  Online Detection and Classification of Dynamic Hand Gestures with Recurrent 3D Convolutional Neural Networks , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[13]  Ling Shao,et al.  Learning Discriminative Representations from RGB-D Video Data , 2013, IJCAI.

[14]  Pavlo Molchanov,et al.  Multi-sensor system for driver's hand-gesture recognition , 2015, 2015 11th IEEE International Conference and Workshops on Automatic Face and Gesture Recognition (FG).

[15]  Atefeh Shahroudnejad,et al.  A Survey on Understanding, Visualizations, and Explanation of Deep Neural Networks , 2021, ArXiv.

[16]  Dumitru Erhan,et al.  Going deeper with convolutions , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[17]  Erwin Biebl,et al.  Improved People Counting Algorithm for Indoor Environments using 60 GHz FMCW Radar , 2020, 2020 IEEE Radar Conference (RadarConf20).

[18]  Avik Santra,et al.  Analysis of Edge-Optimized Deep Learning Classifiers for Radar-Based Gesture Recognition , 2021, IEEE Access.

[19]  Rita Cucchiara,et al.  Hand Gestures for the Human-Car Interaction: The Briareo Dataset , 2019, ICIAP.

[20]  Thomas Serre,et al.  HMDB: A large video database for human motion recognition , 2011, 2011 International Conference on Computer Vision.

[21]  Seongjoo Lee,et al.  A Frame Detection Method for Real-Time Hand Gesture Recognition Systems Using CW-Radar , 2020, Sensors.

[22]  Saurabh Bilgaiyan,et al.  Recent Trends in HCI: A survey on Data Glove, LEAP Motion and Microsoft Kinect , 2018, 2018 IEEE International Conference on System, Computation, Automation and Networking (ICSCA).

[23]  Karam Dad Kallu,et al.  Hand Gestures Recognition Using Radar Sensors for Human-Computer-Interaction: A Review , 2021, Remote. Sens..

[24]  Georgiana Simion,et al.  A PointNet-Based Solution for 3D Hand Gesture Recognition , 2020, Sensors.

[25]  Matthew Ritchie,et al.  Micro-Doppler Gesture Recognition using Doppler, Time and Range Based Features , 2019, 2019 IEEE Radar Conference (RadarConf).

[26]  Kostas Vassilakis,et al.  Control of Smart Home Operations Using Natural Language Processing, Voice Recognition and IoT Technologies in a Multi-Tier Architecture , 2019, Designs.

[27]  Oleg Antropov,et al.  Vital Sign Monitoring Using FMCW Radar in Various Sleeping Scenarios , 2020, Sensors.

[28]  Seda McKilligan,et al.  A Systematic Literature Review for Human-Computer Interaction and Design Thinking Process Integration , 2018, HCI.

[29]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[30]  Avik Santra,et al.  Short-Range Radar-Based Gesture Recognition System Using 3D CNN With Triplet Loss , 2019, IEEE Access.

[31]  Francesco Fioranelli,et al.  Dop‐NET: a micro‐Doppler radar data challenge , 2020, Electronics Letters.

[32]  Gianluca Paravati,et al.  Human-Computer Interaction in Smart Environments , 2015, Sensors.

[33]  Sebastià Sallent,et al.  Deep Learning at the Mobile Edge: Opportunities for 5G Networks , 2020, Applied Sciences.

[34]  Anis Koubaa,et al.  DeepBrain: Experimental Evaluation of Cloud-Based Computation Offloading and Edge Computing in the Internet-of-Drones for Deep Learning Applications , 2020, Sensors.

[35]  Qisong Wu,et al.  Dynamic Hand Gesture Recognition Using FMCW Radar Sensor for Driving Assistance , 2018, 2018 10th International Conference on Wireless Communications and Signal Processing (WCSP).

[36]  Tao Zhang,et al.  A Survey of Model Compression and Acceleration for Deep Neural Networks , 2017, ArXiv.

[37]  Sergio Escalera,et al.  ChaLearn Looking at People Challenge 2014: Dataset and Results , 2014, ECCV Workshops.

[38]  Yong Wang,et al.  A Novel Detection and Recognition Method for Continuous Hand Gesture Using FMCW Radar , 2020, IEEE Access.

[39]  Shiliang Sun,et al.  A Survey of Optimization Methods From a Machine Learning Perspective , 2019, IEEE Transactions on Cybernetics.

[40]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[41]  Bo Chen,et al.  MobileNets: Efficient Convolutional Neural Networks for Mobile Vision Applications , 2017, ArXiv.

[42]  Geoffrey E. Hinton,et al.  Distilling the Knowledge in a Neural Network , 2015, ArXiv.

[43]  Ivan Poupyrev,et al.  Soli , 2016, ACM Trans. Graph..

[44]  Debi Prosad Dogra,et al.  Coupled HMM-based multi-sensor data fusion for sign language recognition , 2017, Pattern Recognit. Lett..

[45]  Syed Faraz Hasan,et al.  Review-Microwave Radar Sensing Systems for Search and Rescue Purposes , 2019, Sensors.

[46]  Fenglei Fan,et al.  On Interpretability of Artificial Neural Networks: A Survey , 2020, IEEE Transactions on Radiation and Plasma Medical Sciences.

[47]  Mu Zhou,et al.  Latern: Dynamic Continuous Hand Gesture Recognition Using FMCW Radar Sensor , 2018, IEEE Sensors Journal.

[48]  Sung Ho Cho,et al.  Hand Gesture Recognition Using an IR-UWB Radar with an Inception Module-Based Classifier , 2020, Sensors.

[49]  Ta-Sung Lee,et al.  Design of an FMCW radar baseband signal processing system for automotive application , 2016, SpringerPlus.

[50]  Mariusz Zubert,et al.  Novel Radar-based Gesture Recognition System using Optimized CNN-LSTM Deep Neural Network for Low-power Microcomputer Platform , 2021, ICAART.

[51]  Ivan Poupyrev,et al.  Interacting with Soli: Exploring Fine-Grained Dynamic Gesture Recognition in the Radio-Frequency Spectrum , 2016, UIST.

[52]  J. Katona,et al.  A Review of Human–Computer Interaction and Virtual Reality Research Fields in Cognitive InfoCommunications , 2021, Applied Sciences.

[53]  Robert Weigel,et al.  Multi-Modal Cross Learning for Improved People Counting using Short-Range FMCW Radar , 2020, 2020 IEEE International Radar Conference (RADAR).

[54]  Andrew Zisserman,et al.  Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.

[55]  Daniela Rus,et al.  Lost in Pruning: The Effects of Pruning Neural Networks beyond Test Accuracy , 2021, MLSys.

[56]  Andrew Zisserman,et al.  Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[57]  Seo Weon Heo,et al.  Automotive Frequency Modulated Continuous Wave Radar Interference Reduction Using Per-Vehicle Chirp Sequences , 2018, Sensors.

[58]  Hasanudin,et al.  Review of a framework for audiovisual dialog-based in human computer interaction , 2015, 2015 International Conference on Information & Communication Technology and Systems (ICTS).

[59]  R. Cucchiara,et al.  A Transformer-Based Network for Dynamic Hand Gesture Recognition , 2020, 2020 International Conference on 3D Vision (3DV).

[60]  Mark Sandler,et al.  MobileNetV2: Inverted Residuals and Linear Bottlenecks , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[61]  Xiaofei Wang,et al.  Convergence of Edge Computing and Deep Learning: A Comprehensive Survey , 2019, IEEE Communications Surveys & Tutorials.

[62]  Moussa Ayyash,et al.  Edge-Computing Architectures for Internet of Things Applications: A Survey , 2020, Sensors.

[63]  Stephan Nebiker,et al.  Outdoor Mobile Mapping and AI-Based 3D Object Detection with Low-Cost RGB-D Cameras: The Use Case of On-Street Parking Statistics , 2021, Remote. Sens..

[64]  Soo-Hyung Kim,et al.  Gesture Recognition Based on 3D Human Pose Estimation and Body Part Segmentation for RGB Data Input , 2020, Applied Sciences.

[65]  Frank Hutter,et al.  Neural Architecture Search: A Survey , 2018, J. Mach. Learn. Res..

[66]  Quoc V. Le,et al.  MixConv: Mixed Depthwise Convolutional Kernels , 2019, BMVC.

[67]  Bill Z. Manaris,et al.  Natural Language Processing: A Human-Computer Interaction Perspective , 1998, Adv. Comput..

[68]  Xukan Ran,et al.  Deep Learning With Edge Computing: A Review , 2019, Proceedings of the IEEE.

[69]  Carlos Sagüés,et al.  Human-Computer Interaction Based on Hand Gestures Using RGB-D Sensors , 2013, Sensors.

[70]  Mário P. Véstias,et al.  A Survey of Convolutional Neural Networks on Edge with Reconfigurable Computing , 2019, Algorithms.

[71]  Julien N. Siems,et al.  Dynamic Pruning of a Neural Network via Gradient Signal-to-Noise Ratio , 2021 .

[72]  Avik Santra,et al.  Radar Gesture Recognition System in Presence of Interference using Self-Attention Neural Network , 2019, 2019 18th IEEE International Conference On Machine Learning And Applications (ICMLA).

[73]  Muhammad Shafique,et al.  An Updated Survey of Efficient Hardware Architectures for Accelerating Deep Convolutional Neural Networks , 2020, Future Internet.

[74]  Hong Zhu,et al.  Hyper-Parameter Optimization: A Review of Algorithms and Applications , 2020, ArXiv.

[75]  Jakob Abeßer,et al.  A Review of Deep Learning Based Methods for Acoustic Scene Classification , 2020, Applied Sciences.

[76]  Hao Cheng,et al.  Pruning Filter in Filter , 2020, NeurIPS.

[77]  Manju Khari,et al.  Gesture Recognition of RGB and RGB-D Static Images Using Convolutional Neural Networks , 2019, Int. J. Interact. Multim. Artif. Intell..

[78]  Mubarak Shah,et al.  UCF101: A Dataset of 101 Human Actions Classes From Videos in The Wild , 2012, ArXiv.

[79]  James O' Neill An Overview of Neural Network Compression , 2020, ArXiv.

[80]  Rita Cucchiara,et al.  Multimodal Hand Gesture Classification for the Human-Car Interaction , 2020, Informatics.

[81]  Avik Santra,et al.  Robust Gesture Recognition Using Millimetric-Wave Radar System , 2018, IEEE Sensors Letters.

[82]  Ivan Poupyrev,et al.  SOLI: A Tiny Device for a New Human Machine Interface , 2021, 2021 IEEE International Solid- State Circuits Conference (ISSCC).

[83]  Zhiru Zhang,et al.  Improving Neural Network Quantization without Retraining using Outlier Channel Splitting , 2019, ICML.

[84]  A. Raghuvanshi,et al.  Facial Expression Recognition with Convolutional Neural Networks , 2016 .

[85]  Albert Y. Zomaya,et al.  Edge Intelligence: The Confluence of Edge Computing and Artificial Intelligence , 2019, IEEE Internet of Things Journal.