Deep Learning Based Air-Writing Recognition with the Choice of Proper Interpolation Technique

The act of writing letters or words in free space with body movements is known as air-writing. Air-writing recognition is a special case of gesture recognition in which gestures correspond to characters and digits written in the air. Air-writing, unlike general gestures, does not require the memorization of predefined special gesture patterns. Rather, it is sensitive to the subject and language of interest. Traditional air-writing requires an extra device containing sensor(s), while the wide adoption of smart-bands eliminates the requirement of the extra device. Therefore, air-writing recognition systems are becoming more flexible day by day. However, the variability of signal duration is a key problem in developing an air-writing recognition model. Inconsistent signal duration is obvious due to the nature of the writing and data-recording process. To make the signals consistent in length, researchers attempted various strategies including padding and truncating, but these procedures result in significant data loss. Interpolation is a statistical technique that can be employed for time-series signals to ensure minimum data loss. In this paper, we extensively investigated different interpolation techniques on seven publicly available air-writing datasets and developed a method to recognize air-written characters using a 2D-CNN model. In both user-dependent and user-independent principles, our method outperformed all the state-of-the-art methods by a clear margin for all datasets.

[1]  Biing-Hwang Juang,et al.  Air-Writing Recognition—Part I: Modeling and Recognition of Characters, Words, and Connecting Motions , 2016, IEEE Transactions on Human-Machine Systems.

[2]  Chen Chen,et al.  Deep Fisher discriminant learning for mobile hand gesture recognition , 2017, Pattern Recognit..

[3]  Xiaoli Li,et al.  Deep Convolutional Neural Networks on Multichannel Time Series for Human Activity Recognition , 2015, IJCAI.

[4]  Eric Dubois,et al.  Image up-sampling using total-variation regularization with a new observation model , 2005, IEEE Transactions on Image Processing.

[5]  François Clemens,et al.  Interpolation in Time Series : An Introductive Overview of Existing Methods, Their Performance Criteria and Uncertainty Assessment , 2017 .

[6]  Lianwen Jin,et al.  Air-Writing Translater: A Novel Unsupervised Domain Adaptation Method for Inertia-Trajectory Translation of In-air Handwriting , 2019, ArXiv.

[7]  Xiaodong Gu,et al.  Towards dropout training for convolutional neural networks , 2015, Neural Networks.

[8]  Ani Nahapetian,et al.  AirDraw: Leveraging smart watch motion sensors for mobile human computer interactions , 2016, 2016 13th IEEE Annual Consumer Communications & Networking Conference (CCNC).

[9]  Taoying Li,et al.  SRPM–CNN: a combined model based on slide relative position matrix and CNN for time series classification , 2021, Complex & Intelligent Systems.

[10]  Tien-Ying Kuo,et al.  Egocentric-View Fingertip Detection for Air Writing Based on Convolutional Neural Networks † , 2021, Sensors.

[11]  Seungjin Choi,et al.  Convolutional neural networks for human activity recognition using multiple accelerometer and gyroscope sensors , 2016, 2016 International Joint Conference on Neural Networks (IJCNN).

[12]  Kunihiko Fukushima,et al.  Neocognitron: A Self-Organizing Neural Network Model for a Mechanism of Visual Pattern Recognition , 1982 .

[13]  A. Gnauck Interpolation and approximation of water quality time series and process identification , 2004, Analytical and bioanalytical chemistry.

[14]  Shigeru Shimamoto,et al.  Wearable Air-Writing Recognition System employing Dynamic Time Warping , 2021, 2021 IEEE 18th Annual Consumer Communications & Networking Conference (CCNC).

[15]  Jong-Hwan Kim,et al.  Writing in The Air: Unconstrained Text Recognition from Finger Movement Using Spatio-Temporal Convolution , 2021, IEEE Transactions on Artificial Intelligence.

[16]  Andrew Zisserman,et al.  Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.

[17]  Kent Lyons,et al.  The Gesture Watch: A Wireless Contact-free Gesture based Wrist Interface , 2007, 2007 11th IEEE International Symposium on Wearable Computers.

[18]  Yu-Liang Hsu,et al.  An Inertial Pen With Dynamic Time Warping Recognizer for Handwriting and Gesture Recognition , 2015, IEEE Sensors Journal.

[19]  Avik Santra,et al.  Air-Writing with Sparse Network of Radars using Spatio-Temporal Learning , 2021, 2020 25th International Conference on Pattern Recognition (ICPR).

[20]  Biing-Hwang Juang,et al.  6DMG: a new 6D motion gesture database , 2012, MMSys '12.

[21]  Tarun Gulati,et al.  Zooming Digital Images using Interpolation Techniques , 2013 .

[22]  Yang Xue,et al.  A Long Term Memory Recognition Framework on Multi-Complexity Motion Gestures , 2017, 2017 14th IAPR International Conference on Document Analysis and Recognition (ICDAR).

[23]  Nojun Kwak,et al.  Analysis on the Dropout Effect in Convolutional Neural Networks , 2016, ACCV.

[24]  N. V. Subba Reddy,et al.  Effects of padding on LSTMs and CNNs , 2019, ArXiv.

[25]  Lin Zhong,et al.  uWave: Accelerometer-based personalized gesture recognition and its applications , 2009, 2009 IEEE International Conference on Pervasive Computing and Communications.

[26]  Anind K. Dey,et al.  Serendipity: Finger Gesture Recognition using an Off-the-Shelf Smartwatch , 2016, CHI.

[27]  Dandu Amarnatha Reddy Vision Based Hand Gesture Recognition for Human Computer Interaction , 2018 .

[28]  Tanja Schultz,et al.  Airwriting: bringing text entry to wearable computers , 2013, XRDS.

[29]  Bo Yu,et al.  Convolutional Neural Networks for human activity recognition using mobile sensors , 2014, 6th International Conference on Mobile Computing, Applications and Services.

[30]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[31]  Kuo-Chin Fan,et al.  Air-writing recognition using reverse time ordered stroke context , 2021, J. Vis. Commun. Image Represent..

[32]  Bonhwa Ku,et al.  Alpha-numeric hand gesture recognition based on fusion of spatial feature modelling and temporal feature modelling , 2016 .

[33]  Biing-Hwang Juang,et al.  Air-Writing Recognition—Part II: Detection and Recognition of Writing Activity in Continuous Stream of Motion Data , 2016, IEEE Transactions on Human-Machine Systems.

[34]  Chaur-Heh Hsieh,et al.  Air-Writing Recognition Based on Deep Convolutional Neural Networks , 2021, IEEE Access.

[35]  Alireza Entezari,et al.  Uncertainty Quantification in Linear Interpolation for Isosurface Extraction , 2013, IEEE Transactions on Visualization and Computer Graphics.

[36]  Vassilis Katsouros,et al.  Air-Writing Recognition using Deep Convolutional and Recurrent Neural Network Architectures , 2020, 2020 17th International Conference on Frontiers in Handwriting Recognition (ICFHR).

[37]  Ki-Chul Kwon,et al.  Implementation of a Character Recognition System Based on Finger-Joint Tracking Using a Depth Camera , 2021, IEEE Transactions on Human-Machine Systems.

[38]  Wilhelm Burger,et al.  Digital Image Processing - An Algorithmic Introduction using Java , 2016, Texts in Computer Science.

[39]  Yuval Elovici,et al.  Handwritten Signature Verification Using Wrist-Worn Devices , 2018, Proc. ACM Interact. Mob. Wearable Ubiquitous Technol..

[40]  Dumitru Erhan,et al.  Going deeper with convolutions , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[41]  Tanja Schultz,et al.  Airwriting: a wearable handwriting recognition system , 2013, Personal and Ubiquitous Computing.

[42]  Bo Chen,et al.  MobileNets: Efficient Convolutional Neural Networks for Mobile Vision Applications , 2017, ArXiv.

[43]  T. Filik,et al.  RF-Wri: An Efficient Framework for RF-Based Device-Free Air-Writing Recognition , 2021, IEEE Sensors Journal.

[44]  R. Keys Cubic convolution interpolation for digital image processing , 1981 .

[45]  Takao Onoye,et al.  Fusion Networks for Air-Writing Recognition , 2018, MMM.

[46]  P. Revesz Interpolation and Approximation , 2010 .

[47]  Nam Kim,et al.  Trajectory-Based Air-Writing Recognition Using Deep Neural Network and Depth Sensor , 2020, Sensors.

[48]  Nitish Srivastava,et al.  Dropout: a simple way to prevent neural networks from overfitting , 2014, J. Mach. Learn. Res..

[49]  Gui-Bin Bian,et al.  Performance Analysis of Google Colaboratory as a Tool for Accelerating Deep Learning Applications , 2018, IEEE Access.

[50]  Emilio Carrizosa,et al.  Time series interpolation via global optimization of moments fitting , 2013, Eur. J. Oper. Res..

[51]  Parth H. Pathak,et al.  Finger-writing with Smartwatch: A Case for Finger and Hand Gesture Recognition using Smartwatch , 2015, HotMobile.

[52]  Rui Ye,et al.  Implementing transfer learning across different datasets for time series forecasting , 2021, Pattern Recognit..

[53]  Sang Min Yoon,et al.  Human activity recognition from accelerometer data using Convolutional Neural Network , 2017, 2017 IEEE International Conference on Big Data and Smart Computing (BigComp).

[54]  Ganesh R. Naik,et al.  CNN based approach for activity recognition using a wrist-worn accelerometer , 2017, 2017 39th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC).

[55]  Ki-Chul Kwon,et al.  Trajectory-Based Air-Writing Character Recognition Using Convolutional Neural Network , 2019, 2019 4th International Conference on Control, Robotics and Cybernetics (CRC).

[56]  Erez Shmueli,et al.  Air-writing recognition using smart-bands , 2020, Pervasive Mob. Comput..

[57]  D. Hubel,et al.  Receptive fields and functional architecture of monkey striate cortex , 1968, The Journal of physiology.

[58]  Yang Xue,et al.  Air-writing characters modelling and recognition on modified CHMM , 2016, 2016 IEEE International Conference on Systems, Man, and Cybernetics (SMC).

[59]  Yixin Chen,et al.  SHOW , 2018, Silent Cinema.

[60]  Antonis A. Argyros,et al.  Vision-based Hand Gesture Recognition for Human-Computer Interaction , 2008 .

[61]  G. Lefebvre,et al.  3D gesture classification with convolutional neural networks , 2014, IEEE International Conference on Acoustics, Speech, and Signal Processing.