Learning to Recognize Handwriting Input with Acoustic Features

For mobile or wearable devices with a small touchscreen, handwriting input (instead of typing on the touchscreen) is highly desirable for efficient human-computer interaction. Previous passive acoustic-based handwriting solutions mainly focus on print-style capital input, which is inconsistent with people's daily habits and thus causes inconvenience. In this paper, we propose WritingRecorder, a novel universal text entry system that enables free-style lowercase handwriting recognition. WritingRecorder leverages the built-in microphone of the smartphones to record the handwritten sound, and then designs an adaptive segmentation method to detect letter fragments in real-time from the recorded sound. Then we design a neural network named Inception-LSTM to extract the hidden and unique acoustic pattern associated with the writing trajectory of each letter and thus classify each letter. Moreover, we adopt a word selection method based on language model, so as to recognize legislate words from all possible letter combinations. We implement WritingRecorder as an APP on mobile phones and conduct the extensive experimental evaluation. The results demonstrate that WritingRecorder works in real-time and can achieve 93.2% accuracy even for new users without collecting and training on their handwriting samples, under a series of practical scenarios.

[1]  Shaojie Tang,et al.  SoundWrite: Text Input on Surfaces through Mobile Acoustic Sensing , 2015, SmartObjects '15.

[2]  Wenzhe Li,et al.  Recognizing Text Through Sound Alone , 2011, AAAI.

[3]  Minglu Li,et al.  SteerTrack: Acoustic-Based Device-Free Steering Tracking Leveraging Smartphones , 2018, 2018 15th Annual IEEE International Conference on Sensing, Communication, and Networking (SECON).

[4]  Shyamnath Gollakota,et al.  Contactless Sleep Apnea Detection on Smartphones , 2015, GetMobile Mob. Comput. Commun..

[5]  Tara N. Sainath,et al.  Convolutional, Long Short-Term Memory, fully connected Deep Neural Networks , 2015, 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[6]  Wei Wang,et al.  Depth Aware Finger Tapping on Virtual Displays , 2018, MobiSys.

[7]  Desney S. Tan,et al.  FingerIO: Using Active Sonar for Fine-Grained Finger Tracking , 2016, CHI.

[8]  Chen Wang,et al.  Fine-grained sleep monitoring: Hearing your breathing with smartphones , 2015, 2015 IEEE Conference on Computer Communications (INFOCOM).

[9]  Pavlo Molchanov,et al.  Online Detection and Classification of Dynamic Hand Gestures with Recurrent 3D Convolutional Neural Networks , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[10]  Lei Yang,et al.  AudioGest: enabling fine-grained hand gesture detection by decoding echo signal , 2016, UbiComp.

[11]  Yunhao Liu,et al.  Context-free Attacks Using Keyboard Acoustic Emanations , 2014, CCS.

[12]  Yunhao Liu,et al.  Acousticcardiogram: Monitoring Heartbeats using Acoustic Signals on Smart Devices , 2018, IEEE INFOCOM 2018 - IEEE Conference on Computer Communications.

[13]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[14]  Qiang Wang,et al.  Benchmarking State-of-the-Art Deep Learning Software Tools , 2016, 2016 7th International Conference on Cloud Computing and Big Data (CCBD).

[15]  Thomas Deselaers,et al.  GyroPen: Gyroscopes for Pen-Input With Mobile Phones , 2015, IEEE Transactions on Human-Machine Systems.

[16]  I. Elamvazuthi,et al.  Voice Recognition Algorithms using Mel Frequency Cepstral Coefficient (MFCC) and Dynamic Time Warping (DTW) Techniques , 2010, ArXiv.

[17]  Guoliang Xing,et al.  iSleep: unobtrusive sleep quality monitoring using smartphones , 2013, SenSys '13.

[18]  Wade Trappe,et al.  Auto++: Detecting Cars Using Embedded Microphones in Real-Time , 2017, Proc. ACM Interact. Mob. Wearable Ubiquitous Technol..

[19]  Sangki Yun,et al.  Strata: Fine-Grained Acoustic-based Device-Free Tracking , 2017, MobiSys.

[20]  Quoc V. Le,et al.  Sequence to Sequence Learning with Neural Networks , 2014, NIPS.

[21]  Sangki Yun,et al.  Turning a Mobile Device into a Mouse in the Air , 2015, MobiSys.

[22]  Christopher Kermorvant,et al.  Dropout Improves Recurrent Neural Networks for Handwriting Recognition , 2013, 2014 14th International Conference on Frontiers in Handwriting Recognition.

[23]  Anind K. Dey,et al.  The Limits of Expert Text Entry Speed on Mobile Keyboards with Autocorrect , 2019, MobileHCI.

[24]  Lior Wolf,et al.  CNN-N-Gram for HandwritingWord Recognition , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[25]  Yi-Chao Wu,et al.  Improving handwritten Chinese text recognition using neural network language models and convolutional neural network shape models , 2017, Pattern Recognit..

[26]  Fan Li,et al.  D3-Guard: Acoustic-based Drowsy Driving Detection Using Smartphones , 2019, IEEE INFOCOM 2019 - IEEE Conference on Computer Communications.

[27]  Ning Wang,et al.  Ubiquitous Writer: Robust Text Input for Small Mobile Devices via Acoustic Sensing , 2019, IEEE Internet of Things Journal.

[28]  Bing Zhou,et al.  BatTracker: High Precision Infrastructure-free Mobile Device Tracking in Indoor Environments , 2017, SenSys.

[29]  Yunhao Liu,et al.  Vernier: Accurate and Fast Acoustic Motion Tracking Using Mobile Devices , 2018, IEEE INFOCOM 2018 - IEEE Conference on Computer Communications.

[30]  Panlong Yang,et al.  Your Table Can Be an Input Panel , 2019, Proc. ACM Interact. Mob. Wearable Ubiquitous Technol..

[31]  Xinyu Zhang,et al.  Ubiquitous keyboard for small mobile devices: harnessing multipath fading for fine-grained keystroke localization , 2014, MobiSys.

[32]  Klara Nahrstedt,et al.  WritingHacker: audio based eavesdropping of handwriting via mobile devices , 2016, UbiComp.

[33]  Pascal Scalart,et al.  Author manuscript, published in "IEEE Transactions on Audio, Speech, and Language Processing (2006)" 1 Improved Signal-to-Noise Ratio Estimation for Speech Enhancement , 2010 .

[34]  David Chu,et al.  SwordFight: enabling a new class of phone-to-phone action games on commodity phones , 2012, MobiSys '12.

[35]  Michael Rohs,et al.  Pentelligence: Combining Pen Tip Motion and Writing Sounds for Handwritten Digit Recognition , 2018, CHI.

[36]  Biing-Hwang Juang,et al.  Air-Writing Recognition—Part I: Modeling and Recognition of Characters, Words, and Connecting Motions , 2016, IEEE Transactions on Human-Machine Systems.

[37]  Parth H. Pathak,et al.  Finger-writing with Smartwatch: A Case for Finger and Hand Gesture Recognition using Smartwatch , 2015, HotMobile.

[38]  Hamid Soleimani,et al.  A Concise Temporal Data Representation Model for Prediction in Biomedical Wearable Devices , 2019, IEEE Internet of Things Journal.

[39]  Sergey Ioffe,et al.  Rethinking the Inception Architecture for Computer Vision , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[40]  Lili Qiu,et al.  CAT: high-precision acoustic motion tracking , 2016, MobiCom.

[41]  Lei Xie,et al.  VSkin: Sensing Touch Gestures on Surfaces of Mobile Devices Using Acoustic Signals , 2018, MobiCom.

[42]  Panlong Yang,et al.  WordRecorder: Accurate Acoustic-based Handwriting Recognition Using Deep Learning , 2018, IEEE INFOCOM 2018 - IEEE Conference on Computer Communications.

[43]  Jie Yang,et al.  Snooping Keystrokes with mm-level Audio Ranging on a Single Phone , 2015, MobiCom.

[44]  Dong Yu,et al.  Context-Dependent Pre-Trained Deep Neural Networks for Large-Vocabulary Speech Recognition , 2012, IEEE Transactions on Audio, Speech, and Language Processing.

[45]  Jiannong Cao,et al.  UbiTouch: ubiquitous smartphone touchpads using built-in proximity and ambient light sensors , 2016, UbiComp.

[46]  Wei Wang,et al.  Device-free gesture tracking using acoustic signals , 2016, MobiCom.

[47]  Minglu Li,et al.  Leveraging Audio Signals for Early Recognition of Inattentive Driving with Smartphones , 2018, IEEE Transactions on Mobile Computing.

[48]  Dorothea Blostein,et al.  Pen Acoustic Emissions for Text and Gesture Recognition , 2009, 2009 10th International Conference on Document Analysis and Recognition.