CanalScan: Tongue-Jaw Movement Recognition via Ear Canal Deformation Sensing

Human-machine interface based on tongue-jaw movements has recently become one of the major technological trends. However, existing schemes have several limitations, such as requiring dedicated hardware and are usually uncomfortable to wear. This paper presents CanalScan, a nonintrusive system for tongue-jaw movement recognition using only commodity speaker and microphone mounted on ubiquitous off-the-shelf devices (e.g., smartphones). The basic idea is to send an acoustic signal, then captures its reflections and derive unique patterns of ear canal deformation caused by tongue-jaw movements. A dynamic segmentation method with Support Vector Domain Description is used to segment tongue-jaw movements. To combat sensor position-sensitive deficiency and ear-canal-shape-sensitive deficiency in multi-path reflections, we first design algorithms to assist users in adjusting the acoustic sensors to the same valid zone. Then we propose a data transformation mechanism to reduce the impacts of diversities in ear canal shapes and relative positions between sensors and the ear canal. CanalScan explores twelve unique and consistent features and applies a Random Forest classifier to distinguish tongue-jaw movements. Extensive experiments with twenty participants demonstrate that CanalScan achieves promising recognition for six tongue-jaw movements, is robust against various usage scenarios, and can be generalized to new users without retraining and adaptation.

[1]  J. Osborn,et al.  Mapping ear canal movement using area-based surface matching. , 2002, The Journal of the Acoustical Society of America.

[2]  Jingyuan Zhang,et al.  Tongible: a non-contact tongue-based interaction technique , 2012, ASSETS '12.

[3]  Fan Li,et al.  D3-Guard: Acoustic-based Drowsy Driving Detection Using Smartphones , 2019, IEEE INFOCOM 2019 - IEEE Conference on Computer Communications.

[4]  Minglu Li,et al.  SteerTrack: Acoustic-Based Device-Free Steering Tracking Leveraging Smartphones , 2018, 2018 15th Annual IEEE International Conference on Sensing, Communication, and Networking (SECON).

[5]  Buntarou Shizuki,et al.  CanalSense: Face-Related Movement Recognition System based on Sensing Air Pressure in Ear Canals , 2017, UIST.

[6]  D. Šuta,et al.  Reference hearing thresholds in an extended frequency range as a function of age. , 2014, The Journal of the Acoustical Society of America.

[7]  Yunhao Liu,et al.  Acousticcardiogram: Monitoring Heartbeats using Acoustic Signals on Smart Devices , 2018, IEEE INFOCOM 2018 - IEEE Conference on Computer Communications.

[8]  Hiroki Watanabe,et al.  Facial expression recognition using ear canal transfer function , 2019, UbiComp.

[9]  Kang G. Shin,et al.  Cross-Platform Support for Rapid Development of Mobile Acoustic Sensing Applications , 2018, MobiSys.

[10]  Jason Wu,et al.  TongueBoard: An Oral Interface for Subtle Input , 2019, AH.

[11]  Wei Sun,et al.  EarEcho , 2019, Proc. ACM Interact. Mob. Wearable Ubiquitous Technol..

[12]  Robert P. W. Duin,et al.  Support vector domain description , 1999, Pattern Recognit. Lett..

[13]  Sangki Yun,et al.  Strata: Fine-Grained Acoustic-based Device-Free Tracking , 2017, MobiSys.

[14]  Yu Wang,et al.  AWash: Handwashing Assistance for the Elderly with Dementia via Wearables , 2021, IEEE INFOCOM 2021 - IEEE Conference on Computer Communications.

[15]  Yu Wang,et al.  PPGPass: Nonintrusive and Secure Mobile Two-Factor Authentication via Wearables , 2020, IEEE INFOCOM 2020 - IEEE Conference on Computer Communications.

[16]  Lothar Thiele,et al.  BARTON: Low Power Tongue Movement Sensing with In-Ear Barometers , 2017, 2017 IEEE 23rd International Conference on Parallel and Distributed Systems (ICPADS).

[17]  Tomoki Toda,et al.  Voice Conversion Based on Maximum-Likelihood Estimation of Spectral Parameter Trajectory , 2007, IEEE Transactions on Audio, Speech, and Language Processing.

[18]  Atsushi Nishikawa,et al.  Earable TEMPO: A Novel, Hands-Free Input Device that Uses the Movement of the Tongue Measured with a Wearable Ear Sensor , 2018, Sensors.

[19]  Hitoshi Imaoka,et al.  Fast and accurate personal authentication using ear acoustics , 2016, 2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA).

[20]  Thad Starner,et al.  Stick it in your ear: building an in-ear jaw movement sensor , 2015, UbiComp/ISWC Adjunct.

[21]  Gregory S. Hoeker,et al.  Ear Canal Anatomy and Activity , 2003 .

[22]  Zheng Li,et al.  Tongue-n-cheek: non-contact tongue gesture recognition , 2015, IPSN.

[23]  Sunghyun Choi,et al.  Chirp signal-based aerial acoustic communication for smart devices , 2015, 2015 IEEE Conference on Computer Communications (INFOCOM).

[24]  C.J. James,et al.  Independent Component Analysis for Extraction of Critical Features from Tongue Movement Ear Pressure Signals , 2007, 2007 29th Annual International Conference of the IEEE Engineering in Medicine and Biology Society.

[25]  I. M. Ventry,et al.  Collapse of the ear canal during audiometry. , 1961, Archives of otolaryngology.

[26]  Anh Nguyen,et al.  TYTH-Typing On Your Teeth: Tongue-Teeth Localization for Human-Computer Interface , 2018, MobiSys.

[27]  Takafumi Koshinaka,et al.  Ear Acoustic Biometrics Using Inaudible Signals and Its Application to Continuous User Authentication , 2018, 2018 26th European Signal Processing Conference (EUSIPCO).

[28]  Bodo Urban,et al.  EarFieldSensing: A Novel In-Ear Electric Field Sensing to Enrich Wearable Gesture Input through Facial Expressions , 2017, CHI.

[29]  Anton H. M. Akkermans,et al.  Acoustic ear recognition for person identification , 2005, Fourth IEEE Workshop on Automatic Identification Advanced Technologies (AutoID'05).

[30]  Maysam Ghovanloo,et al.  Tongue drive: a tongue operated magnetic sensor based wireless assistive technology for people with severe disabilities , 2006, 2006 IEEE International Symposium on Circuits and Systems.

[31]  Anton H. M. Akkermans,et al.  Acoustic Ear Recognition , 2006, ICB.

[32]  Ravi Vaidyanathan,et al.  Ensemble classification for robust discrimination of multi-channel, multi-class tongue-movement ear pressure signals , 2011, 2011 Annual International Conference of the IEEE Engineering in Medicine and Biology Society.

[33]  Maysam Ghovanloo,et al.  The tongue and ear interface: a wearable system for silent speech recognition , 2014, SEMWEB.

[34]  Hong Zhang,et al.  A multistage adaptive thresholding method , 2005, Pattern Recognit. Lett..

[35]  Ben Taskar,et al.  Non-intrusive tongue machine interface , 2014, CHI.

[36]  airFinger: Micro Finger Gesture Recognition via NIR Light Sensing for Smart Devices , 2020, 2020 IEEE 40th International Conference on Distributed Computing Systems (ICDCS).

[37]  Yu Wang,et al.  EchoTrack: Acoustic device-free hand tracking on smart phones , 2017, IEEE INFOCOM 2017 - IEEE Conference on Computer Communications.