UWHear: through-wall extraction and separation of audio vibrations using wireless signals

The ability to detect, classify, and locate complex acoustic events can help smart systems build context-awareness, e.g., to make rich inferences about human behaviors in physical spaces. Conventional methods measure acoustic signals with microphones. Because signals from multiple acoustic sources blend as they propagate to a sensor, such methods face a dual challenge: separating the signal of an acoustic event from background noise and from other acoustic events of interest. Recent research has proposed using radio-frequency (RF) signals, e.g., Wi-Fi and millimeter-wave (mmWave), to sense sound directly from source vibrations. Although these works can separate an acoustic event from background noise, they cannot monitor multiple sound sources simultaneously. In this paper, we present UWHear, a system that simultaneously recovers and separates sounds from multiple sources. Unlike previous works that use continuous-wave RF, UWHear employs Impulse Radio Ultra-Wideband (IR-UWB) technology to build an audio sensing system that addresses both challenges. Further, IR-UWB radios can penetrate light building materials, which enables UWHear to operate in some non-line-of-sight (NLOS) conditions. In addition to providing a theoretical guarantee for audio recovery using RF pulses, we implement an audio sensing prototype using a commercial off-the-shelf IR-UWB radar. Our experiments show that UWHear can effectively separate the content of two speakers placed only 25 cm apart. UWHear can also capture and separate multiple sounds and vibrations of household appliances while remaining immune to non-target noise coming from other directions.
