Time-Varying Vocal Folds Vibration Detection Using a 24 GHz Portable Auditory Radar

Time-varying vocal folds vibration information is of crucial importance in speech processing, but traditional devices for acquiring speech signals are easily corrupted by high background noise and voice interference. In this paper, we present a non-acoustic way to capture human vocal folds vibration using a 24-GHz portable auditory radar. Since the vocal folds vibration amplitude reaches only several millimeters, a high operating frequency and 4 × 4 array antennas are used to achieve high sensitivity. A Variational Mode Decomposition (VMD) based algorithm is proposed that first decomposes the radar-detected auditory signal into a sequence of intrinsic modes and then extracts the time-varying vocal folds vibration frequency from the corresponding mode. Feasibility demonstration, evaluation, and comparison are conducted with tonal and non-tonal languages. The low relative errors show high consistency between the radar-detected time-varying vocal folds vibration frequency and the acoustic fundamental frequency, while the auditory radar significantly improves the frequency-resolving power.
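The two ideas in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation, and the abstract does not say how the frequency is extracted from the chosen mode. The sketch assumes the standard round-trip Doppler phase relation 4πx/λ to show why a higher carrier frequency gives a larger phase swing for the same displacement, and it assumes an analytic-signal (Hilbert-style) instantaneous-frequency estimate applied to a single mode. The VMD step itself is not implemented; a synthetic gliding tone stands in for one decomposed mode, and all function names, the sample rate, and the test signal are hypothetical.

```python
import numpy as np

C = 299_792_458.0  # speed of light in m/s


def phase_swing_rad(displacement_m, carrier_hz):
    """Round-trip Doppler phase change 4*pi*x/lambda for a displacement x."""
    wavelength = C / carrier_hz
    return 4.0 * np.pi * displacement_m / wavelength


def analytic_signal(x):
    """FFT-based analytic signal: zero the negative-frequency bins."""
    n = len(x)
    spectrum = np.fft.fft(x)
    weights = np.zeros(n)
    weights[0] = 1.0
    if n % 2 == 0:
        weights[n // 2] = 1.0
        weights[1:n // 2] = 2.0
    else:
        weights[1:(n + 1) // 2] = 2.0
    return np.fft.ifft(spectrum * weights)


def instantaneous_frequency_hz(mode, fs):
    """Instantaneous frequency from the unwrapped phase of the analytic signal."""
    phase = np.unwrap(np.angle(analytic_signal(mode)))
    return np.diff(phase) * fs / (2.0 * np.pi)


# Hypothetical stand-in for one decomposed mode: a tone gliding 120 Hz -> 180 Hz.
fs = 4000.0                                  # assumed sample rate in Hz
t = np.arange(0, 0.5, 1.0 / fs)
f_true = 120.0 + 120.0 * t                   # true time-varying frequency
mode = np.cos(2.0 * np.pi * (120.0 * t + 60.0 * t ** 2))

f_est = instantaneous_frequency_hz(mode, fs)

# Compare away from the edges, where the FFT-based estimate is least reliable.
lo, hi = len(f_est) // 5, 4 * len(f_est) // 5
median_error_hz = float(np.median(np.abs(f_est[lo:hi] - f_true[lo:hi])))

swing_24ghz = phase_swing_rad(1e-3, 24e9)    # 1 mm displacement at 24 GHz
swing_2p4ghz = phase_swing_rad(1e-3, 2.4e9)  # same displacement at 2.4 GHz
```

Under these assumptions, a 1 mm displacement produces about 1 rad of phase change at 24 GHz, ten times the swing at 2.4 GHz, and the instantaneous-frequency estimate is only meaningful for a mode that is close to mono-component, which is the role the decomposition step plays.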
