Near-field signal acquisition for smartglasses using two acoustic vector-sensors

Smartglasses, in addition to their visual-output capabilities, often contain acoustic sensors for receiving the user's voice. However, operation in noisy environments may lead to significant degradation of the received signal. To address this issue, we propose employing an acoustic sensor array which is mounted on the eyeglasses frames. The signals from the array are processed by an algorithm with the purpose of acquiring the desired near-field speech signal produced by the wearer while suppressing noise signals originating from the environment. The array is comprised of two acoustic vector-sensors (AVSs) which are located at the fore of the glasses' temples. Each AVS consists of four collocated subsensors: one pressure sensor (with an omnidirectional response) and three particle-velocity sensors (with dipole responses) oriented in mutually orthogonal directions. The array configuration is designed to boost the input power of the desired signal, and to ensure that the characteristics of the noise at the different channels are sufficiently diverse (lending towards more effective noise suppression). Since changes in the array's position correspond to the desired speaker's movement, the relative source-receiver position remains unchanged; hence, the need to track fluctuations of the steering vector is avoided. Conversely, the spatial statistics of the noise are subject to rapid and abrupt changes due to sudden movement and rotation of the user's head. Consequently, the algorithm must be capable of rapid adaptation toward such changes. We propose an algorithm which incorporates detection of the desired speech in the time-frequency domain, and employs this information to adaptively update estimates of the noise statistics. The speech detection plays a key role in ensuring the quality of the output signal. We conduct controlled measurements of the array in noisy scenarios. The proposed algorithm preforms favorably with respect to conventional algorithms.

[1]  P. Stoica,et al.  Robust Adaptive Beamforming , 2013 .

[2]  A. Gualtierotti H. L. Van Trees, Detection, Estimation, and Modulation Theory, , 1976 .

[4]  Woodrow Barfield,et al.  Fundamentals of Wearable Computers and Augumented Reality , 2000 .

[5]  Ben D. Sawyer,et al.  Google Glass , 2014, Hum. Factors.

[6]  René Martinus Maria Derkx First-Order Adaptive Azimuthal Null-Steering for the Suppression of Two Directional Interferers , 2010, EURASIP J. Adv. Signal Process..

[7]  J. Capon High-resolution frequency-wavenumber spectrum analysis , 1969 .

[8]  Allan D. Pierce,et al.  Acoustics , 1989 .

[9]  Gary W. Elko,et al.  A steerable and variable first-order differential microphone array , 1997, 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[10]  Shengli Zhou,et al.  Iterative Sparse Channel Estimation and Decoding for Underwater MIMO-OFDM , 2009, OCEANS 2009.

[11]  E. Gilbert,et al.  Optimum design of directive antenna arrays subject to random variations , 1955 .

[12]  Cliff Randell Wearable Computing : A Review , 2006 .

[13]  Emanuel A. P. Habets,et al.  Robust beamforming using sensors with nonidentical directivity patterns , 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.

[14]  Ehud Weinstein,et al.  Signal enhancement using beamforming and nonstationarity with applications to speech , 2001, IEEE Trans. Signal Process..

[15]  Muawiyath Shujau,et al.  Designing Acoustic Vector Sensors for localisation of sound sources in air , 2009, 2009 17th European Signal Processing Conference.

[16]  Marc Moonen,et al.  Spatially pre-processed speech distortion weighted multi-channel Wiener filtering for noise reduction , 2003, Signal Process..

[17]  Elise Ackerman Google gets in your face [2013 Tech To Watch] , 2013, IEEE Spectrum.

[18]  Ulrich Heute,et al.  Speech‐Transmission Quality: Aspects and Assessment for Wideband vs. Narrowband Signals , 2008 .

[19]  Klaus Uwe Simmer,et al.  Superdirective Microphone Arrays , 2001, Microphone Arrays.

[20]  David Malah,et al.  Speech enhancement using a minimum mean-square error log-spectral amplitude estimator , 1984, IEEE Trans. Acoust. Speech Signal Process..

[21]  Gary W. Elko,et al.  A simple adaptive first-order differential microphone , 1995, Proceedings of 1995 Workshop on Applications of Signal Processing to Audio and Accoustics.

[22]  Petros Maragos,et al.  An optimum microphone array post-filter for speech applications , 2006, INTERSPEECH.

[23]  Mark Harris State of automation [News] , 2015 .

[24]  Hervé Bourlard,et al.  Microphone array post-filter based on noise field coherence , 2003, IEEE Trans. Speech Audio Process..

[25]  Israel Cohen,et al.  Speech enhancement based on the general transfer function GSC and postfiltering , 2003, IEEE Transactions on Speech and Audio Processing.

[26]  Ephraim Speech enhancement using a minimum mean square error short-time spectral amplitude estimator , 1984 .

[27]  Jesper Jensen,et al.  An Algorithm for Intelligibility Prediction of Time–Frequency Weighted Noisy Speech , 2011, IEEE Transactions on Audio, Speech, and Language Processing.

[28]  Joerg Bitzer,et al.  Post-Filtering Techniques , 2001, Microphone Arrays.

[29]  M. Melamed Detection , 2021, SETI: Astronomy as a Contact Sport.

[30]  René Martinus Maria Derkx,et al.  Theoretical Analysis of a First-Order Azimuth-Steerable Superdirective Microphone Array , 2009, IEEE Transactions on Audio, Speech, and Language Processing.

[31]  O. L. Frost,et al.  An algorithm for linearly constrained adaptive array processing , 1972 .

[32]  Jeffrey L. Krolik,et al.  Relationships between adaptive minimum variance beamforming and optimal source localization , 2000, IEEE Trans. Signal Process..

[33]  de Hans-Elias Bree,et al.  An overview of microflown technologies , 2003 .