HRTF-based robust least-squares frequency-invariant polynomial beamforming

In this work, we propose a robust Head-Related Transfer Function (HRTF)-based polynomial beamformer design which accounts for the influence of a humanoid robot's head on the sound field. In addition, it allows for a flexible steering of our previously proposed robust HRTF-based beamformer design. We evaluate the HRTF-based polynomial beamformer design and compare it to the original HRTF-based beamformer design by means of signal-independent measures as well as word error rates of an off-the-shelf speech recognition system. Our results confirm the effectiveness of the polynomial beam-former design, which makes it a promising approach to robust beam-forming for robot audition.

[1]  Jon Barker,et al.  An audio-visual corpus for speech perception and automatic speech recognition. , 2006, The Journal of the Acoustical Society of America.

[2]  K. Abed-Meraim,et al.  From binaural to multimicrophone blind source separation using fixed beamforming with HRTFs , 2012, 2012 19th International Conference on Systems, Signals and Image Processing (IWSSIP).

[3]  Ning Ma,et al.  The CHiME corpus: a resource and a challenge for computational hearing in multisource environments , 2010, INTERSPEECH.

[4]  Walter Kellermann,et al.  Design of robust superdirective beamformers as a convex optimization problem , 2009, 2009 IEEE International Conference on Acoustics, Speech and Signal Processing.

[5]  Karim Abed-Meraim,et al.  Blind source separation for robot audition using fixed HRTF beamforming , 2012, EURASIP Journal on Advances in Signal Processing.

[6]  Lars Kai Hansen,et al.  Semi-blind source separation using head-related transfer functions [speech signal separation] , 2004, 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[7]  Alexander I. Rudnicky,et al.  Pocketsphinx: A Free, Real-Time Continuous Speech Recognition System for Hand-Held Devices , 2006, 2006 IEEE International Conference on Acoustics Speech and Signal Processing Proceedings.

[8]  Walter Kellermann,et al.  On the Impact of Localization Errors on HRTF-based Robust Least-Squares Beamforming , 2016, ArXiv.

[9]  Matti Hämäläinen,et al.  Filter-and-sum beamformer with adjustable filter characteristics , 2001, 2001 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.01CH37221).

[10]  A. W. M. van den Enden,et al.  Discrete Time Signal Processing , 1989 .

[11]  M. Schroeder Integrated‐impulse method measuring sound decay without using impulses , 1979 .

[12]  Harry L. Van Trees,et al.  Optimum Array Processing: Part IV of Detection, Estimation, and Modulation Theory , 2002 .

[13]  Walter Kellermann,et al.  HRTF-based robust least-squares frequency-invariant beamforming , 2015, 2015 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA).

[14]  Martin Holters,et al.  IMPULSE RESPONSE MEASUREMENT TECHNIQUES AND THEIR APPLICABILITY IN THE REAL WORLD , 2009 .

[15]  Alan V. Oppenheim,et al.  Discrete-time signal processing (2nd ed.) , 1999 .

[16]  Heinrich Kuttruff,et al.  Room Acoustics, Fourth Edition , 2000 .

[17]  Martin Schneider,et al.  Design of robust two-dimensional polynomial beamformers as a convex optimization problem with application to robot audition , 2017, 2017 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA).