论文信息 - HRTF-based robust least-squares frequency-invariant polynomial beamforming

HRTF-based robust least-squares frequency-invariant polynomial beamforming

In this work, we propose a robust Head-Related Transfer Function (HRTF)-based polynomial beamformer design which accounts for the influence of a humanoid robot's head on the sound field. In addition, it allows for a flexible steering of our previously proposed robust HRTF-based beamformer design. We evaluate the HRTF-based polynomial beamformer design and compare it to the original HRTF-based beamformer design by means of signal-independent measures as well as word error rates of an off-the-shelf speech recognition system. Our results confirm the effectiveness of the polynomial beam-former design, which makes it a promising approach to robust beam-forming for robot audition.

[1] Jon Barker,et al. An audio-visual corpus for speech perception and automatic speech recognition. , 2006, The Journal of the Acoustical Society of America.

[2] K. Abed-Meraim,et al. From binaural to multimicrophone blind source separation using fixed beamforming with HRTFs , 2012, 2012 19th International Conference on Systems, Signals and Image Processing (IWSSIP).

[3] Ning Ma,et al. The CHiME corpus: a resource and a challenge for computational hearing in multisource environments , 2010, INTERSPEECH.

[4] Walter Kellermann,et al. Design of robust superdirective beamformers as a convex optimization problem , 2009, 2009 IEEE International Conference on Acoustics, Speech and Signal Processing.

[5] Karim Abed-Meraim,et al. Blind source separation for robot audition using fixed HRTF beamforming , 2012, EURASIP Journal on Advances in Signal Processing.

[6] Lars Kai Hansen,et al. Semi-blind source separation using head-related transfer functions [speech signal separation] , 2004, 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[7] Alexander I. Rudnicky,et al. Pocketsphinx: A Free, Real-Time Continuous Speech Recognition System for Hand-Held Devices , 2006, 2006 IEEE International Conference on Acoustics Speech and Signal Processing Proceedings.

[8] Walter Kellermann,et al. On the Impact of Localization Errors on HRTF-based Robust Least-Squares Beamforming , 2016, ArXiv.

[9] Matti Hämäläinen,et al. Filter-and-sum beamformer with adjustable filter characteristics , 2001, 2001 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.01CH37221).

[10] A. W. M. van den Enden,et al. Discrete Time Signal Processing , 1989 .

[11] M. Schroeder. Integrated‐impulse method measuring sound decay without using impulses , 1979 .

[12] Harry L. Van Trees,et al. Optimum Array Processing: Part IV of Detection, Estimation, and Modulation Theory , 2002 .

[13] Walter Kellermann,et al. HRTF-based robust least-squares frequency-invariant beamforming , 2015, 2015 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA).

[14] Martin Holters,et al. IMPULSE RESPONSE MEASUREMENT TECHNIQUES AND THEIR APPLICABILITY IN THE REAL WORLD , 2009 .

[15] Alan V. Oppenheim,et al. Discrete-time signal processing (2nd ed.) , 1999 .

[16] Heinrich Kuttruff,et al. Room Acoustics, Fourth Edition , 2000 .

[17] Martin Schneider,et al. Design of robust two-dimensional polynomial beamformers as a convex optimization problem with application to robot audition , 2017, 2017 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA).