Analysis of distortion in audio signals introduced by microphone motion

Signals recorded by microphones form the basis for a wide range of audio signal processing systems. In some applications, such as humanoid robots, the microphones may be moving while recording the audio signals. A common practice is to assume that the microphone is stationary within a short time frame. Although this assumption may be reasonable under some conditions, there is currently no theoretical framework that predicts the level of signal distortion due to motion as a function of system parameters. This paper presents such a framework, for linear and circular microphone motion, providing upper bounds on the motion-induced distortion, and showing that the dependence of this upper bound on motion speed, signal frequency, and time-frame duration, is linear. A simulation study of a humanoid robot rotating its head while recording a speech signal validates the theoretical results.

[1]  M. Zoltowski,et al.  Spatial spectrum estimation in a coherent signal environment using an array in motion , 1986 .

[2]  William M. Carey,et al.  Application of synthetic‐aperture processing to towed‐array data , 1989 .

[3]  Jonathan G. Fiscus,et al.  Darpa Timit Acoustic-Phonetic Continuous Speech Corpus CD-ROM {TIMIT} | NIST , 1993 .

[4]  John A. Fawcett,et al.  Synthetic aperture processing for a towed array and a moving source , 1993 .

[5]  Anthony J. Weiss,et al.  Localization of multiple sources with moving arrays , 1998, IEEE Trans. Signal Process..

[6]  K. W. Lo,et al.  Flight path estimation using frequency measurements from a wide aperture acoustic array , 2001 .

[7]  Chungyong Lee,et al.  Temporal domain processing for a synthetic aperture array , 2002 .

[8]  Jean Rouat,et al.  Localization of simultaneous moving sound sources for mobile robot using a frequency- domain steered beamformer approach , 2004, IEEE International Conference on Robotics and Automation, 2004. Proceedings. ICRA '04. 2004.

[9]  Martin Vetterli,et al.  Dynamic measurement of room impulse responses using a moving microphone. , 2007, The Journal of the Acoustical Society of America.

[10]  Francesco Ripamonti,et al.  Moving microphone arrays to reduce spatial aliasing in the beamforming technique: theoretical background and numerical investigation. , 2008, The Journal of the Acoustical Society of America.

[11]  Gerald Enzner,et al.  Perfect sequence lms for rapid acquisition of continuous-azimuth head related impulse responses , 2009, 2009 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics.

[12]  M A Poletti Series expansions of rotating two and three dimensional sound fields. , 2010, The Journal of the Acoustical Society of America.

[13]  Enson Chang,et al.  Irregular array motion and extended integration for the suppression of spatial aliasing in passive sonar. , 2011, The Journal of the Acoustical Society of America.

[14]  Zoran Cvetkovic,et al.  Multichannel Dereverberation Theorems and Robustness Issues , 2012, IEEE Transactions on Audio, Speech, and Language Processing.

[15]  Martin Vetterli,et al.  Sampling and Reconstruction of Spatial Fields Using Mobile Sensors , 2012, IEEE Transactions on Signal Processing.

[16]  George B. Arfken Chapter 16 – Angular Momentum , 2013 .

[17]  Anthony J. Weiss,et al.  Single Receiver Emitter Geolocation Based on Signal Periodicity With Oscillator Instability , 2014, IEEE Transactions on Signal Processing.

[18]  Sascha Spors,et al.  Identification of dynamic acoustic systems by orthogonal expansion of time-variant impulse responses , 2014, 2014 6th International Symposium on Communications, Control and Signal Processing (ISCCSP).

[19]  Boaz Rafaely,et al.  Theoretical Framework for the Optimization of Microphone Array Configuration for Humanoid Robot Audition , 2014, IEEE/ACM Transactions on Audio, Speech, and Language Processing.

[20]  Boaz Rafaely,et al.  Direction of Arrival Estimation Using Microphone Array Processing for Moving Humanoid Robots , 2015, IEEE/ACM Transactions on Audio, Speech, and Language Processing.