Enhanced Modulation Spectrum Using Space-Time Averaging For In-Building Acoustic Signature Identification

For most buildings, virtually all subsystems such as fans, generators, and motors generate acoustic energy. This acoustic energy can weakly penetrate walls and pass through hallways and conduits. The propagation paths are complex and, given the typically low energy of received acoustic signals, present challenges for the detection and classification of the subsystems. While conventional approaches would not be expected to work under these difficult conditions, a key observation can be made: these types of subsystems produce line spectra which consist of harmonics of a fundamental frequency. Acoustic propagation effects then strongly affect the relative energy of these harmonics. Modulation spectra, which make use of the frequency spacing instead of the relative energy of the harmonics, are especially insensitive to these frequency-dependent acoustic attenuation affects. When combined with temporal averaging and 3-dimensional spatial (over an array of acoustic sensors) processing, enhanced modulation spectra offer a new approach to the detection and classification of building subsystems which produce sound

[1]  Steven Greenberg,et al.  Robust speech recognition using the modulation spectrogram , 1998, Speech Commun..

[2]  Douglas E. Sturim,et al.  Automatic dysphonia recognition using biologically-inspired amplitude-modulation features , 2005, Proceedings. (ICASSP '05). IEEE International Conference on Acoustics, Speech, and Signal Processing, 2005..

[3]  Nima Mesgarani,et al.  Speech discrimination based on multiscale spectro-temporal modulations , 2004, 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[4]  Les E. Atlas,et al.  Improved modulation spectrum through multi-scale modulation frequency decomposition , 2005, Proceedings. (ICASSP '05). IEEE International Conference on Acoustics, Speech, and Signal Processing, 2005..

[5]  Les E. Atlas,et al.  Modulation-scale analysis for content identification , 2004, IEEE Transactions on Signal Processing.

[6]  Hynek Hermansky,et al.  Should recognizers have ears? , 1998, Speech Commun..

[7]  William A. Gardner,et al.  Statistical spectral analysis : a nonprobabilistic theory , 1986 .

[8]  Khaled H. Hamed,et al.  Time-frequency analysis , 2003 .

[9]  D. H. Kil,et al.  Pattern recognition and prediction with applications to signal characterization , 1996 .