Time-Scale Modification of Complex Acoustic Signals in Noise

Abstract : A new approach is introduced for time-scale modification of short- duration complex acoustic signals to improve their audibility. The method preserves an approximate time-scaled temporal envelope of a signal, thus capitalizing on the perceptual importance of the signal's temporal structure, while also maintaining the character of a noise background. The basis for the approach is a subband signal representation, derived from a filter bank analysis/synthesis, the channel phases of which are controlled to shape the temporal envelope of the time-scaled signal. Channel amplitudes and filter bank inputs are selected to shape the spectrum and correlation of the time-scaled background. The phase, amplitude, and input control are derived from locations of events that occur within filter bank outputs. A frame-based generalization of the method imposes phase consistency and background noise continuity across consecutive synthesis frames. The approach and its derivatives are applied to synthetic and actual complex acoustic signals consisting of closely spaced sequential time components. Time-Scale modification, Slow-motion audio replay, Complex acoustic signal, Signal enhancement, Noise background preservation, Improved audibility, Temporal envelope, Filter bank analysis/synthesis, Sine-wave.

[1]  Thomas F. Quatieri,et al.  Speech analysis/Synthesis based on a sinusoidal representation , 1986, IEEE Trans. Acoust. Speech Signal Process..

[2]  Albert S. Bregman,et al.  The Auditory Scene. (Book Reviews: Auditory Scene Analysis. The Perceptual Organization of Sound.) , 1990 .

[3]  A. Wilgus,et al.  High quality time-scale modification for speech , 1985, ICASSP '85. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[4]  Thomas E. Hanna Contributions of Envelope Information to Classification of Brief Sounds , 1990 .

[5]  Robert B. Dunn,et al.  Time-scale modification of complex acoustic signals , 1993, 1993 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[6]  Stéphane Mallat,et al.  Singularity detection and processing with wavelets , 1992, IEEE Trans. Inf. Theory.

[7]  Xavier Serra,et al.  A system for sound analysis/transformation/synthesis based on a deterministic plus stochastic decomposition , 1989 .

[8]  Thomas F. Quatieri,et al.  Shape invariant time-scale and pitch modification of speech , 1992, IEEE Trans. Signal Process..

[9]  Thomas F. Quatieri,et al.  Object detection by two-dimensional linear prediction , 1983, ICASSP.

[10]  David M. Rubin,et al.  Use of forecasting signatures to help distinguish periodicity, randomness, and chaos in ripples and other spatial patterns. , 1992, Chaos.

[11]  Jack Cartinhour An iterative time-frequency domain algorithm for reduction of peak-to-rms ratio , 1992, Digit. Signal Process..

[12]  R. J. McAulay,et al.  Speech transformations based on a sinusoidal representation , 1985, ICASSP '85. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[13]  Eric Moulines,et al.  HNS: Speech modification based on a harmonic+noise model , 1993, 1993 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[14]  Mark Dolson,et al.  The Phase Vocoder: A Tutorial , 1986 .

[15]  Thomas F. Quatieri,et al.  Phase coherence in speech reconstruction for enhancement and coding applications , 1989, International Conference on Acoustics, Speech, and Signal Processing,.

[16]  B. Blesser,et al.  Audio dynamic range compression for minimum perceived distortion , 1969 .

[17]  Robert B. Dunn,et al.  Time-scale modification with temporal envelope invariance , 1993, Proceedings of IEEE Workshop on Applications of Signal Processing to Audio and Acoustics.