Pitch-synchronous wavelet representations of speech and music signals

A new wavelet representation is explored. The transform is based on a pitch-synchronous vector representation and adapts to the oscillatory or aperiodic characteristics of the signal. Pseudo-periodic signals are represented in terms of an asymptotically periodic trend and aperiodic fluctuations at several scales. Over totally aperiodic signal segments, the transform reverts to the ordinary wavelet transform. The pitch-synchronous wavelet transform is particularly suited to the analysis, rate-reduction coding, and synthesis of speech signals, and it may serve as a preprocessing block in automatic speech recognition systems. Feature extraction, such as the separation of voice from noise in voiced consonants, is easily performed by means of partial wavelet expansions. A stochastic model of the aperiodic fluctuations is proposed.
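
The decomposition into a periodic trend plus fluctuations at several scales can be illustrated with a minimal sketch. It is not the paper's implementation: it assumes a fixed, known integer pitch period, uses the Haar filter pair for simplicity, and the function names (`haar_step`, `ps_haar_analysis`, `ps_haar_synthesis`) are hypothetical. The signal is cut into one-period vectors, and a wavelet transform is applied across successive periods, sample position by sample position.

```python
import math

def haar_step(rows):
    """One Haar level across consecutive period vectors (rows).
    Returns (approx_rows, detail_rows), each with half as many rows."""
    s = 1.0 / math.sqrt(2.0)
    approx, detail = [], []
    for a, b in zip(rows[0::2], rows[1::2]):
        approx.append([s * (x + y) for x, y in zip(a, b)])
        detail.append([s * (x - y) for x, y in zip(a, b)])
    return approx, detail

def ps_haar_analysis(signal, period, levels):
    """Pitch-synchronous Haar analysis.
    Assumption: `period` is a fixed, known integer number of samples."""
    n_periods = len(signal) // period
    if n_periods == 0 or n_periods % (2 ** levels) != 0:
        raise ValueError("number of periods must be a multiple of 2**levels")
    # Pitch-synchronous vector representation: one row per period.
    rows = [list(signal[k * period:(k + 1) * period]) for k in range(n_periods)]
    details = []
    for _ in range(levels):
        rows, d = haar_step(rows)
        details.append(d)  # fluctuations, finest scale first
    return rows, details   # rows: coarse (slowly varying, near-periodic) trend

def ps_haar_synthesis(approx, details):
    """Inverse of ps_haar_analysis; returns the flat signal."""
    s = 1.0 / math.sqrt(2.0)
    rows = approx
    for d in reversed(details):
        merged = []
        for a, w in zip(rows, d):
            merged.append([s * (x + y) for x, y in zip(a, w)])
            merged.append([s * (x - y) for x, y in zip(a, w)])
        rows = merged
    return [x for row in rows for x in row]
```

For an exactly periodic input, every detail coefficient is zero and the whole signal is carried by the coarse rows; departures from periodicity show up as nonzero details. With `period = 1` the sketch reduces to an ordinary Haar wavelet transform, which mirrors the behavior described for aperiodic segments.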
