Discrete all-pole modeling

A method is introduced for parametric modeling of spectral envelopes when only a discrete set of spectral points is given. This method, called discrete all-pole (DAP) modeling, uses a discrete version of the Itakura-Saito distortion measure as its error criterion. One result is an autocorrelation matching condition that overcomes the limitations of linear prediction and produces better-fitting spectral envelopes for spectra that are representable by a relatively small discrete set of values, such as in voiced speech. An iterative algorithm for DAP modeling is presented and shown to converge to a unique global minimum. Results of applying DAP modeling to real and synthetic speech are also presented. DAP modeling is extended to allow frequency-dependent weighting of the error measure, so that spectral accuracy can be enhanced in certain frequency regions.
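
As a minimal illustration of the error criterion named above, the following Python sketch evaluates a discrete Itakura-Saito distortion between a set of spectral samples and an all-pole model evaluated at the same frequencies. The form used here, the mean of P/P_hat - ln(P/P_hat) - 1 over the given points, is the usual Itakura-Saito expression restricted to a discrete frequency set; it is an assumption for illustration, not a formula quoted from the paper. The function names, the example harmonics (multiples of 200 Hz at an 8 kHz sampling rate), and the filter coefficients are all hypothetical. The sketch does not implement the iterative DAP algorithm itself.

```python
import numpy as np

def allpole_spectrum(a, gain, omegas):
    """Power spectrum gain / |A(e^{jw})|^2 at the given frequencies.

    a      : coefficients [1, a_1, ..., a_p] of A(z) = sum_k a_k z^{-k}
    gain   : model gain (numerator of the power spectrum)
    omegas : frequencies in radians per sample
    """
    a = np.asarray(a, dtype=float)
    omegas = np.asarray(omegas, dtype=float)
    k = np.arange(len(a))
    # Evaluate A(e^{jw}) at every frequency: rows index w, columns index k.
    A = np.exp(-1j * np.outer(omegas, k)) @ a
    return gain / np.abs(A) ** 2

def discrete_itakura_saito(P, P_hat):
    """Mean of P/P_hat - ln(P/P_hat) - 1 over the discrete frequency set.

    Assumed form of the discrete measure; it is zero only when P == P_hat
    at every point and positive otherwise.
    """
    ratio = np.asarray(P, dtype=float) / np.asarray(P_hat, dtype=float)
    return float(np.mean(ratio - np.log(ratio) - 1.0))

# Hypothetical example: harmonics of a 200 Hz pitch at an 8 kHz sampling rate.
fs, f0 = 8000.0, 200.0
omegas = 2.0 * np.pi * f0 * np.arange(1, 20) / fs

# "Given" spectral points, generated here from a known one-pole model.
P = allpole_spectrum([1.0, -0.9], 1.0, omegas)

# Distortion against the matching model and against a mismatched one.
d_same = discrete_itakura_saito(P, allpole_spectrum([1.0, -0.9], 1.0, omegas))
d_other = discrete_itakura_saito(P, allpole_spectrum([1.0, -0.5], 1.0, omegas))
print(d_same, d_other)
```

Because the measure is evaluated only at the harmonic frequencies, a model is penalized solely for mismatch at the points where the spectrum is actually known, which is the situation the abstract describes for voiced speech.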
