Pole-zero modeling of speech based on high-order pole model fitting and decomposition method

In this paper four pole-zero modeling algorithms of clean and noisy speech have been studied in a unified approach that is based on high-order pole model fitting and decomposition method. They are autocorrelation prediction (AP), modified Yule-Walker (MYW), modified least square (MLS), and modified least square with autocorrelation compensation (MLSAC) methods. They involve only linear equations, and therefore are computationally efficient. Among these algorithms, the MLSAC method appears to be the most effective in spectral envelope estimation of noisy as well as clean speech. According to our simulation results, the improvement resulting from the use of the MLSAC pole-zero model for noisy speech is equivalent to increasing signal-to-noise ratio (SNR) by about 5 dB when SNR of input speech is 10 dB or less. The use of a pole-zero model in multirate vocoding is also discussed.

[1]  F. Brophy,et al.  Considerations of the Padé approximant technique in the synthesis of recursive digital filters , 1973 .

[2]  Steven Kay,et al.  A new ARMA spectral estimator , 1980 .

[3]  B. Atal,et al.  Speech analysis and synthesis by linear prediction of the speech wave. , 1971, The Journal of the Acoustical Society of America.

[4]  J. Shanks RECURSION FILTERS FOR DIGITAL PROCESSING , 1967 .

[5]  Thomas P. Barnwell A comparison of parametrically different objective speech quality measures using correlation analysis with subjective quality results , 1980, ICASSP.

[6]  Chong Un,et al.  The Residual-Excited Linear Prediction Vocoder with Transmission Rate Below 9.6 kbits/s , 1975, IEEE Trans. Commun..

[7]  I. Konvalinka,et al.  Simultaneous estimation of poles and zeros in speech analysis and ITIF-iterative inverse filtering algorithm , 1979 .

[8]  T. Barnwell Correlation analysis of subjective and objective measures for speech quality , 1980, ICASSP.

[9]  Leonard Weiss,et al.  Prony’s Method, Z-Transforms, and Padé Approximation , 1963 .

[10]  Alan V. Oppenheim,et al.  Speech analysis homomorphic prediction , 1977 .

[11]  R. Roberts,et al.  The use of second-order information in the approximation of discreate-time linear systems , 1976 .

[12]  J. Makhoul,et al.  Linear prediction: A tutorial review , 1975, Proceedings of the IEEE.

[13]  E. Parzen Some recent advances in time series modeling , 1974 .

[14]  Chong Kwan Un,et al.  Pole-zero modeling of noisy speech and its application to vocoding , 1982, ICASSP.

[15]  M. Atashroo Autocorrelation prediction , 1977 .

[16]  Craig K. Rushforth,et al.  Estimating the parameters of a noisy all-pole process using pole-zero modeling , 1979, ICASSP.

[17]  Will Gersch,et al.  Estimation of the autoregressive parameters of a mixed autoregressive moving-average time series , 1970 .

[18]  F. Itakura,et al.  Minimum prediction residual principle applied to speech recognition , 1975 .

[19]  K. Steiglitz On the simultaneous estimation of poles and zeros in speech analysis , 1977 .

[20]  Chong Un,et al.  On pole-zero modeling of speech , 1980, ICASSP.