Analysis of speech signals of short pitch period by a sample-selective linear prediction

The conventional linear prediction analysis has difficulties in estimating the vocal tract characteristics of voiced sounds uttered by females or children. This paper shows that the vocal tract characteristics of those speech signals can be estimated accurately by the sample-selective linear prediction (SSLP) method proposed by the authors. The SSLP is a two-stage linear prediction analysis employing only relevant sample values in the second stage analysis, while the conventional linear prediction method employs all the sample values with equal weights as predicted values. The accuracy of the proposed method in estimating formant frequencies is examined on synthetic vowels of short pitch periods. The validity of the method is confirmed by inspecting the estimated spectral envelopes and distributions of the estimated formant frequencies of natural vowels uttered by a female.

[1]  Nobuhiro Miki,et al.  A speech analysis algorithm which eliminates the influence of pitch using the model reference adaptive system , 1982 .

[2]  Bishnu S. Atal,et al.  Linear prediction analysis of speech based on a pole-zero representation. , 1975, The Journal of the Acoustical Society of America.

[3]  Riichiro Mizoguchi,et al.  A sample selective linear prediction analysis of speech , 1984, ICASSP.

[4]  S. Chandra,et al.  Experimental comparison between stationary and nonstationary formulations of linear prediction applied to voiced speech analysis , 1974 .

[5]  Kenneth Steiglitz,et al.  The use of time-domain selection for improved linear prediction , 1977 .

[6]  Hiroya Fujisaki,et al.  Proposal and evaluation of models for the glottal source waveform , 1986, ICASSP '86. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[7]  H. Strube Determination of the instant of glottal closure from the speech wave. , 1974, The Journal of the Acoustical Society of America.

[8]  K. Steiglitz On the simultaneous estimation of poles and zeros in speech analysis , 1977 .

[9]  Alan V. Oppenheim,et al.  Signal analysis by homomorphic prediction , 1976 .

[10]  A. Rosenberg Effect of glottal pulse shape on the quality of natural vowels. , 1969 .

[11]  B. Atal,et al.  Speech analysis and synthesis by linear prediction of the speech wave. , 1971, The Journal of the Acoustical Society of America.

[12]  H. Fujisaki,et al.  Adaptive analysis of speech based on a pole-zero representation , 1982 .

[13]  H. K. Dunn Methods of Measuring Vowel Formant Bandwidths , 1961 .