论文信息 - Use of periodicity and jitter as speech recognition features

Use of periodicity and jitter as speech recognition features

We investigate a class of features related to voicing parameters that indicate whether the vocal chords are vibrating. Features describing voicing characteristics of speech signals are integrated with an existing 38-dimensional feature vector consisting of first and second order time derivatives of the frame energy and of the cepstral coefficients with their first and second derivatives. HMM-based connected digit recognition experiments comparing the traditional and extended feature sets show that voicing features and spectral information are complementary and that improved speech recognition performance is obtained by combining the two sources of information.

David L. Thomson | Rathinavelu Chengalvarayan | R. Chengalvarayan | D. Thomson

[1] Lawrence R. Rabiner,et al. A pattern recognition approach to voiced-unvoiced-silence classification with applications to speech recognition , 1976 .

[2] Biing-Hwang Juang,et al. The segmental K-means algorithm for estimating parameters of hidden Markov models , 1990, IEEE Trans. Acoust. Speech Signal Process..

[3] Joseph Picone,et al. Fast and accurate pitch detection using pattern recognition and adaptive time-domain analysis , 1986, ICASSP '86. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[4] G. G. Stokes. "J." , 1890, The New Yale Book of Quotations.

[5] Biing-Hwang Juang,et al. Minimum classification error rate methods for speech recognition , 1997, IEEE Trans. Speech Audio Process..

[6] Jean Schoentgen,et al. Predictable and random components of jitter , 1997, Speech Commun..

[7] Biing-Hwang Juang,et al. Context‐dependent acoustic subword modeling for connected digit recognition , 1993 .

[8] Jay G. Wilpon,et al. Discriminative feature selection for speech recognition , 1993, Comput. Speech Lang..

[9] Wu Chou,et al. Signal conditioned minimum error rate training , 1995, EUROSPEECH.