Hybrid segmental-LVQ/HMM for large vocabulary speech recognition

The authors have assessed the possibility of modeling phone trajectories to accomplish speech recognition. This approach has been considered as one of the ways to model context-dependency in speech recognition based on the acoustic variability of phones in the current database. A hybrid segmental learning vector quantization/hidden Markov model (SLVQ/HMM) system has been developed and evaluated on a telephone speech database. The authors have obtained 85.27% correct phrase recognition with SLVQ alone. By combining the likelihoods issued by SLVQ and by HMM, the authors have obtained 94.5% correct phrase recognition, a small improvement over that obtained with HMM alone.<<ETX>>

[1]  N. Sedgwick,et al.  A method for segmenting acoustic patterns, with applications to automatic speech recognition , 1977 .

[2]  T. Crystal,et al.  Segmental durations in connected‐speech signals: Current results , 1988 .

[3]  D. O'Shaughnessy,et al.  A*-admissible heuristics for rapid lexical access , 1991, [Proceedings] ICASSP 91: 1991 International Conference on Acoustics, Speech, and Signal Processing.

[4]  Shigeru Katagiri,et al.  A new connected word recognition algorithm based on HMM/LVQ segmentation and LVQ classification , 1991, [Proceedings] ICASSP 91: 1991 International Conference on Acoustics, Speech, and Signal Processing.

[5]  D. O'Shaughnessy,et al.  Using phoneme duration and energy contour information to improve large vocabulary isolated-word recognition , 1991, [Proceedings] ICASSP 91: 1991 International Conference on Acoustics, Speech, and Signal Processing.