A comparative study of several dynamic time-warping algorithms for connected-word recognition

Several different algorithms have been proposed for time registering a test pattern and a concatenated (isolated word) sequence of reference patterns for automatic connected-word recognition. These algorithms include the two-level, dynamic programming algorithm, the sampling approach and the level-building approach. In this paper, we discuss the theoretical differences and similarities among the various algorithms. An experimental comparison of these algorithms for a connected-digit recognition task is also given. The comparison shows that for typical applications, the level-building algorithm performs better than either the two-level DP matching or the sampling algorithm.

[1]  Lalit R. Bahl,et al.  Decoding for channels with insertions, deletions, and substitutions with applications to speech recognition , 1975, IEEE Trans. Inf. Theory.

[2]  F. Itakura,et al.  Minimum prediction residual principle applied to speech recognition , 1975 .

[3]  T.B. Martin,et al.  Practical applications of voice input to machines , 1976, Proceedings of the IEEE.

[4]  A. E. Rosenberg,et al.  Evaluation of an automatic word recognition system over dialed‐up telephone lines , 1976 .

[5]  M. B. Herscher,et al.  Source data entry using voice input , 1976, ICASSP.

[6]  N. Dixon,et al.  A comparison of several speech-spectra classification methods , 1976 .

[7]  Aaron E. Rosenberg,et al.  Considerations in dynamic time warping algorithms for discrete word recognition , 1978 .

[8]  S. Levinson,et al.  Considerations in dynamic time warping algorithms for discrete word recognition , 1978 .

[9]  H. Sakoe,et al.  Two-level DP-matching--A dynamic programming-based pattern matching algorithm for connected word recognition , 1979 .

[10]  Aaron E. Rosenberg,et al.  Interactive clustering techniques for selecting speaker-independent reference templates for isolated word recognition , 1979 .

[11]  Aaron E. Rosenberg,et al.  Speaker-independent recognition of isolated words using clustering techniques , 1979 .

[12]  S. Moshier Talker‐independent speech recognition in commercial environments , 1979 .

[13]  Lawrence R. Rabiner,et al.  Application of dynamic time warping to connected digit recognition , 1980 .

[14]  J. G. Wilpon,et al.  A voice-controlled, repertory-dialer system , 1980, The Bell System Technical Journal.

[15]  B. Aldefeld,et al.  Automated directory listing retrieval system based on isolated word recognition , 1980, Proceedings of the IEEE.

[16]  Lawrence R. Rabiner,et al.  Connected digit recognition using a level-building DTW algorithm , 1981 .

[17]  C. Myers,et al.  A level building dynamic time warping algorithm for connected word recognition , 1981 .

[18]  Lawrence R. Rabiner,et al.  Connected word recognition using a level building dynamic time warping algorithm , 1981, ICASSP.