The use of a one-stage dynamic programming algorithm for connected word recognition

This paper is of tutorial nature and describes a one-stage dynamic programming algorithm for file problem of connected word recognition. The algorithm to be developed is essentially identical to one presented by Vintsyuk [1] and later by Bridle and Brown [2] ; but the notation and the presentation have been clarified. The derivation used for optimally time synchronizing a test pattern, consisting of a sequence of connected words, is straightforward and simple in comparison with other approaches decomposing the pattern matching problem into several levels. The approach presented relies basically on parameterizing the time warping path by a single index and on exploiting certain path constraints both in the word interior and at the word boundaries. The resulting algorithm turns out to be significantly more efficient than those proposed by Sakoe [3] as well as Myers and Rabiner [4], while providing the same accuracy in estimating the best possible matching string. Its most important feature is that the computational expenditure per word is independent of the number of words in the input string. Thus, it is well suited for recognizing comparatively long word sequences and for real-time operation. Furthermore, there is no need to specify the maximum number of words in the input string. The practical implementation of the algorithm is discussed; it requires no heuristic rules and no overhead. The algorithm can be modified to deal with syntactic constraints in terms of a finite state syntax.

[1]  R. Bellman Dynamic programming. , 1957, Science.

[2]  T. K. Vintsyuk Speech discrimination by dynamic programming , 1968 .

[3]  T. K. Vintsyuk Element-wise recognition of continuous speech composed of words from a specified dictionary , 1971, CYBERNETICS.

[4]  F. Itakura,et al.  Minimum prediction residual principle applied to speech recognition , 1975 .

[5]  J. Baker,et al.  The DRAGON system--An overview , 1975 .

[6]  L. Rabiner,et al.  A statical decision approach to the recognition of connected digits , 1976 .

[7]  L. Rabiner,et al.  Statistical decision approach to the recognition of connected digits , 1976 .

[8]  Bruce T. Lowerre,et al.  The HARPY speech recognition system , 1976 .

[9]  S. Chiba,et al.  Dynamic programming algorithm optimization for spoken word recognition , 1978 .

[10]  H. Sakoe,et al.  Two-level DP-matching--A dynamic programming-based pattern matching algorithm for connected word recognition , 1979 .

[11]  R. Zelinski,et al.  A segmentation procedure for connected word recognition based on estimation principles , 1981, ICASSP.

[12]  C. Myers,et al.  A level building dynamic time warping algorithm for connected word recognition , 1981 .

[13]  J. Canning,et al.  LOGOS - A real time hardware continuous speech recognition system , 1982, ICASSP.

[14]  Michael D. Brown,et al.  An algorithm for connected word recognition , 1982, ICASSP.

[15]  James C. Spohrer,et al.  Partial traceback and dynamic programming , 1982, ICASSP.