A coarticulation model for continuous digit recognition

Between‐word coarticulation is one of the major problems in continuous speech recognition since it modifies the acoustic characteristics at the boundaries of words. With large vocabulary speech recognition the problem can be solved by introducing the concept of interword units. If the vocabulary is small enough (e.g., digits) all possible coarticulations between all words of the vocabulary can be modeled. In this study every digit is represented by three segments, namely, a core segment that can be assumed reasonably insensitive to any coarticulation effect and head and tail segments that represent, respectively, the initial and the final part of every work spoken in isolation. In addition, a set of juncture segments is defined that represent the junction between every possible pair of words. The recognition process is driven by a regular grammar that represents all the allowed segments sequences, namely, digits spoken in isolation as well as sequences of digits spoken continuously or with pauses between ...