A Fast Statistical Mixture Algorithm for On-Line Handwriting Recognition

The automatic recognition of online handwriting is considered from an information theoretic viewpoint. Emphasis is placed on the recognition of unconstrained handwriting, a general combination of cursively written word fragments and discretely written characters. Existing recognition algorithms, such as elastic matching, are severely challenged by the variability inherent in unconstrained handwriting. This motivates the development of a probabilistic framework suitable for the derivation of a fast statistical mixture algorithm. This algorithm exhibits about the same degree of complexity as elastic matching, while being more flexible and potentially more robust. The approach relies on a novel front-end processor that, unlike conventional character or stroke-based processing, articulates around a small elementary unit of handwriting called a frame. The algorithm is based on (1) producing feature vectors representing each frame in one (or several) feature spaces, (2) Gaussian K-means clustering in these spaces, and (3) mixture modeling, taking into account the contributions of all relevant clusters in each space. The approach is illustrated by a simple task involving an 81-character alphabet. Both writer-dependent and writer-independent recognition results are found to be competitive with their elastic matching counterparts. >

[1]  W. W. Bledsoe,et al.  Review of "Problem-Solving Methods in Artificial Intelligence by Nils J. Nilsson", McGraw-Hill Pub. , 1971, SGAR.

[2]  L. Baum,et al.  An inequality and associated maximization technique in statistical estimation of probabilistic functions of a Markov process , 1972 .

[3]  Charles C. Tappert Speed, Accuracy, and Flexibility Trade-Offs in on-Line Character Recognition , 1991, Int. J. Pattern Recognit. Artif. Intell..

[4]  Jerome R. Bellegarda,et al.  Tied mixture continuous parameter modeling for speech recognition , 1990, IEEE Trans. Acoust. Speech Signal Process..

[5]  Lalit R. Bahl,et al.  A Maximum Likelihood Approach to Continuous Speech Recognition , 1983, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[6]  Lambertus Schomaker Un-supervised learning of prototype allographs in cursive script recognition using invariant handwriting features , 1991 .

[7]  Nils J. Nilsson,et al.  Problem-solving methods in artificial intelligence , 1971, McGraw-Hill computer science series.

[8]  Ching Y. Suen,et al.  The State of the Art in Online Handwriting Recognition , 1990, IEEE Trans. Pattern Anal. Mach. Intell..

[9]  John A. Hartigan,et al.  Clustering Algorithms , 1975 .

[10]  L. Baum,et al.  A Maximization Technique Occurring in the Statistical Analysis of Probabilistic Functions of Markov Chains , 1970 .

[11]  Robert M. Haralick,et al.  Optimal Sensor and Light Source Positioning for Machine Vision , 1995, Comput. Vis. Image Underst..

[12]  F. Jelinek Fast sequential decoding algorithm using a stack , 1969 .