Classification of forms with handwritten fields by planar hidden Markov models

In this article, we present a method for modelling the physical structure of forms with handwritten fields by means of pseudo-bidimensional hidden Markov models (PHMMs). This description is then used for the automatic classification of form types. The physical structure of a class of forms is characterized by its significant rectangles. Because the documents contain handwritten fields, the position and dimensions of these rectangles are variable. Moreover, merging and fragmentation phenomena induce additional variability in the number of rectangles. PHMM-based modelling is developed and appears to be a suitable tool for handling the 2D random variability that arises in the automatic classification of forms.
