A fuzzy model for unsupervised character classification

Abstract This paper presents a fuzzy logic approach to efficiently perform unsupervised character classification for improvement in robustness, correctness, and speed of a character recognition system. The characters are first split into seven typographical categories. The classification scheme uses pattern matching to classify the characters in each category into a set of fuzzy prototypes based on a nonlinear weighted similarity function. The fuzzy unsupervised character classification, which is natural in the representation of prototypes for character matching, is developed and a weighted fuzzy similarity measure is explored. The characteristics of the fuzzy model are discussed and used in speeding up the classification process. After classification, the character recognition which is simply applied on a smaller set of the fuzzy prototypes, becomes much easier and less time-consuming.

[1]  Mindy Bokser,et al.  Omnidocument technologies , 1992, Proc. IEEE.

[2]  Robert J. Schalkoff,et al.  Pattern recognition - statistical, structural and neural approaches , 1991 .

[3]  Friedrich M. Wahl,et al.  Document Analysis System , 1982, IBM J. Res. Dev..

[4]  Norihiro Hagita,et al.  Automated entry system for printed documents , 1990, Pattern Recognit..

[5]  Eberhard Mandler,et al.  Document analysis-from pixels to contents , 1992 .

[6]  S. Impedovo,et al.  Optical Character Recognition - a Survey , 1991, Int. J. Pattern Recognit. Artif. Intell..

[7]  A. Kandel Fuzzy Mathematical Techniques With Applications , 1986 .

[8]  Frank Y. Shih,et al.  A document segmentation, classification and recognition system , 1992, Proceedings of the Second International Conference on Systems Integration.