Recognition of an Indian script using multilayer perceptrons and fuzzy features

Presents a multi-stage character recognition system for an Indian script, namely Bengali (also called Bangla) using fuzzy features and multilayer perceptrons (MLP). The fuzzy features are extracted from the Hough transform of a character pixel pattern. We first define a number of fuzzy sets on the Hough transform accumulator cells. The fuzzy sets are then combined by t-norms to generate feature vectors from each character. A set of fuzzy linguistic vectors is next generated from these feature vectors. The MLPs used for the classification have the fuzzy features as inputs. The MLP outputs also represent the "belongingness" of an input pattern to different fuzzy character pattern classes. To improve the recognition accuracy of Bengali characters, we divide all the patterns into three distinct sets. Each set of characters is once again divided into a number of mutually exclusive character pattern classes. During recognition, the class of each pattern is first determined, followed by recognition of the actual character within that class. The recognition accuracy of the system is more than 98%.

[1]  F. E. Terman,et al.  Integrated Electronics: Analog and Digital Circuits and Systems , 1972 .

[2]  Geoffrey E. Hinton,et al.  Learning internal representations by error propagation , 1986 .

[3]  Shamik Sural,et al.  An MLP using Hough transform based fuzzy feature extraction for Bengali script recognition , 1999, Pattern Recognit. Lett..

[4]  Richard P. Lippmann,et al.  An introduction to computing with neural nets , 1987 .

[5]  Bidyut Baran Chaudhuri,et al.  An OCR system to read two Indian language scripts: Bangla and Devnagari (Hindi) , 1997, Proceedings of the Fourth International Conference on Document Analysis and Recognition.

[6]  Josef Kittler,et al.  A survey of the hough transform , 1988, Comput. Vis. Graph. Image Process..

[7]  James L. McClelland,et al.  Parallel distributed processing: explorations in the microstructure of cognition, vol. 1: foundations , 1986 .

[8]  Bidyut Baran Chaudhuri,et al.  Automatic separation of machine-printed and hand-written text lines , 1999, Proceedings of the Fifth International Conference on Document Analysis and Recognition. ICDAR '99 (Cat. No.PR00318).

[9]  Bidyut B. Chaudhuri,et al.  Computer recognition of printed Bangla script , 1995 .

[10]  Bidyut Baran Chaudhuri,et al.  Segmentation of Bangla handwritten text into characters by recursive contour following , 1999, Proceedings of the Fifth International Conference on Document Analysis and Recognition. ICDAR '99 (Cat. No.PR00318).

[11]  George J. Klir,et al.  Fuzzy sets and fuzzy logic - theory and applications , 1995 .

[12]  Bidyut Baran Chaudhuri,et al.  A complete printed Bangla OCR system , 1998, Pattern Recognit..