Form Analysis by Neural Classification of Cells

Our aim in this paper is to present a generic approach for linearly combining multi neural classifier for cell analysis of forms. This approach can be applied in a preprocessing step in order to highlight the different kind of information filled in the form and to determine the appropriate treatment. Features used for the classification are relative to the text orientation and to its character morphology. Eight classes are extracted among numeric, alphabetic, vertical, horizontal, capitals, etc. Classifiers are multi-layered perceptrons considering firstly global features and refining the classification at each step by looking for more precise features. The recognition rate of the classifiers for 3. 500 cells issued from 19 forms is about 91%.

[1]  Yuki Hirayama Analyzing form images by using line-shared-adjacent cell relations , 1996, Proceedings of 13th International Conference on Pattern Recognition.

[2]  Thomas Risse,et al.  Hough transform for line recognition: Complexity of evidence accumulation and cluster detection , 1989, Comput. Vis. Graph. Image Process..

[3]  Hiroyuki Arai,et al.  Form processing based on background region analysis , 1997, Proceedings of the Fourth International Conference on Document Analysis and Recognition.

[4]  Yolande Belaïd,et al.  Form Item Extraction Based on Line Searching , 1995, GREC.

[5]  Yasuto Ishitani,et al.  Flexible and Robust Model Matching based on Association Graph for Form Image Understanding , 1995, Proceedings of 3rd International Conference on Document Analysis and Recognition.

[6]  Shigeyoshi Shimotsuji,et al.  Form identification based on cell structure , 1996, Proceedings of 13th International Conference on Pattern Recognition.

[7]  Yuan Yan Tang,et al.  Four directional adjacency graphs (FDAG) and their application in locating fields in forms , 1995, Proceedings of 3rd International Conference on Document Analysis and Recognition.