Segmentation of merged characters by neural networks and shortest path

Abstract A major problem with a neural network-based approach to printed character recognition is the segmentation of merged characters. A hybrid method is proposed which combines a neural network-based deferred segmentation scheme with conventional immediate segmentation techniques. In the deferred segmentation, a neural network is employed to distinguish single characters from composites. To find a proper vertical cut that separates a composite, a shortest-path algorithm seeking minimal-penalty curved cuts is used. Integrating those components with a multiresolution neural network OCR and an efficient spelling checker, the resulting system significantly improves its ability to read omnifont document text.

[1]  Charles L. Wilson,et al.  Methods for enhancing neural network handwritten character recognition , 1991, IJCNN-91-Seattle International Joint Conference on Neural Networks.

[2]  Lindsay J. Evett,et al.  Fast dictionary look-up for contextual word recognition , 1990, Pattern Recognit..

[3]  S. Ohhashi,et al.  Alphanumeric character recognition using a connectionist model with the pocket algorithm , 1989, International 1989 Joint Conference on Neural Networks.

[4]  R.M.K. Sinha,et al.  Integrating word level knowledge in text recognition , 1990, [1990] Proceedings. 10th International Conference on Pattern Recognition.

[5]  K Fukushima,et al.  Handwritten alphanumeric character recognition by the neocognitron , 1991, IEEE Trans. Neural Networks.

[6]  Lawrence D. Jackel,et al.  Reading handwritten digits: a ZIP code recognition system , 1992, Computer.

[7]  Theodosios Pavlidis,et al.  On the Recognition of Printed Characters of Any Font and Size , 1987, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[8]  James D. Keeler,et al.  Integrated Segmentation and Recognition of Hand-Printed Numerals , 1990, NIPS.

[9]  Yann LeCun,et al.  Handwritten zip code recognition with multilayer networks , 1990, [1990] Proceedings. 10th International Conference on Pattern Recognition.

[10]  Nobuyasu Itoh,et al.  A spelling correction method and its application to an OCR system , 1990, Pattern Recognit..

[11]  Jin Wang,et al.  Weight smoothing to improve network generalization , 1994, IEEE Trans. Neural Networks.

[12]  Kunihiko Fukushima,et al.  Character recognition with selective attention , 1991, IJCNN-91-Seattle International Joint Conference on Neural Networks.

[13]  Chung-Hsien Wu,et al.  A shunting multilayer perceptron network for confusing/composite pattern recognition , 1991, Pattern Recognit..

[14]  Jin Wang,et al.  Resolving multifont character confusion with neural networks , 1993, Pattern Recognit..

[15]  Y. Kimura Distorted handwritten Kanji character pattern recognition by a learning algorithm minimizing output variation , 1991, IJCNN-91-Seattle International Joint Conference on Neural Networks.

[16]  Jack L. Meador,et al.  Encoding a priori information in feedforward networks , 1991, Neural Networks.

[17]  Mohamad T. Musavi,et al.  A neural network approach to character recognition , 1989, Neural Networks.