East-Asian characters possess a rich hierarchical structure with each character comprising a unique spatial arrangement of radicals (sub-characters). In this paper, we present a new radical based approach for scaling neural network (NN) recognizers to thousands of East-Asian characters. The proposed off-line character recognizer comprises neural networks arranged in a graph. Each NN is one of three types: a radical-at-location (RAL) recognizer, a gater, or a combiner. Each radical-atlocation NN is a convolutional neural network that is designed to processes the whole character image and recognize radicals at a specific location in the character. Example locations include left-half, right-half, top-half, bottom-half, left-top quadrant, bottom-right quadrant, etc. Segmentation is completely avoided by allowing each RAL classifier to process the whole character image. Gater-NNs reduce the number of NNs that need to be evaluated at runtime and combiner-NNs combine RAL classifier outputs for final recognition. The proposed approach is tested on a real-world dataset containing 13.4 million handwritten Chinese character samples from 3665 classes. Experimental results indicate that the proposed approach scales well and achieves a low error rate.
[1]
Jun S. Huang,et al.
A transformation invariant matching algorithm for handwritten chinese character recognition
,
1990,
Pattern Recognit..
[2]
Korris Fu-Lai Chung,et al.
Offline handwritten Chinese character recognition via radical extraction and recognition
,
1997,
Proceedings of the Fourth International Conference on Document Analysis and Recognition.
[3]
Kuo-Chin Fan,et al.
Optical recognition of handwritten Chinese characters by hierarchical radical matching method
,
2001,
Pattern Recognit..
[4]
Daming Shi,et al.
Offline handwritten Chinese character recognition by radical decomposition
,
2003,
TALIP.
[5]
Patrice Y. Simard,et al.
Best practices for convolutional neural networks applied to visual document analysis
,
2003,
Seventh International Conference on Document Analysis and Recognition, 2003. Proceedings..
[6]
Masaki Nakagawa,et al.
'Online recognition of Chinese characters: the state-of-the-art
,
2004,
IEEE Transactions on Pattern Analysis and Machine Intelligence.