Research on Chinese characters recognition in complex background images

In this paper, an automatic recognition method of Chinese characters in static complex background is proposed and studied. Firstly, an feature extraction model combing Gabor filter and Sobel operator to is designed to get Chinese characters area. Secondly, K-means clustering algorithm is used to distinguish between the character area and the background area. Then, according to structure characteristics of Chinese character, two effective parameters defined as stroke across and the energy density are designed to recognize Chinese characters. Specific method is comparative analysis of the characters to be identified and the standard font. Finally, a set of experimental data is selected to carry out experiment and the experimental results approve that the proposed Chinese characters recognition method targeted at Text region in complex background has a good adaptability and robustness.

[1]  Xiaoyan Gu,et al.  Detecting Uyghur text in complex background images with convolutional neural network , 2017, Multimedia Tools and Applications.

[2]  Xue Gao,et al.  A New Feature Optimization Method Based on Two-Directional 2DLDA for Handwritten Chinese Character Recognition , 2011, 2011 International Conference on Document Analysis and Recognition.

[3]  Qi Li,et al.  Recognition of Offline Handwritten Chinese Characters Using the Tesseract Open Source OCR Engine , 2016, 2016 8th International Conference on Intelligent Human-Machine Systems and Cybernetics (IHMSC).

[4]  Ching Y. Suen,et al.  Application of a Multilayer Decision Tree in Computer Recognition of Chinese Characters , 1983, IEEE Transactions on Pattern Analysis and Machine Intelligence.