Stroke extraction for chinese characters using a trend-followed transcribing technique

Abstract The merit of the stroke extraction algorithms which utilize the thinning process is the ease of the feature abstracting from the skeleton of a character. The two main tasks for this kind of algorithms are to find the certain adjacent segmental strokes for being merged into a complete stroke, and to search the corner point to divide the bend segmental stroke into two or more individual strokes. This paper proposes an intuitive and effective stroke extraction method that passes through the distorted region and gets the reliable information of global features by applying the trend-followed transcribing technique to correctly accomplish the tasks. In our experiments, the most frequently used 1500 Chinese characters printed in both the Ming font and the Fang-Sung font with the size of 64 × 64 points are tested. The results of the experiments show that the rate for correctly extracting all strokes of a character is 97.8% for the Ming font and 98.4% for the Fang-Sung font. That is, the proposed stroke extraction algorithm is useful and reliable.

[1]  Paul Y. S. Cheung,et al.  Fuzzy-attribute graph with application to Chinese character recognition , 1992, IEEE Trans. Syst. Man Cybern..

[2]  Jun S. Huang,et al.  Stroke segmentation by bernstein-bezier curve fitting , 1990, Pattern Recognit..

[3]  Jun S. Huang,et al.  Separating similar complex Chinese characters by Walsh transform , 1987, Pattern Recognit..

[4]  Yun Xia,et al.  Skeletonization via the Realization of the Fire Front's Propagation and Extinction in Digital Binary Shapes , 1989, IEEE Trans. Pattern Anal. Mach. Intell..

[5]  Ling-Hwei Chen,et al.  Handwritten character recognition using a 2-layer random graph model by relaxation matching , 1990, Pattern Recognit..

[6]  Y. S. Cheung,et al.  A knowledge-based stroke-matching method for Chinese character recognition , 1987, IEEE Transactions on Systems, Man, and Cybernetics.

[7]  Jun S. Huang,et al.  A transformation invariant matching algorithm for handwritten chinese character recognition , 1990, Pattern Recognit..

[8]  Hideo Ogawa,et al.  Thinning and stroke segmentation for handwritten Chinese character recognition , 1982, Pattern Recognit..

[9]  Paul P. Wang,et al.  Machine recognition of printed Chinese characters via transformation algorithms , 1973, Pattern Recognit..

[10]  Ching Y. Suen,et al.  Application of a Multilayer Decision Tree in Computer Recognition of Chinese Characters , 1983, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[11]  Kim-Teng Lua,et al.  A new approach to stroke and feature point extraction in Chinese character recognition , 1991, Pattern Recognit. Lett..

[12]  Lin-Yu Tseng,et al.  An efficient knowledge-based stroke extraction method for multi-font chinese characters , 1992, Pattern Recognit..

[13]  Bei Li,et al.  A knowledge-based thinning algorithm , 1991, Pattern Recognit..

[14]  Ching Y. Suen,et al.  Hierarchical attributed graph representation and recognition of handwritten chinese characters , 1991, Pattern Recognit..

[15]  Fang-Hsuan Cheng,et al.  Research on Chinese OCR in Taiwan , 1991, Int. J. Pattern Recognit. Artif. Intell..

[16]  Kazuhiko Yamamoto,et al.  Research on Machine Recognition of Handprinted Characters , 1984, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[17]  Roland T. Chin,et al.  Analysis of Thinning Algorithms Using Mathematical Morphology , 1990, IEEE Trans. Pattern Anal. Mach. Intell..