SH-MQDF based calligraphic character recognition

Chinese Calligraphy draws a lot of attention for its beauty and elegance. Until now, lot of people don't know the semantic meaning of many calligraphic characters because they were written in ancient times. Technologies which can help users to recognize the unknown calligraphic characters are urgently required. However recognizing calligraphic characters is a challenging work due to the reasons: (1)complication. There are at least five different styles of the same character; (2)Deformation. Sometimes deliberate deformation in writing is performed, which makes the recognition even harder. In this paper, we present an approach for the recognition of calligraphic characters. Modified quadratic discriminant function (MQDF), which was the state-of-the-art classifier used in handwriting recognition, is introduced in recognizing calligraphic characters. Due to the high expenses of storage and computation of MQDF on large scale data, we use spectral hashing(SH) to dynamically reduce the whole training set to a much more relevant but much smaller set, called the approximate set. MQDF is then trained on the approximate set dynamically and classification is performed. We evaluated our approach on the calligraphic character dictionary (CCD), which is an expert labeled calligraphic character database. Our experiments show that our approach reduces the training time of classifier but doesn't lower the accuracies of recognition much. Furthermore, our approach outperforms the existing calligraphic character recognition methods on CCD.

[1]  Peng Liu,et al.  Calligraphy Beautification Method for Chinese Handwritings , 2012, 2012 Fourth International Conference on Digital Home.

[2]  Yueting Zhuang,et al.  Web based Chinese Calligraphy Learning with 3-D Visualization Method , 2006, 2006 IEEE International Conference on Multimedia and Expo.

[3]  Pengcheng Gao,et al.  LSH-based large scale chinese calligraphic character recognition , 2013, JCDL '13.

[4]  Cheng-Lin Liu,et al.  Handwritten Chinese Character Recognition: Effects of Shape Normalization and Feature Extraction , 2006, SACH.

[5]  Fei Yin,et al.  Online and offline handwritten Chinese character recognition: Benchmarking on new databases , 2013, Pattern Recognit..

[6]  Masaki Nakagawa,et al.  Evaluation of prototype learning algorithms for nearest-neighbor classifier in application to handwritten character recognition , 2001, Pattern Recognit..

[7]  Hiroshi Sako,et al.  Handwritten digit recognition: investigation of normalization and feature extraction techniques , 2004, Pattern Recognit..

[8]  Kai Yu,et al.  Chinese calligraphy specific style rendering system , 2010, JCDL '10.

[9]  Hsi-Jian Lee,et al.  Dual-binarization and anisotropic diffusion of Chinese characters in calligraphy documents , 2001, Proceedings of Sixth International Conference on Document Analysis and Recognition.

[10]  Yueting Zhuang,et al.  Skeleton-Based Recognition of Chinese Calligraphic Character Image , 2008, PCM.

[11]  Piotr Indyk,et al.  Approximate nearest neighbors: towards removing the curse of dimensionality , 1998, STOC '98.

[12]  Geoffrey E. Hinton,et al.  Reducing the Dimensionality of Data with Neural Networks , 2006, Science.

[13]  Yueting Zhuang,et al.  Retrieval of Chinese Calligraphic Character Image , 2004, PCM.

[14]  Yunhe Pan,et al.  Automatic generation of artistic chinese calligraphy , 2004, IEEE Intelligent Systems.

[15]  Pengcheng Gao,et al.  Ontology‐based model for Chinese Calligraphy Synthesis , 2013, Comput. Graph. Forum.

[16]  Fumitaka Kimura,et al.  Modified Quadratic Discriminant Functions and the Application to Chinese Character Recognition , 1987, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[17]  Jeffrey Scott Vitter,et al.  Proceedings of the thirtieth annual ACM symposium on Theory of computing , 1998, STOC 1998.

[18]  Shih-Fu Chang,et al.  Semi-Supervised Hashing for Large-Scale Search , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[19]  J. W. Patrick,et al.  Fourth international conference on coal science , 1988 .

[20]  Antonio Torralba,et al.  Modeling the Shape of the Scene: A Holistic Representation of the Spatial Envelope , 2001, International Journal of Computer Vision.

[21]  Alexandr Andoni,et al.  Near-Optimal Hashing Algorithms for Approximate Nearest Neighbor in High Dimensions , 2006, 2006 47th Annual IEEE Symposium on Foundations of Computer Science (FOCS'06).

[22]  Antonio Torralba,et al.  Spectral Hashing , 2008, NIPS.