Calligraphy Word Style Recognition by KNN Based Feature Library Filtering

Chinese calligraphy works is a valuable part of the Chinese culture heritage. More and more calligraphy works are digitized, preserved and exhibited in digital library so that people can enjoy the calligraphers’ works conveniently. There are five main writing style categories of calligraphy words, namely, seal script, clerical script, standard script, semi-cursive script and cursive script. Users always want to appreciate the style-similar works simultaneously, so it’s necessary to classify the words by their writing style. In this study, we proposed a method based on KNN and feature vector filtering. Firstly, extract the SIFT points from the training images, building up a feature library for SIFT feature vectors. Then use a KNN-based method to filter the feature library so that the style-irrelevant feature points can be wiped out. At last we use the filtered feature library to classify the word images by a modified KNN classifier. Experiments show that SIFT feature has better recognition result than that of Gabor feature and GIST feature, but the large amount of feature vectors in the SIFT feature library makes the KNN searching rather slow. To accelerate the recognition speed, Spectral Hashing is used to index the feature library, which makes it faster to classify feature points and gives no side effect on the recognition ratio.

[1]  G LoweDavid,et al.  Distinctive Image Features from Scale-Invariant Keypoints , 2004 .

[2]  Alexandr Andoni,et al.  Near-Optimal Hashing Algorithms for Approximate Nearest Neighbor in High Dimensions , 2006, 2006 47th Annual IEEE Symposium on Foundations of Computer Science (FOCS'06).

[3]  Cem Ünsalan,et al.  Urban-Area and Building Detection Using SIFT Keypoints and Graph Theory , 2009, IEEE Transactions on Geoscience and Remote Sensing.

[4]  Yueting Zhuang,et al.  Latent Style Model: Discovering writing styles for calligraphy works , 2009, J. Vis. Commun. Image Represent..

[5]  Cordelia Schmid,et al.  Evaluation of GIST descriptors for web-scale image search , 2009, CIVR '09.

[6]  Jon Louis Bentley,et al.  Quad trees a data structure for retrieval on composite keys , 1974, Acta Informatica.

[7]  Matthijs C. Dorst Distinctive Image Features from Scale-Invariant Keypoints , 2011 .

[8]  Cordelia Schmid,et al.  A Performance Evaluation of Local Descriptors , 2005, IEEE Trans. Pattern Anal. Mach. Intell..

[9]  George Nagy,et al.  The CADAL calligraphic database , 2011, HIP '11.

[10]  Stefan B. Williams,et al.  Reduced SIFT Features For Image Retrieval And Indoor Localisation , 2004 .

[11]  Antonio Torralba,et al.  Spectral Hashing , 2008, NIPS.

[12]  David G. Lowe,et al.  Object recognition from local scale-invariant features , 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[13]  Yueting Zhuang,et al.  Discovering calligraphy style relationships by Supervised Learning Weighted Random Walk Model , 2009, Multimedia Systems.

[14]  Pengcheng Gao,et al.  LSH-based large scale chinese calligraphic character recognition , 2013, JCDL '13.

[15]  Antonio Torralba,et al.  Modeling the Shape of the Scene: A Holistic Representation of the Spatial Envelope , 2001, International Journal of Computer Vision.