论文信息 - Text Image Retrieval Based on Picture Information Measurement and Model-KNN

Text Image Retrieval Based on Picture Information Measurement and Model-KNN

Various shortages to understand high-level semantic feature of data information exist in computer technology which makes it difficult for computers to retrieve document image based on semantic feature directly. The capability of document image retrieval algorithm depends much on suitable abstraction of statistical feature and the selection of classifier. To this problem, the normalized generalized picture information measurement (NPIMK) is introduced as the statistical feature. Meanwhile, an improved KNN classifier based on model is used to identify to which species one image belongs. Experimental results show that the document image retrieval algorithm based on NPIMK and mode-KNN is effective

Ge Guo | Xijian Ping | Juan Cheng | Xibo Duan

[1] Francesca Cesarini,et al. Automatic document classification and indexing in high-volume applications , 2001, International Journal on Document Analysis and Recognition.

[2] Morshed U. Chowdhury,et al. Image semantic classification by using SVM , 2003 .

[4] Sahibsingh A. Dudani. The Distance-Weighted k-Nearest-Neighbor Rule , 1976, IEEE Transactions on Systems, Man, and Cybernetics.

[5] Yaxin Bi,et al. KNN Model-Based Approach in Classification , 2003, OTM.

[6] Ping Xijian. Image Feature of Information Measurement and Document Image Classification , 2004 .

[7] Paolo Frasconi,et al. Hidden Tree Markov Models for Document Image Classification , 2003, IEEE Trans. Pattern Anal. Mach. Intell..