Automatic Annotation of Images by a Statistical Learning Approach

A novel statistical learning approach for automatic annotation of images is presented. A minimum probability of error annotation is feasible with our approach. Firstly, an image is represented as a bag of feature vectors by dividing the image into small blocks, from each of which a six-dimension feature vector is extracted. Secondly, we established the probabilistic formulation for automatic annotation of images through estimating the Gaussian mixtures of each image and the common Gaussian mixtures of all the images with the same semantic label. At last, the steps of the training and annotation algorithm are given based on our probabilistic formulation. Experimental results show that the proposed supervised formulation achieve higher accuracy than previously published method.

[1]  Anil K. Jain,et al.  On image classification: city vs. landscape , 1998, Proceedings. IEEE Workshop on Content-Based Access of Image and Video Libraries (Cat. No.98EX173).

[2]  Anil K. Jain,et al.  On image classification: city images vs. landscapes , 1998, Pattern Recognit..

[3]  Shih-Fu Chang,et al.  Image Retrieval: Current Techniques, Promising Directions, and Open Issues , 1999, J. Vis. Commun. Image Represent..

[4]  Nuno Vasconcelos,et al.  Image indexing with mixture hierarchies , 2001, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001.

[5]  Gustavo Carneiro,et al.  Supervised Learning of Semantic Classes for Image Annotation and Retrieval , 2007, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[6]  David G. Stork,et al.  Pattern Classification , 1973 .

[7]  Michael I. Jordan,et al.  Modeling annotated data , 2003, SIGIR.

[8]  David A. Forsyth,et al.  Learning the semantics of words and pictures , 2001, Proceedings Eighth IEEE International Conference on Computer Vision. ICCV 2001.

[9]  D. Rubin,et al.  Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .

[10]  Y. Mori,et al.  Image-to-word transformation based on dividing and vector quantizing images with words , 1999 .

[11]  Marcel Worring,et al.  Content-Based Image Retrieval at the End of the Early Years , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[12]  Ramesh C. Jain,et al.  ACM SIGMM retreat report on future directions in multimedia research , 2005, TOMCCAP.