A Divide and Conquer Method for Automatic Image Annotation

Fast and accurate automatic image annotation is of great significance. Linear regression provides a fast and simple automatic image annotation method. However, it is a linear model and it is trained on the whole training data set. The computational complexity of linear regression increases with the number of training samples. In this paper, we propose a new automatic image annotation method based on data grouping. First, training samples are mapped into a new space. Next, these samples are grouped in this new space by constrained clustering. Finally, a system consisting of a softmax gate network and multiple experts is trained on the partitioned data sets. Each expert is a single-hidden-layer feedforward neural network. Experimental results on three image annotation benchmark data sets show that our method achieves better results. In addition, our experimental results show that effective grouping of training set and training an expert on each sub training set can improve the automatic image annotation performance.

[1]  Cordelia Schmid,et al.  TagProp: Discriminative metric learning in nearest neighbor models for image auto-annotation , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[2]  Andrew Zisserman,et al.  Efficient additive kernels via explicit feature maps , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[3]  Yang Yu,et al.  Automatic image annotation using group sparsity , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[4]  Michael R. Lyu,et al.  A pseudoinverse learning algorithm for feedforward neural networks with stacked generalization applications to software reliability growth data , 2004, Neurocomputing.

[5]  Fabio Del Frate,et al.  Use of Neural Networks for Automatic Classification From High-Resolution Images , 2007, IEEE Transactions on Geoscience and Remote Sensing.

[6]  Geoffrey E. Hinton,et al.  Adaptive Mixtures of Local Experts , 1991, Neural Computation.

[7]  Kilian Q. Weinberger,et al.  Fast Image Tagging , 2013, ICML.

[8]  Claire Cardie,et al.  Proceedings of the Eighteenth International Conference on Machine Learning, 2001, p. 577–584. Constrained K-means Clustering with Background Knowledge , 2022 .

[9]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[10]  Camille Couprie,et al.  Learning Hierarchical Features for Scene Labeling , 2013, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[11]  Jae Won Lee,et al.  Content-based image classification using a neural network , 2004, Pattern Recognit. Lett..

[12]  Sungyoung Kim,et al.  Image Classification into Object / Non-object Classes , 2004, CIVR.