Linear Discriminant Analysis for Zero-shot Learning Image Retrieval

This paper introduces a new distance function for comparing images in the context of content-based image retrieval. Given a query and a large dataset to be searched, the system has to provide the user – as efficiently as possible – with a list of images ranked according to their distance to the query. Because of computational issues, traditional image search systems are generally based on conventional distance function such as the Euclidian distance or the dot product, avoiding the use of any training data nor expensive online metric learning algorithms. The drawback is that, in this case, the system can hardly cope with the variability of image contents. This paper proposes a simple yet efficient zero-shot learning algorithm that can learn a query-adapted distance function from a single image (the query) or from a few images (e.g. some user-selected images in a relevance feedback iteration), hence improving the quality of the retrieved images. This allows our system to work with any object categories without requiring any training data, and is hence more applicable in real world use cases. More interestingly, our system can learn the metric on the fly, at almost no cost, and the cost of the ranking function is as low as the dot product distance. By allowing the system to learn to rank the images, significantly and consistently improved results (over the conventional approaches) have been observed on the Oxford5k, Paris6k and Holiday1k datasets.

[1]  Tommy W. S. Chow,et al.  Soft label based Linear Discriminant Analysis for image recognition and retrieval , 2014, Comput. Vis. Image Underst..

[2]  Ke Lu,et al.  Image retrieval based on incremental subspace learning , 2005, Pattern Recognit..

[3]  Michael Isard,et al.  Object retrieval with large vocabularies and fast spatial matching , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[4]  Juyang Weng,et al.  Using Discriminant Eigenfeatures for Image Retrieval , 1996, IEEE Trans. Pattern Anal. Mach. Intell..

[5]  Yoshua Bengio,et al.  Zero-data Learning of New Tasks , 2008, AAAI.

[6]  Josef Kittler,et al.  Incremental Linear Discriminant Analysis Using Sufficient Spanning Set Approximations , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[7]  Andrew Zisserman,et al.  Video Google: a text retrieval approach to object matching in videos , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.

[8]  Cordelia Schmid,et al.  Aggregating Local Image Descriptors into Compact Codes , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[9]  Christoph H. Lampert,et al.  Learning to detect unseen object classes by between-class attribute transfer , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[10]  Kristen Grauman,et al.  Relative attributes , 2011, 2011 International Conference on Computer Vision.

[11]  Thomas Mensink,et al.  Image Classification with the Fisher Vector: Theory and Practice , 2013, International Journal of Computer Vision.

[12]  Aleix M. Martínez,et al.  Kernel Optimization in Discriminant Analysis , 2011, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[13]  Chee Seng Chan,et al.  pLSA-based zero-shot learning , 2013, 2013 IEEE International Conference on Image Processing.

[14]  Hervé Jégou,et al.  Negative Evidences and Co-occurences in Image Retrieval: The Benefit of PCA and Whitening , 2012, ECCV.

[15]  Jun Guo,et al.  Linear Ranking Analysis , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[16]  Geoffrey E. Hinton,et al.  Zero-shot Learning with Semantic Output Codes , 2009, NIPS.

[17]  Xuelong Li,et al.  Direct kernel biased discriminant analysis: a new content-based image retrieval relevance feedback algorithm , 2006, IEEE Transactions on Multimedia.

[18]  Aleix M. Martínez,et al.  Subclass discriminant analysis , 2006, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[19]  Florent Perronnin,et al.  Large-scale image retrieval with compressed Fisher vectors , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.