Fusion of Local and Global Descriptors for Content-Based Image and Video Retrieval

Recently, fusion of descriptors has become a trend for improving the performance in image and video retrieval tasks. Descriptors can be global or local, depending on how they analyze visual content. Most of existing works have focused on the fusion of a single type of descriptor. Different from all of them, this paper aims to analyze the impact of combining global and local descriptors. Here, we perform a comparative study of different types of descriptors and all of their possible combinations. Extensive experiments of a rigorous experimental design show that global and local descriptors complement each other, such that, when combined, they outperform other combinations or single descriptors.

[1]  Ricardo da Silva Torres,et al.  Content-Based Image Retrieval: Theory and Applications , 2006, RITA.

[2]  Jurandy Almeida,et al.  Making colors worth more than a thousand words , 2008, SAC '08.

[3]  John R. Koza,et al.  Genetic programming - on the programming of computers by means of natural selection , 1993, Complex adaptive systems.

[4]  Jurandy Almeida,et al.  Rapid Video Summarization on Compressed Video , 2010, 2010 IEEE International Symposium on Multimedia.

[5]  Andrea Salgian,et al.  Combining local descriptors for 3D object recognition and categorization , 2008, 2008 19th International Conference on Pattern Recognition.

[6]  Jurandy Almeida,et al.  Comparison of video sequences with histograms of motion patterns , 2011, 2011 18th IEEE International Conference on Image Processing.

[7]  Ricardo da Silva Torres,et al.  Comparative study of global color and texture descriptors for web image retrieval , 2012, J. Vis. Commun. Image Represent..

[8]  Yiquan Wu,et al.  Shape-Based Image Retrieval Using Combining Global and Local Shape Features , 2009, 2009 2nd International Congress on Image and Signal Processing.

[9]  Ying Liu,et al.  A survey of content-based image retrieval with high-level semantics , 2007, Pattern Recognit..

[10]  Jurandy Almeida,et al.  VISON: VIdeo Summarization for ONline applications , 2012, Pattern Recognit. Lett..

[11]  Edward A. Fox,et al.  A genetic programming framework for content-based image retrieval , 2009, Pattern Recognit..

[12]  Weiguo Fan,et al.  Relevance feedback based on genetic programming for image retrieval , 2011, Pattern Recognit. Lett..