论文信息 - Query Bootstrapping: A Visual Mining Based Query Expansion

Query Bootstrapping: A Visual Mining Based Query Expansion

Bag of Visual Words (BoVW) is an effective framework for image retrieval. Query expansion (QE) further boosts retrieval performance by refining a query with relevant visual words found from the geometric consistency check between the query image and highly ranked retrieved images obtained from the first round of retrieval. Since QE checks the pairwise consistency between query and highly ranked images, its performance may deteriorate when there are slight degradations in the query image. We propose Query Bootstrapping as a variant of QE to circumvent this problem by using the consistency of highly ranked images instead of pairwise consistency. In so doing, we regard frequently co-occurring visual words in highly ranked images as relevant visual words. Frequent itemset mining (FIM) is used to find such visual words efficiently. However, the FIM-based approach requires sensitive parameters to be fine-tuned, namely, support (min/max-support) and the number of top ranked images (top-k). Here, we propose an adaptive support algorithm that adaptively determines both the minimum support and maximum support by referring to the first round’s retrieval list. Selecting relevant images by using a geometric consistency check further boosts retrieval performance by reducing outlier images from a mining process. An important parameter for the LO-RANSAC algorithm that is used for the geometric consistency check, namely, inlier threshold, is automatically determined by our algorithm. We further introduce tf-fi-idf on top of tf-idf in order to take into account the frequency of inliers (fi) in the retrieved images. We evaluated the performance of QB in terms of mean average precision (mAP) on three benchmark datasets and found that it gave significant performance boosts of 5.37%, 9.65%, and 8.52% over that of state-of-the-art QE on Oxford 5k, Oxford 105k, and Paris 6k, respectively. key words: image retrieval, instance search, query expansion, frequent itemset mining, visual word mining, query bootstrapping, adaptive support, adaptive inlier threshold

Shin'ichi Satoh | Yusuke Uchida | Siriwat Kasamwattanarote

[1] Luc Van Gool,et al. Video mining with frequent itemset configurations , 2006 .

[2] Frédéric Jurie,et al. Image re-ranking based on statistics of frequent patterns , 2014, ICMR.

[3] Ramakrishnan Srikant,et al. Fast Algorithms for Mining Association Rules in Large Databases , 1994, VLDB.

[4] Shin'ichi Satoh,et al. Query-Adaptive Asymmetrical Dissimilarities for Visual Object Retrieval , 2013, 2013 IEEE International Conference on Computer Vision.

[5] Philip S. Yu,et al. Mining Colossal Frequent Patterns by Core Pattern Fusion , 2007, 2007 IEEE 23rd International Conference on Data Engineering.

[6] Shin'ichi Satoh,et al. Large vocabulary quantization for searching instances from videos , 2012, ICMR '12.

[7] Michael Isard,et al. Total Recall: Automatic Query Expansion with a Generative Feature Model for Object Retrieval , 2007, 2007 IEEE 11th International Conference on Computer Vision.

[8] Michael Isard,et al. Object retrieval with large vocabularies and fast spatial matching , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[9] Georges Quénot,et al. TRECVID 2015 - An Overview of the Goals, Tasks, Data, Evaluation Mechanisms and Metrics , 2011, TRECVID.

[10] Michael Isard,et al. Lost in quantization: Improving particular object retrieval in large scale image databases , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[11] Hiroki Arimura,et al. An Efficient Algorithm for Enumerating Closed Patterns in Transaction Databases , 2004, Discovery Science.

[12] Gerard Salton,et al. Improving retrieval performance by relevance feedback , 1997, J. Am. Soc. Inf. Sci..

[13] Andrew Zisserman,et al. Video Google: a text retrieval approach to object matching in videos , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.

[14] Philip S. Yu,et al. Approximate Frequent Pattern Mining , 2007 .

[15] A. Smeaton,et al. TRECVID 2013 -- An Overview of the Goals, Tasks, Data, Evaluation Mechanisms, and Metrics | NIST , 2011 .

[16] Florent Domenach,et al. Mining Association Rules using Lattice Theory (6th Workshop on Stochastic Numerics) , 2004 .

[17] Jiri Matas,et al. Total recall II: Query expansion revisited , 2011, CVPR 2011.

[18] Andrew Zisserman,et al. Three things everyone should know to improve object retrieval , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[19] David G. Lowe,et al. Fast Approximate Nearest Neighbors with Automatic Algorithm Configuration , 2009, VISAPP.

[20] Tzung-Pei Hong,et al. Mining association rules with multiple minimum supports using maximum constraints , 2005, Int. J. Approx. Reason..

[21] Jean-François Boulicaut,et al. Using transposition for pattern discovery from microarray data , 2003, DMKD '03.

[22] Tinne Tuytelaars,et al. Mining Multiple Queries for Image Retrieval: On-the-Fly Learning of an Object-Specific Mid-level Representation , 2013, 2013 IEEE International Conference on Computer Vision.

[23] Shin'ichi Satoh,et al. Tell Me about TV Commercials of This Product , 2014, MMM.

[24] Rioult. Mining strong emerging patterns in wide SAGE data , 2022 .

[25] Matthijs C. Dorst. Distinctive Image Features from Scale-Invariant Keypoints , 2011 .

[26] Christian Borgelt,et al. An implementation of the FP-growth algorithm , 2005 .

[27] Ming Yang,et al. Discovery of Collocation Patterns: from Visual Words to Visual Phrases , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[28] Jiri Matas,et al. Fixing the Locally Optimized RANSAC , 2012, BMVC.

[29] Florent Domenach,et al. Mining Association Rules using Lattice Theory , 2004 .

[30] Jiri Matas,et al. Efficient representation of local geometry for large scale object retrieval , 2009, CVPR.