论文信息 - LiveSketch: Query Perturbations for Guided Sketch-Based Visual Search

LiveSketch: Query Perturbations for Guided Sketch-Based Visual Search

LiveSketch is a novel algorithm for searching large image collections using hand-sketched queries. LiveSketch tackles the inherent ambiguity of sketch search by creating visual suggestions that augment the query as it is drawn, making query specification an iterative rather than one-shot process that helps disambiguate users' search intent. Our technical contributions are: a triplet convnet architecture that incorporates an RNN based variational autoencoder to search for images using vector (stroke-based) queries; real-time clustering to identify likely search intents (and so, targets within the search embedding); and the use of backpropagation from those targets to perturb the input stroke sequence, so suggesting alterations to the query in order to guide the search. We show improvements in accuracy and time-to-task over contemporary baselines using a 67M image corpus.

[1] Richard E. Turner,et al. Sequence Tutor: Conservative Fine-Tuning of Sequence Generation Models with KL-control , 2016, ICML.

[2] Dumitru Erhan,et al. Going deeper with convolutions , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[3] Douglas Eck,et al. A Neural Representation of Sketch Drawings , 2017, ICLR.

[4] Adriana Kovashka,et al. Attribute Pivots for Guiding Relevance Feedback in Image Search , 2013, 2013 IEEE International Conference on Computer Vision.

[5] Ondrej Chum,et al. Asymmetric Feature Maps with Application to Sketch Based Retrieval , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[6] Yong Jae Lee,et al. ShadowDraw: real-time user guidance for freehand drawing , 2011, ACM Trans. Graph..

[7] Leo Sampaio Ferraz Ribeiro,et al. Sketching out the details: Sketch-based image retrieval using convolutional neural networks with multi-stage regression , 2018, Comput. Graph..

[8] Delbert Dueck,et al. Clustering by Passing Messages Between Data Points , 2007, Science.

[9] Yong Jae Lee,et al. AverageExplorer: interactive exploration and alignment of visual data collections , 2014, ACM Trans. Graph..

[10] Rui Hu,et al. A performance evaluation of gradient field HOG descriptor for sketch based image retrieval , 2013, Comput. Vis. Image Underst..

[11] Michael Isard,et al. Object retrieval with large vocabularies and fast spatial matching , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[12] Andrew Zisserman,et al. Video Google: a text retrieval approach to object matching in videos , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.

[13] John P. Collomosse,et al. Generalisation and Sharing in Triplet Convnets for Sketch based Visual Search , 2016, ArXiv.

[14] Honggang Zhang,et al. Sketch-based image retrieval via Siamese convolutional neural network , 2016, 2016 IEEE International Conference on Image Processing (ICIP).

[15] John P. Collomosse,et al. Interactive video asset retrieval using sketched queries , 2014, CVMP.

[16] Ondrej Chum,et al. CNN Image Retrieval Learns from BoW: Unsupervised Fine-Tuning with Hard Examples , 2016, ECCV.

[17] John P. Collomosse,et al. Scalable Sketch-Based Image Retrieval Using Color Gradient Features , 2015, 2015 IEEE International Conference on Computer Vision Workshop (ICCVW).

[18] Hailin Jin,et al. BAM! The Behance Artistic Media Dataset for Recognition Beyond Photography , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[19] Seyed-Mohsen Moosavi-Dezfooli,et al. Universal Adversarial Perturbations , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[20] David H. Douglas,et al. ALGORITHMS FOR THE REDUCTION OF THE NUMBER OF POINTS REQUIRED TO REPRESENT A DIGITIZED LINE OR ITS CARICATURE , 1973 .

[21] Yang Song,et al. Learning Fine-Grained Image Similarity with Deep Ranking , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[22] John P. Collomosse,et al. Free-hand sketch grouping for video retrieval , 2008, 2008 19th International Conference on Pattern Recognition.

[23] Hailin Jin,et al. Sketching with Style: Visual Search with Sketches and Aesthetic Context , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[24] Hans Burkhardt,et al. SVM-based Relevance Feedback in Image Retrieval using Invariant Feature Histograms , 2005, MVA.

[25] James Hays,et al. The sketchy database , 2016, ACM Trans. Graph..

[26] Feng Liu,et al. Sketch Me That Shoe , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[27] Adriana Kovashka,et al. WhittleSearch: Image search with relative attribute feedback , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[28] David A. Forsyth,et al. NO Need to Worry about Adversarial Examples in Object Detection in Autonomous Vehicles , 2017, ArXiv.

[29] John P. Collomosse,et al. Compact descriptors for sketch-based image retrieval using a triplet loss convolutional neural network , 2017, Comput. Vis. Image Underst..

[30] Jonathon Shlens,et al. Explaining and Harnessing Adversarial Examples , 2014, ICLR.

[31] Cordelia Schmid,et al. Aggregating local descriptors into a compact image representation , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[32] Marc Alexa,et al. How do humans sketch objects? , 2012, ACM Trans. Graph..

[33] Junsong Yuan,et al. Query Adaptive Instance Search using Object Sketches , 2016, ACM Multimedia.

[34] Jun Guo,et al. SketchMate: Deep Hashing for Million-Scale Human Sketch Retrieval , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[35] Ondrej Chum,et al. Deep Shape Matching , 2017, ECCV.

[36] Fang Wang,et al. Sketch-based 3D shape retrieval using Convolutional Neural Networks , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[37] Albert Gordo,et al. Deep Image Retrieval: Learning Global Representations for Image Search , 2016, ECCV.

[38] Student,et al. THE PROBABLE ERROR OF A MEAN , 1908 .

[39] Logan Engstrom,et al. Synthesizing Robust Adversarial Examples , 2017, ICML.