Hybrid Regularization of Diffusion Process for Visual Re-Ranking

To improve the retrieval result obtained from a pairwise dissimilarity, many variants of diffusion process have been applied in visual re-ranking. In the framework of diffusion process, various contextual similarities can be obtained by solving an optimization problem, and the objective function consists of a smoothness constraint and a fitting constraint. And many improvements on the smoothness constraint have been made to reveal the underlying manifold structure. However, little attention has been paid to the fitting constraint, and how to build an effective fitting constraint still remains unclear. In this article, by deeply analyzing the role of fitting constraint, we firstly propose a novel variant of diffusion process named Hybrid Regularization of Diffusion Process (HyRDP). In HyRDP, we introduce a hybrid regularization framework containing a two-part fitting constraint, and the contextual dissimilarities can be learned from either a closed-form solution or an iterative solution. Furthermore, this article indicates that the basic idea of HyRDP is closely related to the mechanism behind Generalized Mean First-passage Time (GMFPT). GMFPT denotes the mean time-steps for the state transition from one state to any one in the given state set, and is firstly introduced as the contextual dissimilarity in this article. Finally, based on the semi-supervised learning framework, an iterative re-ranking process is developed. With this approach, the relevant objects on the manifold can be iteratively retrieved and labeled within finite iterations. The proposed algorithms are validated on various challenging databases, and the experimental performances demonstrate that retrieval results obtained from different types of measures can be effectively improved by using our methods.

[1]  Song Bai,et al.  Sparse Contextual Activation for Efficient Visual Re-Ranking , 2016, IEEE Transactions on Image Processing.

[2]  Ying Zhang,et al.  Learning context-sensitive similarity by shortest path propagation , 2011, Pattern Recognit..

[3]  Cordelia Schmid,et al.  Accurate Image Search Using the Contextual Dissimilarity Measure , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[4]  Zhuowen Tu,et al.  Integrating contour and skeleton for shape classification , 2009, 2009 IEEE 12th International Conference on Computer Vision Workshops, ICCV Workshops.

[5]  Wangshu Liu,et al.  Improving Shape Retrieval by Fusing Generalized Mean First-Passage Time , 2017, ICONIP.

[6]  Longin Jan Latecki,et al.  Re-Ranking via Metric Fusion for Object Retrieval and Person Re-Identification , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[7]  Jitendra Malik,et al.  Shape matching and object recognition using shape contexts , 2010, 2010 3rd International Conference on Computer Science and Information Technology.

[8]  Peter Kontschieder,et al.  Beyond Pairwise Shape Similarity Analysis , 2009, ACCV.

[9]  Xinbo Gao,et al.  Triplet-Based Deep Hashing Network for Cross-Modal Retrieval , 2018, IEEE Transactions on Image Processing.

[10]  Remco C. Veltkamp,et al.  Shape matching: similarity measures and algorithms , 2001, Proceedings International Conference on Shape Modeling and Applications.

[11]  Longin Jan Latecki,et al.  Affinity learning on a tensor product graph with applications to shape and image retrieval , 2011, CVPR 2011.

[12]  Longin Jan Latecki,et al.  Affinity Learning with Diffusion on Tensor Product Graph , 2013, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[13]  Huchuan Lu,et al.  Inner and Inter Label Propagation: Salient Object Detection in the Wild , 2015, IEEE Transactions on Image Processing.

[14]  Albert Gordo,et al.  End-to-End Learning of Deep Visual Representations for Image Retrieval , 2016, International Journal of Computer Vision.

[15]  Jianxiong Xiao,et al.  3D ShapeNets: A deep representation for volumetric shapes , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[16]  Lei Luo,et al.  Shape Similarity Analysis by Self-Tuning Locally Constrained Mixed-Diffusion , 2013, IEEE Transactions on Multimedia.

[17]  Aykut Erdem,et al.  Dissimilarity between two skeletal trees in a context , 2009, Pattern Recognit..

[18]  Subhransu Maji,et al.  Multi-view Convolutional Neural Networks for 3D Shape Recognition , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[19]  Zhuowen Tu,et al.  Improving Shape Retrieval by Learning Graph Transduction , 2008, ECCV.

[20]  Rongrong Ji,et al.  Visual Reranking through Weakly Supervised Multi-graph Learning , 2013, 2013 IEEE International Conference on Computer Vision.

[21]  Horst Bischof,et al.  Diffusion Processes for Retrieval Revisited , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[22]  Qi Tian,et al.  Regularized Diffusion Process for Visual Retrieval , 2017, AAAI.

[23]  Huchuan Lu,et al.  Saliency Region Detection Based on Markov Absorption Probabilities , 2015, IEEE Transactions on Image Processing.

[24]  Tommi S. Jaakkola,et al.  Partially labeled classification with Markov random walks , 2001, NIPS.

[25]  Trevor Darrell,et al.  Caffe: Convolutional Architecture for Fast Feature Embedding , 2014, ACM Multimedia.

[26]  Daniel Carlos Guimarães Pedronette,et al.  Unsupervised manifold learning using Reciprocal kNN Graphs in image re-ranking and rank aggregation tasks , 2014, Image Vis. Comput..

[27]  Yosi Keller,et al.  Improving Shape Retrieval by Spectral Matching and Meta Similarity , 2010, IEEE Transactions on Image Processing.

[28]  Qi Tian,et al.  Scalable Person Re-identification on Supervised Smoothed Manifold , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[29]  Ronald Fagin,et al.  Relaxing the Triangle Inequality in Pattern Matching , 2004, International Journal of Computer Vision.

[30]  Qi Tian,et al.  Regularized Diffusion Process on Bidirectional Context for Object Retrieval , 2019, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[31]  Ricardo da Silva Torres,et al.  Image Re-ranking and Rank Aggregation Based on Similarity of Ranked Lists , 2011, CAIP.

[32]  Qi Tian,et al.  Smooth Neighborhood Structure Mining on Multiple Affinity Graphs with Applications to Context-Sensitive Similarity , 2016, ECCV.

[33]  Ying Wu,et al.  Object retrieval and localization with spatially-constrained similarity measure and k-NN re-ranking , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[34]  Longin Jan Latecki,et al.  Automatic Ensemble Diffusion for 3D Shape and Image Retrieval , 2019, IEEE Transactions on Image Processing.

[35]  Liang Chen,et al.  Nonlinear Graph Fusion for Multi-modal Classification of Alzheimer's Disease , 2015, MLMI.

[36]  Huchuan Lu,et al.  Saliency Detection via Absorbing Markov Chain , 2013, 2013 IEEE International Conference on Computer Vision.

[37]  Oskar Söderkvist,et al.  Computer Vision Classification of Leaves from Swedish Trees , 2001 .

[38]  Longin Jan Latecki,et al.  Locally constrained diffusion process on locally densified distance spaces with applications to shape retrieval , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[39]  Longin Jan Latecki,et al.  GIFT: A Real-Time and Scalable 3D Shape Search Engine , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[40]  Qi Tian,et al.  Ensemble Diffusion for Retrieval , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[41]  Cordelia Schmid,et al.  Hamming Embedding and Weak Geometric Consistency for Large Scale Image Search , 2008, ECCV.

[42]  Bernhard Schölkopf,et al.  Ranking on Data Manifolds , 2003, NIPS.

[43]  Bo Wang,et al.  Unsupervised metric learning by Self-Smoothing Operator , 2011, 2011 International Conference on Computer Vision.

[44]  Yu Zhou,et al.  Similarity Fusion for Visual Tracking , 2015, International Journal of Computer Vision.

[45]  Ulrich Eckhardt,et al.  Shape descriptors for non-rigid shapes with a single closed contour , 2000, Proceedings IEEE Conference on Computer Vision and Pattern Recognition. CVPR 2000 (Cat. No.PR00662).

[46]  Qi Tian,et al.  Accurate Image Search with Multi-Scale Contextual Evidences , 2016, International Journal of Computer Vision.

[47]  Zhuowen Tu,et al.  Learning Context-Sensitive Shape Similarity by Graph Transduction , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[48]  Bo Wang,et al.  Unsupervised metric fusion by cross diffusion , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[49]  Bo Wang,et al.  Co-Transduction for Shape Retrieval , 2010, IEEE Transactions on Image Processing.

[50]  David Nistér,et al.  Scalable Recognition with a Vocabulary Tree , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[51]  Bernhard Schölkopf,et al.  Learning with Local and Global Consistency , 2003, NIPS.

[52]  J. Klafter,et al.  First-passage times in complex scale-invariant media , 2007, Nature.

[53]  J. Stoyanov A Guide to First‐passage Processes , 2003 .

[54]  Jurandy Almeida,et al.  Unsupervised Distance Learning for Plant Species Identification , 2016, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing.

[55]  LingHaibin,et al.  Shape Classification Using the Inner-Distance , 2007 .

[56]  Xiang Bai,et al.  Shape Vocabulary: A Robust and Efficient Shape Representation for Shape Matching , 2014, IEEE Transactions on Image Processing.