Efficient Multimedia Similarity Measurement Using Similar Elements

Online social networking techniques and large-scale multimedia systems are developing rapidly, which not only has brought great convenience to our daily life, but generated, collected, and stored large-scale multimedia data. This trend has put forward higher requirements and greater challenges on massive multimedia data retrieval. In this paper, we investigate the problem of image similarity measurement which is used to lots of applications. At first we propose the definition of similarity measurement of images and the related notions. Based on it we present a novel basic method of similarity measurement named SMIN. To improve the performance of calculation, we propose a novel indexing structure called SMI Temp Index (SMII for short). Besides, we establish an index of potential similar visual words off-line to solve to problem that the index cannot be reused. Experimental evaluations on two real image datasets demonstrate that our solution outperforms state-of-the-art method.

[1]  Allan Hanbury,et al.  Finding duplicate images in biology papers , 2017, SAC.

[2]  Lin Wu,et al.  Unsupervised Metric Fusion Over Multiview Data by Graph Random Walk-Based Cross-View Diffusion , 2017, IEEE Transactions on Neural Networks and Learning Systems.

[3]  Lin Wu,et al.  What-and-Where to Match: Deep Spatially Multiplicative Integration Networks for Person Re-identification , 2017, Pattern Recognit..

[4]  Tomás Pajdla,et al.  Selecting image pairs for SfM by introducing Jaccard Similarity , 2017, IPSJ Transactions on Computer Vision and Applications.

[5]  Shumeet Baluja,et al.  VisualRank: Applying PageRank to Large-Scale Image Search , 2008, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[6]  Lin Wu,et al.  Iterative Views Agreement: An Iterative Low-Rank Based Structured Optimization Method to Multi-View Spectral Clustering , 2016, IJCAI.

[7]  Qingquan Li,et al.  Instance Similarity Deep Hashing for Multi-Label Image Retrieval , 2018, ArXiv.

[8]  Lin Wu,et al.  Multiview Spectral Clustering via Structured Low-Rank Matrix Factorization , 2017, IEEE Transactions on Neural Networks and Learning Systems.

[9]  Qingwei Gao,et al.  Efficient near-duplicate image detection with a local-based binary representation , 2015, Multimedia Tools and Applications.

[10]  M. Fatih Demirci,et al.  Distinctive interest point selection for efficient near-duplicate image retrieval , 2016, 2016 IEEE Southwest Symposium on Image Analysis and Interpretation (SSIAI).

[11]  Fang Huang,et al.  Efficient continuous top-k geo-image search on road network , 2018, Multimedia Tools and Applications.

[12]  Lin Wu,et al.  LBMCH: Learning Bridging Mapping for Cross-modal Hashing , 2015, SIGIR.

[13]  Fangyuan Wang,et al.  Large Scale Image Retrieval with Practical Spatial Weighting for Bag-of-Visual-Words , 2013, MMM.

[14]  David G. Lowe,et al.  Object recognition from local scale-invariant features , 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[15]  Fang Huang,et al.  Efficient region of visual interests search for geo-multimedia data , 2018, Multimedia Tools and Applications.

[16]  Lin Wu,et al.  Crossing Generative Adversarial Networks for Cross-View Person Re-identification , 2018, Neurocomputing.

[17]  Sergei Fedorov,et al.  Large scale near-duplicate image retrieval using Triples of Adjacent Ranked Features (TARF) with embedded geometric information , 2016, ArXiv.

[18]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[19]  Ricardo da Silva Torres,et al.  Color and texture applied to a signature-based bag of visual words method for image retrieval , 2017, Multimedia Tools and Applications.

[20]  Koji Abe,et al.  Similarity Retrieval of Trademark Images by Vector Graphics Based on Shape Characteristics of Components , 2018, ICCAE.

[21]  G LoweDavid,et al.  Distinctive Image Features from Scale-Invariant Keypoints , 2004 .

[22]  Lin Wu,et al.  Beyond Low-Rank Representations: Orthogonal Clustering Basis Reconstruction with Optimized Graph Structure for Multi-view Spectral Clustering , 2017, Neural Networks.

[23]  Nicu Sebe,et al.  Content-based multimedia information retrieval: State of the art and challenges , 2006, TOMCCAP.

[24]  Tanaya Guha,et al.  Image similarity measurement from sparse reconstruction errors , 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.

[25]  Silvia Bertoluzza,et al.  A New Class of Wavelet-Based Metrics for Image Similarity Assessment , 2017, Journal of Mathematical Imaging and Vision.

[26]  Linda G. Shapiro,et al.  A SIFT descriptor with global context , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[27]  Kiyoshi Tanaka,et al.  Improving the Efficiency in Halftone Image Generation Based on Structure Similarity Index Measurement , 2012, IEICE Trans. Inf. Syst..

[28]  Bing Yang,et al.  Near-Duplicate Image Retrieval Based on Contextual Descriptor , 2015, IEEE Signal Processing Letters.

[29]  Adel M. Alimi,et al.  Bimodal biometric system for hand shape and palmprint recognition based on SIFT sparse representation , 2017, Multimedia Tools and Applications.

[30]  WuXinyu,et al.  Efficient near-duplicate image detection with a local-based binary representation , 2016 .

[31]  Fang Huang,et al.  Hierarchical information quadtree: efficient spatial temporal image search for multimedia stream , 2018, Multimedia Tools and Applications.

[32]  Lin Wu,et al.  Deep adaptive feature embedding with local sample distributions for person re-identification , 2017, Pattern Recognit..

[33]  Lin Wu,et al.  Shifting Hypergraphs by Probabilistic Voting , 2014, PAKDD.

[34]  Lin Wu,et al.  Efficient image and tag co-ranking: a bregman divergence optimization method , 2013, ACM Multimedia.

[35]  Er Aman,et al.  Content-Based Image Retrieval : A Comprehensive Study , 2019 .

[36]  Jong-Uk Hou,et al.  A SIFT features based blind watermarking for DIBR 3D images , 2018, Multimedia Tools and Applications.

[37]  Lin Wu,et al.  Effective Multi-Query Expansions: Robust Landmark Retrieval , 2015, ACM Multimedia.

[38]  Jing Zhang,et al.  Semantic Discriminative Metric Learning for Image Similarity Measurement , 2016, IEEE Transactions on Multimedia.

[39]  Lei Zhu,et al.  Efficient interactive search for geo-tagged multimedia data , 2018, Multimedia Tools and Applications.

[40]  Yuan Yan Tang,et al.  Mining near duplicate image groups , 2014, Multimedia Tools and Applications.

[41]  Lin Wu,et al.  Exploiting Correlation Consensus: Towards Subspace Clustering for Multi-modal Data , 2014, ACM Multimedia.

[42]  Lin Wu,et al.  Effective Multi-Query Expansions: Collaborative Deep Networks for Robust Landmark Retrieval , 2017, IEEE Transactions on Image Processing.

[43]  Lin Wu,et al.  Robust Hashing for Multi-View Data: Jointly Learning Low-Rank Kernelized Similarity Consensus and Hash Functions , 2016, Image Vis. Comput..

[44]  Andrew Zisserman,et al.  Video Google: a text retrieval approach to object matching in videos , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.

[45]  Lei Zhu,et al.  Temporal Activity Path Based Character Correction in Heterogeneous Social Networks via Multimedia Sources , 2018, Adv. Multim..

[46]  Shang-Lin Hsieh,et al.  A novel approach to detecting duplicate images using multiple hash tables , 2014, Multimedia Tools and Applications.

[47]  Yang Wang,et al.  Structured Deep Hashing with Convolutional Neural Networks for Fast Person Re-identification , 2017, Comput. Vis. Image Underst..

[48]  Yuping Zhang,et al.  MBR-SIFT: A mirror reflected invariant feature descriptor using a binary representation for image matching , 2017, PloS one.

[49]  Jianping Fan,et al.  MapReduce-based clustering for near-duplicate image identification , 2016, Multimedia Tools and Applications.

[50]  Yue Lu,et al.  Variable-Length Signature for Near-Duplicate Image Matching , 2015, IEEE Transactions on Image Processing.

[51]  Marcelo Cicconet,et al.  Image Forensics: Detecting duplication of scientific images with manipulation-invariant image similarity , 2018, ArXiv.

[52]  Lin Wu,et al.  Where-and-When to Look: Deep Siamese Attention Networks for Video-Based Person Re-Identification , 2018, IEEE Transactions on Multimedia.

[53]  Fang Huang,et al.  CNN-VWII: An Efficient Approach for Large-Scale Video Retrieval by Image Queries , 2018, Pattern Recognit. Lett..

[54]  Xue Li,et al.  Deep Attention-Based Spatially Recursive Networks for Fine-Grained Visual Recognition , 2019, IEEE Transactions on Cybernetics.

[55]  Yu Liu,et al.  Multi-focus image fusion with dense SIFT , 2015, Inf. Fusion.

[56]  Ling Shao,et al.  Cycle-Consistent Deep Generative Hashing for Cross-Modal Retrieval , 2018, IEEE Transactions on Image Processing.

[57]  Lin Wu,et al.  Robust Subspace Clustering for Multi-View Data by Exploiting Correlation Consensus , 2015, IEEE Transactions on Image Processing.

[58]  Savvas A. Chatzichristofis,et al.  Image moment invariants as local features for content based image retrieval using the Bag-of-Visual-Words model , 2015, Pattern Recognit. Lett..

[59]  Yan Ke,et al.  PCA-SIFT: a more distinctive representation for local image descriptors , 2004, Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2004. CVPR 2004..

[60]  Mihai Datcu,et al.  On the Use of Normalized Compression Distances for Image Similarity Detection , 2018, Entropy.

[61]  Ji Wan,et al.  Deep Learning for Content-Based Image Retrieval: A Comprehensive Study , 2014, ACM Multimedia.

[62]  Zhili Zhou,et al.  Spatial descriptor embedding for near-duplicate image retrieval , 2018, Int. J. Embed. Syst..

[63]  Yang Wang,et al.  Towards metric fusion on multi-view data: a cross-view based graph random walk approach , 2013, CIKM.