Comparison of feature-based and image registration-based retrieval of image data using multidimensional data access methods

Abstract In information retrieval, efficient similarity search in multimedia collections is a critical task. In this paper, we present a rigorous comparison of three different approaches to the image retrieval problem, including cluster-based indexing, distance-based indexing, and multidimensional scaling methods. The time and accuracy trade-offs for each of these methods are demonstrated on three different image data sets. Similarity of images is obtained either by a feature-based similarity measure using four MPEG-7 low-level descriptors or by a whole image-based similarity measure. The effect of these similarity measurement techniques on the retrieval process is also evaluated through the performance tests performed on several data sets. We show that using low-level features of images in the similarity measurement function results in significantly better accuracy and time performance compared to the whole-image based approach. Moreover, an optimization of feature contributions to the distance measure for feature-based approach can identify the most relevant features and is necessary to obtain maximum accuracy. We further show that multidimensional scaling can achieve comparable accuracy, while speeding-up the query times significantly by allowing the use of spatial access methods.

[1]  Empirical evaluation of MPEG-7 XM color descriptors in content-based retrieval of semantic image categories , 2002, Object recognition supported by user interaction for service robots.

[2]  Qi Tian,et al.  Fast and robust short video clip search using an index structure , 2004, MIR '04.

[3]  Thomas Sikora,et al.  The MPEG-7 visual standard for content description-an overview , 2001, IEEE Trans. Circuits Syst. Video Technol..

[4]  Christos Faloutsos,et al.  Slim-Trees: High Performance Metric Trees Minimizing Overlap Between Nodes , 2000, EDBT.

[5]  Joshua B. Tenenbaum,et al.  Global Versus Local Methods in Nonlinear Dimensionality Reduction , 2002, NIPS.

[6]  Beng Chin Ooi,et al.  Hierarchical Indexing Structure for Efficient Similarity Search in Video Retrieval , 2006, IEEE Transactions on Knowledge and Data Engineering.

[7]  Kenneth Rose,et al.  VQ-index: an index structure for similarity searching in multimedia databases , 2002, MULTIMEDIA '02.

[8]  Amit P. Sheth,et al.  Semantic (Web) Technology In Action: Ontology Driven Information Systems for Search, Integration and Analysis , 2003, IEEE Data Eng. Bull..

[9]  David A. Forsyth,et al.  Object Recognition as Machine Translation: Learning a Lexicon for a Fixed Image Vocabulary , 2002, ECCV.

[10]  Ismail Hakki Toroslu,et al.  Approximate similarity search in genomic sequence databases using landmark-guided embedding , 2008, 2008 IEEE 24th International Conference on Data Engineering Workshop.

[11]  Manuel Barrena García,et al.  A flexible framework to ease nearest neighbor search in multidimensional data spaces , 2010, Data Knowl. Eng..

[12]  W. Torgerson Multidimensional scaling: I. Theory and method , 1952 .

[13]  David B. Lomet,et al.  The hB-tree: a multiattribute indexing method with good guaranteed performance , 1990, TODS.

[14]  Hung-Yi Lin,et al.  A new indexing method with high storage utilization and retrieval efficiency for large spatial databases , 2007, Inf. Softw. Technol..

[15]  Moncef Gabbouj,et al.  Hierarchical Cellular Tree: An Efficient Indexing Scheme for Content-Based Retrieval on Multimedia Databases , 2007, IEEE Transactions on Multimedia.

[16]  Joshua B. Tenenbaum,et al.  Sparse multidimensional scaling using land-mark points , 2004 .

[17]  Cristina Ribeiro,et al.  Multidimensional Descriptor Indexing: Exploring the BitMatrix , 2006, CIVR.

[18]  C.-C. Jay Kuo,et al.  Introduction to Content‐Based Image Retrieval—Overview of Key Techniques , 2002 .

[19]  Walid G. Aref,et al.  Spatio-Temporal Access Methods: Part 2 (2003 - 2010) , 2010, IEEE Data Eng. Bull..

[20]  V. P. Subramanyam Rallabandi,et al.  Image retrieval system using R-tree self-organizing map , 2007, Data Knowl. Eng..

[21]  Hanan Samet,et al.  Index-driven similarity search in metric spaces (Survey Article) , 2003, TODS.

[22]  Agma J. M. Traina,et al.  Supporting content-based image retrieval and computer-aided diagnosis systems with association rule-based techniques , 2009, Data Knowl. Eng..

[23]  Leonidas J. Guibas,et al.  The Earth Mover's Distance as a Metric for Image Retrieval , 2000, International Journal of Computer Vision.

[24]  Christian Böhm,et al.  Searching in high-dimensional spaces: Index structures for improving the performance of multimedia databases , 2001, CSUR.

[25]  Chabane Djeraba,et al.  KPYR: An Efficient Indexing Method , 2005, 2005 IEEE International Conference on Multimedia and Expo.

[26]  Heng Tao Shen,et al.  Exploring Bit-Difference for Approximate KNN Search in High-dimensional Databases , 2005, ADC.

[27]  Y. Mori,et al.  Image-to-word transformation based on dividing and vector quantizing images with words , 1999 .

[28]  Shih-Fu Chang,et al.  Image Retrieval: Current Techniques, Promising Directions, and Open Issues , 1999, J. Vis. Commun. Image Represent..

[29]  Hans-Peter Kriegel,et al.  The X-tree : An Index Structure for High-Dimensional Data , 2001, VLDB.

[30]  K. Revathy,et al.  USING MUTUAL INFORMATION AND CROSS CORRELATION AS METRICS FOR REGISTRATION OF IMAGES , 2008 .

[31]  Peter Widmayer,et al.  The LSD tree: spatial access to multidimensional and non-point objects , 1989, VLDB 1989.

[32]  David G. Lowe,et al.  Object recognition from local scale-invariant features , 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[33]  Bikesh Kumar Singh,et al.  Retrieval of changes in moving objects in multiple and color images , 2011, ICCCS '11.

[34]  A. Murat Tekalp,et al.  Integrated semantic-syntactic video modeling for search and browsing , 2004, IEEE Transactions on Multimedia.

[35]  Huiyu Zhou,et al.  Object tracking using SIFT features and mean shift , 2009, Comput. Vis. Image Underst..

[36]  Kankerm Güner,et al.  MPEG-7 COMPLIANT ORDBMS BASED IMAGE STORAGE AND RETRIEVAL SYSTEM , 2004 .

[37]  Jongho Nang,et al.  An efficient indexing structure for content based multimedia retrieval with relevance feedback , 2007, SAC '07.

[38]  Christos Faloutsos,et al.  FastMap: a fast algorithm for indexing, data-mining and visualization of traditional and multimedia datasets , 1995, SIGMOD '95.

[39]  J. T. Robinson,et al.  The K-D-B-tree: a search structure for large multidimensional dynamic indexes , 1981, SIGMOD '81.

[40]  Horst M. Eidenberger,et al.  How good are the visual MPEG-7 features? , 2003, Visual Communications and Image Processing.

[41]  Israel Spiegler,et al.  CM-tree: A dynamic clustered index for similarity search in metric databases , 2007, Data Knowl. Eng..

[42]  Cristina Ribeiro,et al.  An Evaluation Framework for Multidimensional Multimedia Descriptor Indexing , 2007, 2007 IEEE 23rd International Conference on Data Engineering Workshop.

[43]  Jan Flusser,et al.  Image registration methods: a survey , 2003, Image Vis. Comput..

[44]  E. Oja,et al.  COMPARISON OF TECHNIQUES FOR CONTENT-BASED IMAGE RETRIEVAL , 2001 .

[45]  Ronald R. Yager,et al.  On ordered weighted averaging aggregation operators in multicriteria decisionmaking , 1988, IEEE Trans. Syst. Man Cybern..

[46]  A. Aydin Alatan,et al.  A MPEG-7 compliant Video Management System: BilVMS , 2003 .

[47]  Jeffrey C. Lagarias,et al.  Convergence Properties of the Nelder-Mead Simplex Method in Low Dimensions , 1998, SIAM J. Optim..

[48]  Oliver Günther,et al.  Multidimensional access methods , 1998, CSUR.

[49]  Yueting Zhuang,et al.  Indexing high-dimensional data in dual distance spaces: a symmetrical encoding approach , 2008, EDBT '08.

[50]  Walid G. Aref,et al.  Spatio-Temporal Access Methods , 2003, IEEE Data Eng. Bull..

[51]  Beng Chin Ooi,et al.  Efficient Indexing of High-Dimensional Data Through Dimensionality Reduction , 2000, Data Knowl. Eng..

[52]  Pavel Zezula,et al.  M-tree: An Efficient Access Method for Similarity Search in Metric Spaces , 1997, VLDB.

[53]  Beng Chin Ooi,et al.  iDistance: An adaptive B+-tree based indexing method for nearest neighbor search , 2005, TODS.

[54]  Whoi-Yul Kim,et al.  Video/image retrieval system based on MPEG-7 (VIRS) , 2003, International Conference on Information Technology: Research and Education, 2003. Proceedings. ITRE2003..

[55]  Ronald R. Yager,et al.  On ordered weighted averaging aggregation operators in multicriteria decision-making , 1988 .

[56]  Kristen Grauman,et al.  Efficiently searching for similar images , 2010, Commun. ACM.

[57]  H. Buchner The Grid File : An Adaptable , Symmetric Multikey File Structure , 2001 .

[58]  Liang-Tien Chia,et al.  Mapping, indexing and querying of MPEG-7 descriptors in RDBMS with IXMDB , 2007, Data Knowl. Eng..

[59]  Hans-Peter Kriegel,et al.  The pyramid-technique: towards breaking the curse of dimensionality , 1998, SIGMOD '98.

[60]  Mohand-Said Hacid,et al.  A Database Approach for Modeling and Querying Video Data , 2000, IEEE Trans. Knowl. Data Eng..

[61]  Adnan Yazici,et al.  Slim-tree and BitMatrix index structures in image retrieval system using MPEG-7 Descriptors , 2008, 2008 International Workshop on Content-Based Multimedia Indexing.

[62]  Hans-Jörg Schek,et al.  A Quantitative Analysis and Performance Study for Similarity-Search Methods in High-Dimensional Spaces , 1998, VLDB.

[63]  B. S. Manjunath,et al.  Introduction to MPEG-7: Multimedia Content Description Interface , 2002 .

[64]  B. S. Manjunath,et al.  Registration Techniques for Multisensor Remotely Sensed Imagery , 1996 .