Indexing for Image Retrieval: A Machine Learning Based Approach

In this paper, we explore the use of machine learning for multimedia indexing and retrieval involving single or multiple features. Indexing of large image collections is a well-researched problem. However, the use of machine learning to combine features within an image indexing and retrieval framework has not been explored. In this context, the paper presents a novel formulation of multiple kernel learning in hashing for multimedia indexing. The framework uses a genetic algorithm to learn a combination of multiple features or modalities that defines composite document indices. We evaluate the framework on a dataset of handwritten digit images. Subsequently, we explore the utility of the framework for multi-modal retrieval of document images.
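The abstract does not spell out the formulation, so the following is only a minimal sketch of the general idea, not the authors' method. It assumes that a weighted sum of per-feature squared distances stands in for the learned kernel combination, that each hash bit compares an item's distance to the two members of a pivot pair, and that a simple genetic algorithm searches for the feature weights. All names (`combined_distance`, `hash_code`, `fitness`, `evolve_weights`), the toy "shape" and "texture" features, and the fitness measure are hypothetical.

```python
import random

def combined_distance(a, b, weights):
    """Weighted sum of per-feature squared Euclidean distances (assumed combination)."""
    return sum(w * sum((x - y) ** 2 for x, y in zip(fa, fb))
               for w, fa, fb in zip(weights, a, b))

def hash_code(item, pivots, weights):
    """One bit per pivot pair: 1 if the item is closer to the first pivot."""
    return tuple(int(combined_distance(item, p, weights) <
                     combined_distance(item, q, weights))
                 for p, q in pivots)

def fitness(weights, items, labels, pivots):
    """(same-label collisions - different-label collisions) / number of pairs."""
    codes = [hash_code(it, pivots, weights) for it in items]
    good = bad = total = 0
    for i in range(len(items)):
        for j in range(i + 1, len(items)):
            total += 1
            if codes[i] == codes[j]:
                if labels[i] == labels[j]:
                    good += 1
                else:
                    bad += 1
    return (good - bad) / total if total else 0.0

def normalise(w):
    """Clip weights at zero and rescale them to sum to one."""
    w = [max(x, 0.0) for x in w]
    s = sum(w)
    return [x / s for x in w] if s > 0 else [1.0 / len(w)] * len(w)

def evolve_weights(items, labels, pivots, n_features,
                   pop_size=12, generations=15, seed=0):
    """Toy GA: keep the better half, refill by averaging crossover plus Gaussian mutation."""
    rng = random.Random(seed)
    score = lambda w: fitness(w, items, labels, pivots)
    pop = [normalise([rng.random() for _ in range(n_features)])
           for _ in range(pop_size)]
    for _ in range(generations):
        pop.sort(key=score, reverse=True)
        parents = pop[:pop_size // 2]          # survivors carried over unchanged
        children = []
        while len(parents) + len(children) < pop_size:
            a, b = rng.sample(parents, 2)
            child = [(x + y) / 2 + rng.gauss(0, 0.1) for x, y in zip(a, b)]
            children.append(normalise(child))
        pop = parents + children
    return max(pop, key=score)

# Toy data (hypothetical): one feature tracks the label, the other is noise.
rng = random.Random(1)
items, labels = [], []
for label in (0, 1):
    for _ in range(10):
        shape = [label + rng.gauss(0, 0.1)]    # informative feature
        texture = [rng.gauss(0, 1.0)]          # uninformative feature
        items.append((shape, texture))
        labels.append(label)
pivots = [tuple(rng.sample(items, 2)) for _ in range(4)]
best = evolve_weights(items, labels, pivots, n_features=2)
print("learned feature weights:", best)
```

Items that receive the same code land in the same bucket, so retrieval only has to compare a query against its bucket. The fitness used here is one of many possible choices; a retrieval-oriented measure such as precision over a set of held-out queries could be substituted without changing the rest of the sketch.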
