Efficient content-based image retrieval using Multiple Support Vector Machines Ensemble

Highlights? Effective CBIR for non-texture images. ? An extremely fast CBIR system which uses Multiple Support Vector Machines Ensemble. ? Using Daubechies wavelet transformation for extracting the feature vectors of images. With the evolution of digital technology, there has been a significant increase in the number of images stored in electronic format. These range from personal collections to medical and scientific images that are currently collected in large databases. Many users and organizations now can acquire large numbers of images and it has been very important to retrieve relevant multimedia resources and to effectively locate matching images in the large databases. In this context, content-based image retrieval systems (CBIR) have become very popular for browsing, searching and retrieving images from a large database of digital images with minimum human intervention. The research community are competing for more efficient and effective methods as CBIR systems may be heavily employed in serving time critical applications in scientific and medical domains. This paper proposes an extremely fast CBIR system which uses Multiple Support Vector Machines Ensemble. We have used Daubechies wavelet transformation for extracting the feature vectors of images. The reported test results are very promising. Using data mining techniques not only improved the efficiency of the CBIR systems, but they also improved the accuracy of the overall process.

[1]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[2]  Stefan M. Rüger,et al.  Evaluation of Texture Features for Content-Based Image Retrieval , 2004, CIVR.

[3]  Peter N. Yianilos,et al.  Data structures and algorithms for nearest neighbor search in general metric spaces , 1993, SODA '93.

[4]  Shenghuo Zhu,et al.  A survey on wavelet applications in data mining , 2002, SKDD.

[5]  Kannan Ramchandran,et al.  Multimedia Analysis and Retrieval System (MARS) Project , 1996, Data Processing Clinic.

[6]  Gavin Powell,et al.  Beginning Database Design and Implementation , 2005 .

[7]  Lior Rokach,et al.  Data Mining And Knowledge Discovery Handbook , 2005 .

[8]  Tansel Özyer,et al.  Clustering by Integrating Multi-objective Optimization with Weighted K-Means and Validity Analysis , 2006, IDEAL.

[9]  Richard M. Timoney An Introduction to Wavelets , 2000 .

[10]  Hans-Peter Kriegel,et al.  OPTICS: ordering points to identify the clustering structure , 1999, SIGMOD '99.

[11]  Akifumi Makinouchi,et al.  Content-Based Image Retrieval Technique Using Wavelet-Based Shift and Brightness Invariant Edge Feature , 2003, Int. J. Wavelets Multiresolution Inf. Process..

[12]  Christos Faloutsos,et al.  QBIC project: querying images by content, using color, texture, and shape , 1993, Electronic Imaging.

[13]  Jack Sklansky,et al.  Image Segmentation and Feature Extraction , 1978, IEEE Transactions on Systems, Man, and Cybernetics.

[14]  James C. Bezdek,et al.  Some new indexes of cluster validity , 1998, IEEE Trans. Syst. Man Cybern. Part B.

[15]  Shinji Ozawa,et al.  Semantic-meaningful content-based image retrieval in wavelet domain , 2003, MIR '03.

[16]  S. Sitharama Iyengar,et al.  Content based image retrieval systems , 1999, Proceedings 1999 IEEE Symposium on Application-Specific Systems and Software Engineering and Technology. ASSET'99 (Cat. No.PR00122).

[17]  Amarnath Gupta,et al.  Virage image search engine: an open framework for image management , 1996, Electronic Imaging.

[18]  Sam Lightstone,et al.  Physical Database Design: the database professional's guide to exploiting indexes, views, storage, and more , 2007 .

[19]  Reda Alhajj,et al.  WaveQ: Combining Wavelet Analysis and Clustering for Effective Image Retrieval , 2007, 21st International Conference on Advanced Information Networking and Applications Workshops (AINAW'07).

[20]  James Ze Wang,et al.  SIMPLIcity: Semantics-Sensitive Integrated Matching for Picture LIbraries , 2001, IEEE Trans. Pattern Anal. Mach. Intell..

[21]  Nello Cristianini,et al.  An Introduction to Support Vector Machines and Other Kernel-based Learning Methods , 2000 .

[22]  Frédéric Jurie,et al.  Randomized Clustering Forests for Image Classification , 2008, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[23]  Jeffrey K. Uhlmann,et al.  Satisfying General Proximity/Similarity Queries with Metric Trees , 1991, Inf. Process. Lett..

[24]  Tshilidzi Marwala,et al.  Image Classification Using SVMs: One-against-One Vs One-against-All , 2007, ArXiv.

[25]  C.-C. Jay Kuo,et al.  Texture analysis and classification with tree-structured wavelet transform , 1993, IEEE Trans. Image Process..

[26]  Ming Zhang,et al.  Effectiveness of NAQ-tree as index structure for similarity search in high-dimensional metric space , 2010, Knowledge and Information Systems.

[27]  Remco C. Veltkamp,et al.  Content-based image retrieval systems: A survey , 2000 .

[28]  Lipo Wang Support vector machines : theory and applications , 2005 .

[29]  Yizhuo Zhang,et al.  Constructing Multiple Support Vector Machines Ensemble Based on Fuzzy Integral and Rough Reducts , 2007, 2007 2nd IEEE Conference on Industrial Electronics and Applications.

[30]  Alex Pentland,et al.  Photobook: tools for content-based manipulation of image databases , 1994, Electronic Imaging.

[31]  Michalis Vazirgiannis,et al.  Clustering validity assessment: finding the optimal partitioning of a data set , 2001, Proceedings 2001 IEEE International Conference on Data Mining.