Applying neural network to combining the heterogeneous features in content-based image retrieval

Content of an image can be expressed in terms of different features such as color, texture, shape, or text annotations. Retrieval based on these features can be various by the way how to combine the feature values. Most of the existing approaches assume a linear relationship between different features, and also require the user to directly assign weights to features. In particular, as the number of feature classes increases, intuition about how to pick relative weights among features is lost. While this linear combining approach establishes the basis of content-based image retrieval (CBIR), the usefulness of such systems was limited due to the difficulty in representing human perception subjectivity. In this paper, we introduce a Neural Network- based Image Retrieval system, a human-computer interaction approach to CBIR using Radial Basis Function network. This approach determines nonlinear relationship between features so that more accurate similarity comparison between images can be supported and allows the user to submit a coarse initial query and continuously refine his information need via relevance feedback. The experimental results show that the proposed approach has the superior retrieval performance than the existing approaches such as linearly combining approach, the rank-based method, and the BackPropagation- based method.

[1]  C. Micchelli Interpolation of scattered data: Distance matrices and conditionally positive definite functions , 1986 .

[2]  D. Signorini,et al.  Neural networks , 1995, The Lancet.

[3]  Rosalind W. Picard,et al.  Interactive Learning Using a "Society of Models" , 2017, CVPR 1996.

[4]  S. Sclaroff,et al.  ImageRover: a content-based image browser for the World Wide Web , 1997, 1997 Proceedings IEEE Workshop on Content-Based Access of Image and Video Libraries.

[5]  Shih-Fu Chang,et al.  Image Retrieval: Current Techniques, Promising Directions, and Open Issues , 1999, J. Vis. Commun. Image Represent..

[6]  B. S. Manjunath,et al.  Image indexing using a texture dictionary , 1995, Other Conferences.

[7]  Freddy Fierens,et al.  Interactive outlining: an improved approach using active contours , 1993, Electronic Imaging.

[8]  Tom Minka,et al.  Interactive learning with a "Society of Models" , 1996, Proceedings CVPR IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[9]  Dragutin Petkovic,et al.  Query by Image and Video Content: The QBIC System , 1995, Computer.

[10]  Amarnath Gupta,et al.  Virage image search engine: an open framework for image management , 1996, Electronic Imaging.

[11]  James Dowe,et al.  Content-based retrieval in multimedia imaging , 1993, Electronic Imaging.

[12]  Fang Liu,et al.  Periodicity, Directionality, and Randomness: Wold Features for Image Modeling and Retrieval , 1996, IEEE Trans. Pattern Anal. Mach. Intell..

[13]  Ruggero Milanese,et al.  A Rotation, Translation, and Scale-Invariant Approach to Content-Based Image Retrieval , 1999, J. Vis. Commun. Image Represent..

[14]  Thomas S. Huang,et al.  Content-based image retrieval with relevance feedback in MARS , 1997, Proceedings of International Conference on Image Processing.

[15]  F. Girosi Some extensions of radial basis functions and their applications in artificial intelligence , 1992 .

[16]  Alex Pentland,et al.  Photobook: tools for content-based manipulation of image databases , 1994, Electronic Imaging.

[17]  Jooyoung Park,et al.  Universal Approximation Using Radial-Basis-Function Networks , 1991, Neural Computation.

[18]  Shih-Fu Chang,et al.  Visually Searching the Web for Content , 1997, IEEE Multim..

[19]  B. S. Manjunath,et al.  NeTra: A toolbox for navigating large image databases , 1997, Multimedia Systems.

[20]  Shih-Fu Chang,et al.  Image and video search engine for the World Wide Web , 1997, Electronic Imaging.

[21]  Thomas P. Minka,et al.  An image database browser that learns from user interaction , 1996 .

[22]  B. S. Manjunath,et al.  Texture features and learning similarity , 1996, Proceedings CVPR IEEE Computer Society Conference on Computer Vision and Pattern Recognition.