Learning Semantic Concepts from Visual Data Using Neural Networks

For content-based image retrieval techniques, query image is used to pick up and rank some relevant images from a database using some certain similarity metric. If semantic features are not involved in the modeling of visual data, the resulting system may demonstrate a disability of retrieving images likely associated with interesting semantic concepts of objects in the images. Therefore, issues on semantics representation, automatic extraction of semantic concepts from visual data, and effects of window size on the concepts recogni- tion are needed to study. This paper describes an approach towards these prob- lems. We first define a set of semantic concepts characterizing the outdoor im- ages. Then, a neural network is employed to memory the semantic concepts through pattern learning techniques. Lastly, the well-trained neural networks will perform as a classifier to identify the predefined semantics within an image. Empirical studies and comparison with decision tree techniques are carried out.

[1]  Lefteri H. Tsoukalas,et al.  Fuzzy and neural approaches in engineering , 1997 .

[2]  Shih-Fu Chang,et al.  Image Retrieval: Current Techniques, Promising Directions, and Open Issues , 1999, J. Vis. Commun. Image Represent..

[3]  Alex Pentland,et al.  Photobook: tools for content-based manipulation of image databases , 1994, Electronic Imaging.

[4]  James Ze Wang,et al.  SIMPLIcity: Semantics-Sensitive Integrated Matching for Picture LIbraries , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[5]  Michele Nappi,et al.  FIRST: Fractal Indexing and Retrieval SysTem for Image Databases , 1998, Image Vis. Comput..

[6]  Geoffrey E. Hinton,et al.  Learning internal representations by error propagation , 1986 .

[7]  Marcel Worring,et al.  Content-Based Image Retrieval at the End of the Early Years , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[8]  Shih-Fu Chang,et al.  VisualSEEk: a fully automated content-based image query system , 1997, MULTIMEDIA '96.

[9]  P. Yip,et al.  Discrete Cosine Transform: Algorithms, Advantages, Applications , 1990 .

[10]  Michael J. Swain,et al.  Color indexing , 1991, International Journal of Computer Vision.

[11]  B. S. Manjunath,et al.  NeTra: A toolbox for navigating large image databases , 1997, Multimedia Systems.

[12]  Jitendra Malik,et al.  Blobworld: A System for Region-Based Image Indexing and Retrieval , 1999, VISUAL.

[13]  Alex Pentland,et al.  Photobook: Content-based manipulation of image databases , 1996, International Journal of Computer Vision.

[14]  John P. Oakley,et al.  Storage and Retrieval for Image and Video Databases , 1993 .

[15]  Dragutin Petkovic,et al.  Query by Image and Video Content: The QBIC System , 1995, Computer.

[16]  N. Ahmed,et al.  Discrete Cosine Transform , 1996 .

[17]  James Ze Wang,et al.  SIMPLIcity: Semantics-Sensitive Integrated Matching for Picture LIbraries , 2001, IEEE Trans. Pattern Anal. Mach. Intell..

[18]  Tom Minka,et al.  Interactive learning with a "society of models" , 1997, Pattern Recognit..

[19]  Jacques de Villiers,et al.  Backpropagation neural nets with one and two hidden layers , 1993, IEEE Trans. Neural Networks.

[20]  Alex Pentland,et al.  Photobook: tools for content-based manipulation of image databases , 1994, Other Conferences.

[21]  Rosalind W. Picard,et al.  Interactive Learning Using a "Society of Models" , 2017, CVPR 1996.

[22]  Clement H. C. Leung Visual Information Systems , 1997, Lecture Notes in Computer Science.

[23]  Markus A. Stricker,et al.  Similarity of color images , 1995, Electronic Imaging.

[24]  Raimondo Schettini,et al.  A survey on methods for colour image indexing and retrieval in image databases , 2002 .

[25]  Thomas S. Huang,et al.  Image processing , 1971 .