Multi-stage classi cation of images from features and related text

The synergy of textual and visual information in Web documents provides great opportunity for improving the image indexing and searching capabilities of Web image search engines. We explore a new approach for automatically classifying images using image features and related text. In particular, we de ne a multi-stage classi cation system which progressively restricts the perceived class of each image through applications of increasingly specialized classi ers. Furthermore, we exploit the related textual information in a novel process that automatically constructs the training data for the image classi ers. We demonstrate initial results on classifying photographs and graphics from the Web.

[1]  Dragutin Petkovic,et al.  Query by Image and Video Content: The QBIC System , 1995, Computer.

[2]  Hayit Greenspan,et al.  Finding Pictures of Objects in Large Collections of Images , 1996, Object Representation in Computer Vision.

[3]  Amarnath Gupta,et al.  Virage image search engine: an open framework for image management , 1996, Electronic Imaging.

[4]  C. Frankel,et al.  Distinguishing photographs and graphics on the World Wide Web , 1997, 1997 Proceedings IEEE Workshop on Content-Based Access of Image and Video Libraries.

[5]  Shih-Fu Chang,et al.  Visually Searching the Web for Content , 1997, IEEE Multim..

[6]  Shih-Fu Chang,et al.  VisualSEEk: a fully automated content-based image query system , 1997, MULTIMEDIA '96.

[7]  Neil C. Rowe,et al.  Automatic Caption Localization for Photographs on World Wide Web Pages , 1998, Inf. Process. Manag..