Categorizing images in Web documents

The Web provides an increasingly powerful and popular publication mechanism. Web documents often contain a large number of images that serve various purposes. Identifying the functional categories of these images is an important task in Web repurposing. This article describes a study on the functional categorization of Web images using data collected from news Web sites. As the popularity of the Web soars, the content on the Web is increasingly accessed from wireless devices that have small screens and different bandwidths. Because many Web documents contain a large number of images serving different purposes, how to identify the function of each image so that it can be handled accordingly is an important issue in Web content repurposing. Much work remains to be done in function-based image classification of all images. Icons that appear regularly on Web sites (for example, newspaper logos) could be classified by analyzing different editions of the pages for repetitions. For the host class, a combination of detecting repetitive images and face recognition should help significantly.

[1]  Robert M. Gray,et al.  Text and picture segmentation by the distribution analysis of wavelet coefficients , 1998, Proceedings 1998 International Conference on Image Processing. ICIP98 (Cat. No.98CB36269).

[2]  Bernd Girod,et al.  Classification of compound images based on transform coefficient likelihood , 2001, Proceedings 2001 International Conference on Image Processing (Cat. No.01CH37205).

[3]  John R. Smith,et al.  Detecting image purpose in World Wide Web documents , 1998, Electronic Imaging.

[4]  Tapas Kanungo What Fraction of Images on the Web Contain Text ? , 2001 .

[5]  Anil K. Jain,et al.  Algorithms for Clustering Data , 1988 .

[6]  Jianying Hu,et al.  Flexible Web document analysis for delivery to narrow-bandwidth devices , 2001, Proceedings of Sixth International Conference on Document Analysis and Recognition.

[7]  Michael J. Swain,et al.  WebSeer: An Image Search Engine for the World Wide Web , 1996 .

[8]  Daniel P. Lopresti,et al.  Locating and Recognizing Text in WWW Images , 2000, Information Retrieval.

[10]  Andreas Girgensohn,et al.  Web Page Filtering and Re-Authoring for Mobile Users , 1999, Comput. J..

[11]  Andreas Paepcke,et al.  Seeing the whole in parts: text summarization for web browsing on handheld devices , 2001, WWW '01.

[12]  Thorsten Joachims,et al.  Making large scale SVM learning practical , 1998 .

[13]  Yueting Zhuang,et al.  OCTOPUS: aggressive search of multi-modality data using multifaceted knowledge base , 2002, WWW '02.

[14]  Ethan V. Munson To Search for Images on the Web , Look at the Text , Then Look at the Images , 2001 .

[15]  Apostolos Antonacopoulos,et al.  An Anthropocentric Approach to Text Extraction from WWW Images , 2000 .

[16]  Robert M. Gray,et al.  Text segmentation in mixed-mode images using classification trees and transform tree-structured vector quantization , 1996, 1996 IEEE International Conference on Acoustics, Speech, and Signal Processing Conference Proceedings.