Categorizing images in Web documents

The World Wide Web provides an increasingly powerful and popular publication mechanism. Web documents often contain a large number of images serving a variety of purposes. Identifying the functional categories of these images has important applications, including information extraction, web mining, web page summarization, and mobile access. An important first step towards designing algorithms for automatic categorization of images on the web is to identify the common categories and examine their properties and characteristics. This paper describes results from such an initial study using data collected from news web sites. We describe the image categories found in such web pages and their distributions, and identify the main research issues involved in automatically classifying images into these categories.
