Classifying images collected on the World Wide Web

This work presents the classification of images collected on the World Wide Web using ID3 (Iterative Dichotomiser 3), a supervised decision-tree learning method. The classifier separates images into two semantic classes: graphics and photographs. Photographs depict natural scenes such as people, faces, animals, flowers, landscapes, and cities. Graphics are logos, drawings, icons, maps, and backgrounds, usually generated by computer. We validated the classifier with k-fold cross-validation. In the experimental tests, 95.6% of the images were correctly classified.
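As a rough illustration of the two techniques named above, the following minimal sketch shows the information-gain criterion that ID3 uses to choose a split attribute, together with a plain k-fold index split. The feature names (`few_colors`, `large`), the toy data, and all function names are hypothetical and chosen only for illustration; they are not the features or the implementation used in this work.

```python
import math
from collections import Counter

def entropy(labels):
    """Shannon entropy (in bits) of a non-empty list of class labels."""
    total = len(labels)
    return -sum((n / total) * math.log2(n / total)
                for n in Counter(labels).values())

def information_gain(rows, labels, attr):
    """Entropy reduction obtained by splitting on categorical attribute `attr`."""
    total = len(labels)
    partitions = {}
    for row, label in zip(rows, labels):
        partitions.setdefault(row[attr], []).append(label)
    remainder = sum(len(p) / total * entropy(p) for p in partitions.values())
    return entropy(labels) - remainder

def best_attribute(rows, labels, attrs):
    """ID3 picks the attribute with the highest information gain at each node."""
    return max(attrs, key=lambda a: information_gain(rows, labels, a))

def k_fold_indices(n, k):
    """Yield (train, test) index lists for k contiguous, near-equal folds."""
    start = 0
    for i in range(k):
        size = n // k + (1 if i < n % k else 0)
        test = list(range(start, start + size))
        train = list(range(0, start)) + list(range(start + size, n))
        yield train, test
        start += size

# Hypothetical toy data: two made-up boolean image features per image.
rows = [
    {"few_colors": True,  "large": False},
    {"few_colors": True,  "large": True},
    {"few_colors": False, "large": True},
    {"few_colors": False, "large": False},
]
labels = ["graphic", "graphic", "photograph", "photograph"]

root = best_attribute(rows, labels, ["few_colors", "large"])
folds = list(k_fold_indices(len(rows), 2))
```

In this toy data the `few_colors` attribute separates the two classes perfectly, so it would be chosen as the root split. A full ID3 implementation would recurse on each partition until the labels are pure or no attributes remain, and cross-validation would train and score one tree per fold.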
