Classifying offensive sites based on image content

This paper proposes a method for helping to identify adult web sites by using the imagecontent as means of detecting erotic material. The image content is classified by investigating probable skin-regions, and extracting their feature vectors. These feature vectors are based on color-, texture-, contour-, placement-, and relative size-information for a given region. The importance of the different elements in the feature vector is determined by a genetic algorithm. For each picture, the algorithm gives the probability that a certain picture has erotic content. By mapping all the images in a web site, and running the image-based classifier on the whole collection, we were able to set up a histogram of images with regards to the log-likelihood of erotic content for each image. Hence giving a good overview of the web site's content and at the same time leaving room for errors in the image-based classifier.The algorithm proved to be quite successful in our tests where all 20 sites where classified correctly. The image-based classifier is able to properly identify 89% of the evaluation images at an average processing speed of 11 images per second.Although this experiment focused on classifying adult web sites, small alterations to the system can be done, enabling classification of other kinds of images and web sites.

[1]  Min C. Shin,et al.  Does colorspace transformation make any difference on skin detection? , 2002, Sixth IEEE Workshop on Applications of Computer Vision, 2002. (WACV 2002). Proceedings..

[2]  Neal R. Harvey,et al.  Feature extraction from multiple data sources using genetic programming , 2002, SPIE Defense + Commercial Sensing.

[3]  Lars Bretzner,et al.  Hand gesture recognition using multi-scale colour features, hierarchical models and particle filtering , 2002, Proceedings of Fifth IEEE International Conference on Automatic Face Gesture Recognition.

[4]  Remco C. Veltkamp,et al.  State of the Art in Shape Matching , 2001, Principles of Visual Information Retrieval.

[5]  E. Granum,et al.  Skin colour detection under changing lighting conditions , 1999 .

[6]  Anil K. Jain,et al.  Image retrieval using color and shape , 1996, Pattern Recognit..

[7]  Rajeev Motwani,et al.  The PageRank Citation Ranking : Bringing Order to the Web , 1999, WWW 1999.

[8]  Anil K. Jain,et al.  Face Detection in Color Images , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[9]  David A. Forsyth,et al.  Identifying nude pictures , 1996, Proceedings Third IEEE Workshop on Applications of Computer Vision. WACV'96.

[10]  Leonidas J. Guibas,et al.  Shape-based Image Retrieval Using Geometric Hashing , 1997 .

[11]  Peter Alshuth,et al.  IRIS - Image retrieval for images and videos , 1996 .

[12]  Melanie Mitchell,et al.  Investigation of image feature extraction by a genetic algorithm , 1999, Optics + Photonics.

[13]  Elli Angelopoulou,et al.  Understanding the color of human skin , 2001, IS&T/SPIE Electronic Imaging.

[14]  James M. Rehg,et al.  Vision-based speaker detection using Bayesian networks , 1999, Proceedings. 1999 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Cat. No PR00149).

[15]  Cyrus Shahabi,et al.  Image retrieval by shape: a comparative study , 2000, 2000 IEEE International Conference on Multimedia and Expo. ICME2000. Proceedings. Latest Advances in the Fast Changing World of Multimedia (Cat. No.00TH8532).

[16]  Neal R. Harvey,et al.  Genetic algorithm for combining new and existing image processing tools for multispectral imagery , 2000, SPIE Defense + Commercial Sensing.

[17]  R. Poli Genetic programming for image analysis , 1996 .

[18]  Walter Alden Tackett,et al.  Genetic Programming for Feature Discovery and Image Discrimination , 1993, ICGA.

[19]  Alexander H. Waibel,et al.  Skin-Color Modeling and Adaptation , 1998, ACCV.

[20]  Bernt Schiele,et al.  Skin Patch Detection in Real-World Images , 2002, DAGM-Symposium.

[21]  Donald Geman,et al.  Stochastic Relaxation, Gibbs Distributions, and the Bayesian Restoration of Images , 1984, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[22]  Jong Soo Park,et al.  2-D Invariant Descriptors for Shape-Based Image Retrieval , 2001 .

[23]  Fritz Albregtsen,et al.  New texture features based on the complexity curve , 1999, Pattern Recognit..

[24]  J. Birgitta Martinkauppi,et al.  Behavior of skin color under varying illumination seen by different cameras at different color spaces , 2001, IS&T/SPIE Electronic Imaging.