Combining Classifiers for Web Violent Content Detection and Filtering

Keeping people away from litigious information becomes one of the most important research area in network information security. Indeed, Web filtering is used to prevent access to undesirable Web pages. In this paper we review some existing solutions, then we propose a violent Web content detection and filtering system called "WebAngels filter" which uses textual and structural analysis. "WebAngels filter" has the advantage of combining several data-mining algorithms for Web site classification. We discuss how the combination learning based methods can improve filtering performances. Our preliminary results show that it can detect and filter violent content effectively.

[1]  Liming Chen,et al.  WebGuard: a Web filtering engine combining textual, structural, and visual content-based analysis , 2006, IEEE Transactions on Knowledge and Data Engineering.

[2]  J. Ross Quinlan,et al.  Induction of Decision Trees , 1986, Machine Learning.

[3]  Ramayya Krishnan,et al.  A method for managing access to web pages: Filtering by Statistical Classification (FSC) applied to text , 2006, Decis. Support Syst..

[4]  Paul A. Watters,et al.  Statistical and structural approaches to filtering Internet pornography , 2004, 2004 IEEE International Conference on Systems, Man and Cybernetics (IEEE Cat. No.04CH37583).

[5]  S. C. Hui,et al.  Neural Networks for Web Content Filtering , 2002, IEEE Intell. Syst..

[6]  Bjørn Olstad,et al.  Classifying offensive sites based on image content , 2004, Comput. Vis. Image Underst..

[7]  J. Ross Quinlan,et al.  C4.5: Programs for Machine Learning , 1992 .

[8]  Gregory Piatetsky-Shapiro,et al.  The KDD process for extracting useful knowledge from volumes of data , 1996, CACM.