Development of a real-time nudity censorship system on images

We propose a system for detecting and localizing nudity content in images and series of images (video), so that further action can be done on relevant segments (typically covering). We also attempt to make this system able to be run on video input, compromising frame rate as little as possible. In this paper we use a skin filtering method based on a Bayes rule, followed by a novel histogram backprojection of skin samples selected based on standard deviation and gradients. The filtered image is then clustered into a Bag-of-Visual-Words (BOVW) and classified using SVM. Localization is then achieved by shrinking the image window borders on which the classification is performed. The skin filtering method described here has an increased recall rate compared to filtering using pure Bayes rule. The nudity classification phase achieved a highest accuracy rate of 76.11% on a 5-fold cross validation test. Finally, the localization phase, when run on video clips, has successfully localized a minimum of a quarter of nude segments in a clip, although many cases of over segmentation still occur. The system is able to maintain a highest average frame rate of 15 FPS when run on videos using hardware specifications that we used.

[1]  Ricardo A. Baeza-Yates,et al.  Characterizing objectionable image content (pornography and nude images) of specific Web segments: Chile as a case study , 2005, Third Latin American Web Congress (LA-WEB'2005).

[2]  David G. Lowe,et al.  Object recognition from local scale-invariant features , 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[3]  John F. Canny,et al.  A Computational Approach to Edge Detection , 1986, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[4]  Koen E. A. van de Sande,et al.  Evaluating Color Descriptors for Object and Scene Recognition , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[5]  Arnaldo de Albuquerque Araújo,et al.  Nude Detection in Video Using Bag-of-Visual-Features , 2009, 2009 XXII Brazilian Symposium on Computer Graphics and Image Processing.

[6]  Roland Siegwart,et al.  BRISK: Binary Robust invariant scalable keypoints , 2011, 2011 International Conference on Computer Vision.

[7]  Christopher Hunt,et al.  Notes on the OpenSURF Library , 2009 .

[8]  James M. Rehg,et al.  Statistical Color Models with Application to Skin Detection , 2004, International Journal of Computer Vision.

[9]  N. Otsu A threshold selection method from gray level histograms , 1979 .

[10]  Ieee Staff,et al.  2019 7th International Conference on Information and Communication Technology (ICoICT) , 2014 .

[11]  Arnaldo de Albuquerque Araújo,et al.  A bag-of-features approach based on Hue-SIFT descriptor for nude detection , 2009, 2009 17th European Signal Processing Conference.

[12]  Luc Van Gool,et al.  Speeded-Up Robust Features (SURF) , 2008, Comput. Vis. Image Underst..