Combining SVM and CHMM classifiers for porno video recognition

Porno video recognition is important for Internet content monitoring. In this paper, a novel porno video recognition method by fusing the audio and video cues is proposed. Firstly, global color and texture features and local scale-invariant feature transform (SIFT) are extracted to train multiple support vector machine (SVM) classifiers for different erotic categories of image frames. And then, two continuous density hidden Markov models (CHMM) are built to recognize porno sounds. Finally, a fusion method based on Bayes rule is employed to combine the classification results by video and audio cues. The experimental results show that our model is better than six state-of-the-art methods.

[1]  Bo Xu,et al.  Recognition of blue movies by fusion of audio and video , 2008, 2008 IEEE International Conference on Multimedia and Expo.

[2]  David G. Lowe,et al.  Distinctive Image Features from Scale-Invariant Keypoints , 2004, International Journal of Computer Vision.

[3]  Subhajit Sanyal,et al.  Detection of pornographic content in internet images , 2011, MM '11.

[4]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[5]  Wang Guo-yin An expression recognition method based on local feature fusion , 2011 .

[6]  Wen Gao,et al.  Adult Image Detection Method Base-on Skin Color Model and Support Vector Machine , 2001 .

[7]  James Ze Wang,et al.  System for screening objectionable images , 1998, Comput. Commun..

[8]  David A. Forsyth,et al.  Automatic Detection of Human Nudes , 1999, International Journal of Computer Vision.

[9]  Tao Liu,et al.  BUPT at TRECVID 2007: Shot Boundary Detection , 2007, TRECVID.

[10]  Jing Huang,et al.  Image indexing using color correlograms , 1997, Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[11]  Arnaldo de Albuquerque Araújo,et al.  A bag-of-features approach based on Hue-SIFT descriptor for nude detection , 2009, 2009 17th European Signal Processing Conference.

[12]  James M. Rehg,et al.  Fast Asymmetric Learning for Cascade Face Detection , 2008, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[13]  Bin Li,et al.  Region-based Pornographic Image Detection , 2005, 2005 IEEE 7th Workshop on Multimedia Signal Processing.

[14]  James M. Rehg,et al.  Statistical Color Models with Application to Skin Detection , 2004, International Journal of Computer Vision.

[15]  Joachim M. Buhmann,et al.  Distortion Invariant Object Recognition in the Dynamic Link Architecture , 1993, IEEE Trans. Computers.