Fusing Audio-Words with Visual Features for Pornographic Video Detection

The traditional approach of filtering pornographic videos on the Internet is based on visual features of key frames. However, it cannot meet users' needs owing to the proliferation of low-resolution videos. To improve the filtering performance, we propose a novel framework of fusing audio-words with visual features for pornographic video detection. Our intention is not only to fuse the two modalities of visual images and audio signals, but also to narrow down the semantic gap between low-level features and high-level concepts by using the mid-level feature "audio-words". To further improve the performance, we present the segmentation algorithm based on units of energy envelope and the decision algorithm based on periodic patterns. The results show that our approach outperforms the traditional one which is based on visual features and achieves satisfactory performance. Moreover, the proposed segmentation algorithm is better than the conventional one using the same length and the proposed decision algorithm exceeds the conventional one using thresholds.

[1]  Bo Xu,et al.  Recognition of blue movies by fusion of audio and video , 2008, 2008 IEEE International Conference on Multimedia and Expo.

[2]  Qun Liu,et al.  Fast commercial detection based on audio retrieval , 2008, 2008 IEEE International Conference on Multimedia and Expo.

[3]  Yang Liu,et al.  IMAGE GUARDER: AN INTELLIGENT DETECTOR FOR ADULT IMAGES , 2003 .

[4]  Wei Liang,et al.  A novel approach to musical genre classification using probabilistic latent semantic analysis model , 2009, 2009 IEEE International Conference on Multimedia and Expo.

[5]  Oh-Jin Kwon,et al.  Automatic System for Filtering Obscene Video , 2008, 2008 10th International Conference on Advanced Communication Technology.

[6]  Sheng Tang,et al.  Pornprobe: an LDA-SVM based pornography detection system , 2009, ACM Multimedia.

[7]  Gerard Lacey,et al.  Multimodal Periodicity Analysis for Illicit Content Detection in Videos , 2006 .

[8]  Shumeet Baluja,et al.  Large scale image-based adult-content filtering , 2006, VISAPP.

[9]  Qi Tian,et al.  Periodicity Detection of Local Motion , 2005, 2005 IEEE International Conference on Multimedia and Expo.

[10]  Seungmin Lee,et al.  Implementation of high performance objectionable video classification system , 2006, 2006 8th International Conference Advanced Communication Technology.

[11]  David A. Forsyth,et al.  Finding Naked People , 1996, ECCV.

[12]  Andreas Jakobsson,et al.  Classification of indecent videos by low complexity repetitive motion detection , 2008, 2008 37th IEEE Applied Imagery Pattern Recognition Workshop.

[13]  Zhiwu Lu,et al.  Semantic concept annotation based on audio PLSA model , 2009, MM '09.

[14]  Adrian Ulges,et al.  Detecting pornographic video content by combining image features with motion information , 2009, ACM Multimedia.

[15]  Hermann Ney,et al.  Bag-of-visual-words models for adult image classification and filtering , 2008, 2008 19th International Conference on Pattern Recognition.