Bimodal codebooks based adult video detection

Multi-modality based adult video detection is an effective approach to filtering pornography. However, existing methods lack accurate representations of multi-modal semantics. To address this issue, we propose a novel adult video detection method based on bimodal codebooks. First, the audio codebook is created by periodicity analysis of labeled audio segments. Second, the visual codebook is generated by detecting regions of interest (ROI) on the basis of saliency analysis. Finally, we combine the two codebooks to represent the co-occurrence semantics of the bimodal signals. The results show that our approach outperforms some state-of-the-art methods.
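The idea of combining two codebooks into a co-occurrence representation can be illustrated with a minimal sketch. It is not the authors' implementation: plain k-means stands in for the periodicity-based audio codebook and the saliency/ROI-based visual codebook, and the function names (`build_codebook`, `quantize`, `cooccurrence_histogram`) and the assumption of one audio and one visual feature vector per time-aligned segment are hypothetical.

```python
import numpy as np

def quantize(features, codebook):
    """Return the index of the nearest codeword for each feature vector."""
    dists = np.linalg.norm(features[:, None, :] - codebook[None, :, :], axis=2)
    return dists.argmin(axis=1)

def build_codebook(features, k, iters=20, seed=0):
    """Plain k-means (an assumed stand-in); returns k centroids as codewords."""
    rng = np.random.default_rng(seed)
    picks = rng.choice(len(features), size=k, replace=False)
    centroids = features[picks].astype(float)
    for _ in range(iters):
        labels = quantize(features, centroids)
        for j in range(k):
            members = features[labels == j]
            if len(members):  # keep the old centroid if a cluster is empty
                centroids[j] = members.mean(axis=0)
    return centroids

def cooccurrence_histogram(audio_words, visual_words, k_audio, k_visual):
    """Normalized joint histogram of (audio word, visual word) pairs.

    Assumes audio_words[i] and visual_words[i] describe the same segment.
    """
    hist = np.zeros((k_audio, k_visual))
    np.add.at(hist, (audio_words, visual_words), 1.0)
    return (hist / hist.sum()).ravel()
```

For a video with `n` aligned segments, the flattened histogram has `k_audio * k_visual` bins and could be passed to any standard classifier; the choice of classifier is left open here because the abstract does not name one.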
