论文信息 - An approach for offensive text detection and prevention in Social Networks

An approach for offensive text detection and prevention in Social Networks

Social Network has become a place where people from every corner of the world has established a virtual civilization. In this virtual community, people used to share their views, express their feelings, photos, videos, blogs, etc. Social Networking Sites like Facebook, Twitters, etc. has given a platform to share innumerable contents with just a click of a button. However, there is no restriction applied by them for the uploaded content. These uploaded content may contains abusive words, explicit images which may be unsuitable for social platforms. As such there is no defined mechanism for restricting offensive contents from publishing on social sites. To solve this problem we have used our proposed approach. In our approach we are developing a social network prototype for implementing our approach for automatic filtering of offensive content in social network. Many popular social networking sites today don't have proper mechanism for restricting offensive contents. They use reporting methods in which user report if the content is abuse. This requires substantial human efforts and time. In this paper, we applied pattern matching algorithm for offensive keyword detection from social networking comments and prevent it from publishing on social platform. Apart from conventional method of reporting abusive contents by users our approach does not requires any human intervention and thus restrict offensive words by detecting and preventing it automatically.

Shashank H. Yadav | Pratik M. Manwatkar

[1] W. B. Cavnar,et al. N-gram-based text categorization , 1994 .

[2] Ying Chen,et al. Detecting Offensive Language in Social Media to Protect Adolescent Online Safety , 2012, 2012 International Conference on Privacy, Security, Risk and Trust and 2012 International Confernece on Social Computing.

[3] S. C. Hui,et al. An intelligent categorization engine for bilingual web content filtering , 2005, IEEE Transactions on Multimedia.

[4] S. C. Hui,et al. Neural Networks for Web Content Filtering , 2002, IEEE Intell. Syst..

[5] Reihaneh Safavi-Naini,et al. Web filtering using text classification , 2003, The 11th IEEE International Conference on Networks, 2003. ICON2003..

[6] Paul A. Watters,et al. Statistical and structural approaches to filtering Internet pornography , 2004, 2004 IEEE International Conference on Systems, Man and Cybernetics (IEEE Cat. No.04CH37583).

[7] Zhouyu Fu,et al. Recognition of Pornographic Web Pages by Classifying Texts and Images , 2007, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[8] Tomáš ÖLVECKÝ. N-Gram Based Statistics Aimed at Language Identification , 2005 .

[9] Félix Gómez Mármol,et al. Reporting Offensive Content in Social Networks: Toward a Reputation-Based Assessment Approach , 2014, IEEE Internet Computing.

[10] Carolyn Penstein Rosé,et al. Detecting offensive tweets via topical feature discovery over a large scale twitter corpus , 2012, CIKM.