Paper information - An Approach to Creating an Intelligent System for Detecting and Countering Inappropriate Information on the Internet

The Internet is becoming one of the most serious threats to personal, public, and state information security, so detecting and counteracting inappropriate information in digital network content is now a task of national importance. This paper proposes a new approach to building an intelligent system for detecting and counteracting inappropriate information on the Internet, based on machine learning methods and big data processing, and describes the architecture of such a system. One of the system's most important components performs multidimensional evaluation and categorization of information objects. Experimental evaluation of this component in single-threaded and multi-threaded modes showed that various classifiers from the Python Scikit-learn and Spark MLlib libraries solve the problem with high efficiency.
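
The categorization step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the paper evaluates classifiers from Scikit-learn and Spark MLlib, whereas this example substitutes a small hand-written multinomial naive Bayes classifier built only on the Python standard library. The training texts, the category names `inappropriate` and `acceptable`, and the helper names (`train_naive_bayes`, `classify`, `classify_all`) are all hypothetical and chosen for illustration.

```python
import math
from collections import Counter, defaultdict
from concurrent.futures import ThreadPoolExecutor

# Hypothetical toy corpus: (text, category) pairs invented for illustration.
TRAIN = [
    ("win free money now", "inappropriate"),
    ("free casino bonus click now", "inappropriate"),
    ("weather forecast for the weekend", "acceptable"),
    ("school schedule and homework help", "acceptable"),
]


def tokenize(text):
    """Lowercase the text and split it on whitespace."""
    return text.lower().split()


def train_naive_bayes(samples):
    """Collect per-category word counts, document counts, and the vocabulary."""
    word_counts = defaultdict(Counter)
    doc_counts = Counter()
    vocab = set()
    for text, label in samples:
        tokens = tokenize(text)
        word_counts[label].update(tokens)
        doc_counts[label] += 1
        vocab.update(tokens)
    return word_counts, doc_counts, vocab


def classify(text, model):
    """Return the category with the highest log-posterior (Laplace smoothing)."""
    word_counts, doc_counts, vocab = model
    total_docs = sum(doc_counts.values())
    best_label, best_score = None, -math.inf
    for label in doc_counts:
        score = math.log(doc_counts[label] / total_docs)
        total_words = sum(word_counts[label].values())
        for token in tokenize(text):
            smoothed = (word_counts[label][token] + 1) / (total_words + len(vocab))
            score += math.log(smoothed)
        if score > best_score:
            best_label, best_score = label, score
    return best_label


def classify_all(texts, model, workers=1):
    """Categorize texts sequentially (workers <= 1) or with a thread pool."""
    if workers <= 1:
        return [classify(t, model) for t in texts]
    with ThreadPoolExecutor(max_workers=workers) as pool:
        # pool.map preserves input order, so results line up with texts.
        return list(pool.map(lambda t: classify(t, model), texts))


model = train_naive_bayes(TRAIN)
pages = ["free money bonus", "homework schedule for school"]
single = classify_all(pages, model, workers=1)  # single-threaded mode
multi = classify_all(pages, model, workers=4)   # multi-threaded mode
```

Both modes return the same categories in the same order; only the scheduling differs. In standard CPython builds the global interpreter lock limits the speedup threads can give for CPU-bound work like this, so the sketch shows the structure of the two modes rather than a performance result.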

Igor Saenko | Olga Tushkanova | Lidiya Vitkova
