An Expert System for Classifying Harmful Content on the Dark Web

In this research, we examine and develop an expert system that automates crime category classification and threat level assessment using information collected by crawling the dark web. We constructed a bag of words from 250 dark web posts and developed an expert system that takes term frequencies as input and classifies sample posts into six crime categories (drugs, stolen credit cards, passwords, counterfeit products, child pornography, and others) and three threat levels (high, middle, low). Contrary to prior expectations, our simple and explainable expert system performs competitively with existing systems. In short, our experiment with 1,500 dark web posts shows a recall of 76.4% for six-category crime classification, and a recall of 83% for three-level threat discrimination on 100 randomly sampled posts.
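The idea of a term-frequency-driven, explainable classifier can be illustrated with a minimal sketch. It is not the system described above: the keyword lists, the subset of categories, the function names, and the threat-level thresholds are all hypothetical, since the abstract does not give the actual bag of words or rules.

```python
import re
from collections import Counter

# Hypothetical keyword lists for a subset of the categories; the actual
# bag of words built from the 250 posts is not given in the abstract.
CATEGORY_TERMS = {
    "drugs": {"cocaine", "mdma", "gram"},
    "stolen credit cards": {"cvv", "fullz", "card"},
    "passwords": {"password", "login", "combo"},
    "counterfeit products": {"replica", "counterfeit", "fake"},
}


def term_frequencies(post):
    """Count lowercase alphanumeric tokens in a post."""
    return Counter(re.findall(r"[a-z0-9]+", post.lower()))


def classify_category(post):
    """Return (category, per-category scores).

    Each score is the summed frequency of that category's terms, so the
    scores themselves explain the decision.
    """
    freqs = term_frequencies(post)
    scores = {
        category: sum(freqs[term] for term in terms)
        for category, terms in CATEGORY_TERMS.items()
    }
    best = max(scores, key=scores.get)
    if scores[best] == 0:
        return "others", scores  # no known term matched
    return best, scores


def threat_level(score, high=4, middle=2):
    """Map a category score to a threat level (thresholds are assumptions)."""
    if score >= high:
        return "high"
    if score >= middle:
        return "middle"
    return "low"


category, scores = classify_category("Selling fresh CVV and fullz, card included")
level = threat_level(scores.get(category, 0))
```

Returning the per-category scores alongside the label is one simple way to keep the decision inspectable, which is the property the abstract emphasizes.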
