A Comparative Study on Fake Job Post Prediction Using Different Data mining Techniques

In recent years, due to advancement in modern technology and social communication, advertising new job posts has become very common issue in the present world. So, fake job posting prediction task is going to be a great concern for all. Like many other classification tasks, fake job posing prediction leaves a lot of challenges to face. This paper proposed to use different data mining techniques and classification algorithm like KNN, decision tree, support vector machine, naïve bayes classifier, random forest classifier, multilayer perceptron and deep neural network to predict a job post if it is real or fraudulent. We have experimented on Employment Scam Aegean Dataset (EMSCAD) containing 18000 samples. Deep neural network as a classifier, performs great for this classification task. We have used three dense layers for this deep neural network classifier. The trained classifier shows approximately 98% classification accuracy (DNN) to predict a fraudulent job post.

[1]  Kiet Van Nguyen,et al.  Job Prediction: From Deep Neural Network Models to Applications , 2019, 2020 RIVF International Conference on Computing and Communication Technologies (RIVF).

[2]  Kiet Van Nguyen,et al.  Hate Speech Detection on Vietnamese Social Media Text using the Bi-GRU-LSTM-CNN Model , 2019, ArXiv.

[3]  Bandar Alghamdi,et al.  An Intelligent Model for Online Recruitment Fraud Detection , 2019, Journal of Information Security.

[4]  Kiet Van Nguyen,et al.  Deep Learning for Aspect Detection on Vietnamese Reviews , 2018, 2018 5th NAFOSTED Conference on Information and Computer Science (NICS).

[5]  Zhihua Li,et al.  News Text Classification Based on Improved Bi-LSTM-CNN , 2018, 2018 9th International Conference on Information Technology in Medicine and Education (ITME).

[6]  Philip S. Yu,et al.  FakeDetector: Effective Fake News Detection with Deep Diffusive Neural Network , 2018, 2020 IEEE 36th International Conference on Data Engineering (ICDE).

[7]  Georgios Kambourakis,et al.  Automatic Detection of Online Recruitment Frauds: Characteristics, Methods, and a Public Dataset , 2017, Future Internet.

[8]  Abdelmunem Abuhasan,et al.  An intelligent classification model for phishing email detection , 2016, ArXiv.

[9]  Peng Wang,et al.  Semantic expansion using word embedding clustering and convolutional neural network for improving short text classification , 2016, Neurocomputing.

[10]  Bing Liu,et al.  Spotting Fake Reviews via Collective Positive-Unlabeled Learning , 2014, 2014 IEEE International Conference on Data Mining.

[11]  Nasrullah Memon,et al.  Detection of Fraudulent Emails by Employing Advanced Feature Abundance , 2014 .

[12]  Yoon Kim,et al.  Convolutional Neural Networks for Sentence Classification , 2014, EMNLP.

[13]  Matthew S. Gerber,et al.  Automatic detection of cyber-recruitment by violent extremists , 2014, Security Informatics.

[14]  K. R. Remya,et al.  Using weighted majority voting classifier combination for relation classification in biomedical texts , 2014, 2014 International Conference on Control, Instrumentation, Communication and Computational Technologies (ICCICCT).

[15]  Claire Cardie,et al.  Estimating the prevalence of deception in online review communities , 2012, WWW.