Social Media User’s Safety Level Detection through Classification via Clustering Approach

Social media has a significant impact on daily life, and its popularity is growing rapidly because it lets people connect with others around the world and share feelings, photos, videos, and more. This openness raises serious security concerns. However, most social media users do not know the security level of their accounts, or which features of their social media use to examine when judging whether an account is at risk. Posting, friendship connections, and similar activities sometimes lead to unfortunate events such as identity theft, sexual harassment, and cyber-crime. To address these issues, this research proposes a predictive model based on a classification-via-clustering approach, through which users can learn their safety level on social media. A dataset is formed through a closed-ended questionnaire. Essential features are selected with the gain ratio method, because high-dimensional data is costly for training a model. An unsupervised algorithm, hierarchical clustering, groups the users into three clusters, which are then labeled for further analysis. Several classification algorithms are then trained to build the predictive model. In the model evaluation, “Logistic Regression” predicts the safety level of a social media user with high accuracy. The model thus adds an extra dimension to the safety of social media user accounts.
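The gain ratio criterion used for feature selection can be illustrated with a minimal sketch. It assumes categorical questionnaire answers and a class label for each respondent; the abstract does not state which target the ratio was computed against. The feature names (`shares_location`, `uses_two_factor`), the labels, and the six toy responses are hypothetical and are not data from the study. Gain ratio is computed in its standard form: information gain divided by split information.

```python
from collections import Counter
from math import log2


def entropy(values):
    """Shannon entropy (in bits) of a sequence of discrete values."""
    total = len(values)
    return -sum((n / total) * log2(n / total) for n in Counter(values).values())


def gain_ratio(feature, labels):
    """Gain ratio of one categorical feature: information gain / split information."""
    total = len(labels)
    # Group the class labels by each respondent's answer to this feature.
    groups = {}
    for value, label in zip(feature, labels):
        groups.setdefault(value, []).append(label)
    # Expected entropy of the labels after splitting on the feature.
    conditional = sum(len(g) / total * entropy(g) for g in groups.values())
    info_gain = entropy(labels) - conditional
    split_info = entropy(feature)
    # A feature with a single answer carries no split information.
    return info_gain / split_info if split_info > 0 else 0.0


# Hypothetical toy answers from six respondents (assumed for illustration only).
answers = {
    "shares_location": ["yes", "yes", "yes", "no", "no", "no"],
    "uses_two_factor": ["yes", "no", "yes", "no", "yes", "no"],
}
safety = ["risky", "risky", "risky", "safe", "safe", "safe"]

# Rank features from most to least informative about the label.
ranked = sorted(answers, key=lambda name: gain_ratio(answers[name], safety), reverse=True)
print(ranked)
```

In this toy data `shares_location` separates the two labels perfectly and is therefore ranked first. In practice one would keep the top-ranked features and pass only those to the clustering and classification stages.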
