Discovering socially similar users in social media datasets based on their socially important locations

Abstract Socially similar social media users can be defined as users whose frequently visited locations in their social media histories are similar. Discovering socially similar social media users is important for several applications, such as, community detection, friendship analysis, location recommendation, urban planning, and anomaly user and behavior detection. Discovering socially similar users is challenging due to dataset size and dimensions, spam behaviors of social media users, spatial and temporal aspects of social media datasets, and location sparseness in social media datasets. In the literature, several studies are conducted to discover similar social media users out of social media datasets using spatial and temporal information. However, most of these studies rely on trajectory pattern mining methods or take into account semantic information of social media datasets. Limited number of studies focus on discovering similar users based on their social media location histories. In this study, to discover socially similar users, frequently visited or socially important locations of social media users are taken into account instead of all locations that users visited. A new interest measure, which is based on Levenshtein distance, was proposed to quantify user similarity based on their socially important locations and two algorithms were developed using the proposed method and interest measure. The algorithms were experimentally evaluated on a real-life Twitter dataset. The results show that the proposed algorithms could successfully discover similar social media users based on their socially important locations.

[1]  Panagiotis Symeonidis,et al.  Recommending Friends and Locations over a Heterogeneous Spatio-Temporal Graph , 2015, MEDI.

[2]  Mete Celik,et al.  Discovering socio-spatio-temporal important locations of social media users , 2017, J. Comput. Sci..

[3]  Pramit Mazumdar,et al.  An approach to compute user similarity for GPS applications , 2016, Knowl. Based Syst..

[4]  Julita Vassileva,et al.  SocConnect: A personalized social network aggregator and recommender , 2013, Inf. Process. Manag..

[5]  Alessandro Moschitti,et al.  Multi-lingual opinion mining on YouTube , 2016, Inf. Process. Manag..

[6]  Marco Ortolani,et al.  Detecting Similarities in Mobility Patterns , 2016, STAIRS.

[7]  Wang-Chien Lee,et al.  PGT: Measuring Mobility Relationship Using Personal, Global and Temporal Factors , 2014, 2014 IEEE International Conference on Data Mining.

[8]  Hao Wu,et al.  A data-intensive approach for discovering user similarities in social behavioral interactions based on the bayesian network , 2017, Neurocomputing.

[9]  Cecilia Mascolo,et al.  Exploiting place features in link prediction on location-based social networks , 2011, KDD.

[10]  Alexander Lazovik,et al.  Mining Twitter in the Cloud: A Case Study , 2010, 2010 IEEE 3rd International Conference on Cloud Computing.

[11]  Naveen Nandan,et al.  SimMiner: A Tool for Discovering User Similarity by Mining Geospatial Trajectories , 2014, 2014 IEEE 15th International Conference on Mobile Data Management.

[12]  Jianfeng Guan,et al.  Finding top-k similar users based on Trajectory-Pattern model for personalized service recommendation , 2016, 2016 IEEE International Conference on Communications Workshops (ICC).

[13]  Zheyi Chen,et al.  Detecting spammers on social networks , 2015, Neurocomputing.

[14]  Jing Xiao,et al.  Friend Recommendation by User Similarity Graph Based on Interest in Social Tagging Systems , 2015, ICIC.

[15]  Jun Pang,et al.  Constructing and Comparing User Mobility Profiles , 2014, TWEB.

[16]  Halit Oguztüzün,et al.  Evidential estimation of event locations in microblogs using the Dempster-Shafer theory , 2016, Inf. Process. Manag..

[17]  Yang Zhang,et al.  Inferring friendship from check-in data of location-based social networks , 2015, 2015 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM).

[18]  Jun Pang,et al.  MinUS: Mining User Similarity with Trajectory Patterns , 2014, ECML/PKDD.

[19]  Wei Li,et al.  Mining User Similarity Based on Users Trajectories , 2013, 2013 International Conference on Cloud Computing and Big Data.

[20]  Virgílio A. F. Almeida,et al.  Detecting Spammers on Twitter , 2010 .

[21]  Stefano Mizzaro,et al.  Finding Important Locations: A Feature-Based Approach , 2015, 2015 16th IEEE International Conference on Mobile Data Management.

[22]  Chin-Wan Chung,et al.  A User Similarity Calculation Based on the Location for Social Network Services , 2011, DASFAA.

[23]  Yang Zhang,et al.  walk2friends: Inferring Social Links from Mobility Profiles , 2017, CCS.

[24]  Tal Samuel-Azran,et al.  Gendered discourse patterns on online social networks: A social network analysis perspective , 2017, Comput. Hum. Behav..

[25]  Xing Xie,et al.  Mining user similarity based on location history , 2008, GIS '08.

[26]  Gyözö Gidófalvi,et al.  Geographical and temporal similarity measurement in location-based social networks , 2013, MobiGIS '13.

[27]  Li Yujian,et al.  A Normalized Levenshtein Distance Metric , 2007, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[28]  Chenghu Zhou,et al.  Semantic-Geographic Trajectory Pattern Mining Based on a New Similarity Measurement , 2017, ISPRS Int. J. Geo Inf..

[29]  Mohamed F. Mokbel,et al.  Recommendations in location-based social networks: a survey , 2015, GeoInformatica.

[30]  Daniel Borrajo,et al.  Planning for tourism routes using social networks , 2017, Expert Syst. Appl..

[31]  Haoran Sun,et al.  Friend Recommendation Algorithm for Online Social Networks Based on Location Preference , 2016, 2016 3rd International Conference on Information Science and Control Engineering (ICISCE).

[32]  Mohamed F. Mokbel,et al.  Location-based and preference-aware recommendation using sparse geo-social networking data , 2012, SIGSPATIAL/GIS.

[33]  Mete Celik,et al.  Discovering socially important locations of social media users , 2017, Expert Syst. Appl..

[34]  Makarand Hastak,et al.  Social network analysis: Characteristics of online social networks after a disaster , 2018, Int. J. Inf. Manag..

[35]  Xing Xie,et al.  Finding similar users using category-based location history , 2010, GIS '10.

[36]  Panagiotis Symeonidis,et al.  A Graph-Based Taxonomy of Recommendation Algorithms and Systems in LBSNs , 2016, IEEE Transactions on Knowledge and Data Engineering.

[37]  Jun Pang,et al.  Measuring User Similarity with Trajectory Patterns: Principles and New Metrics , 2014, APWeb.

[38]  Yan Liu,et al.  EBM: an entropy-based model to infer social strength from spatiotemporal data , 2013, SIGMOD '13.

[39]  Xinmei Tian,et al.  Flickr group recommendation using rich social media information , 2016, Neurocomputing.

[40]  Enhong Chen,et al.  A habit mining approach for discovering similar mobile users , 2012, WWW.

[41]  Abbas Rajabifard,et al.  Event relatedness assessment of Twitter messages for emergency response , 2017, Inf. Process. Manag..

[42]  Avinash Chandra Pandey,et al.  Twitter sentiment analysis using hybrid cuckoo search method , 2017, Inf. Process. Manag..

[43]  Barbara Carminati,et al.  User similarities on social networks , 2013, Social Network Analysis and Mining.

[44]  Vladimir I. Levenshtein,et al.  Binary codes capable of correcting deletions, insertions, and reversals , 1965 .

[45]  Ling Chen,et al.  Mining user similarity based on routine activities , 2013, Inf. Sci..

[46]  Chunfeng Yang,et al.  Who Are Like-Minded: Mining User Interest Similarity in Online Social Networks , 2016, ICWSM.

[47]  Hermann Hellwagner,et al.  Online indexing and clustering of social media data for emergency management , 2016, Neurocomputing.

[48]  Alia I. Abdelmoty,et al.  Computing similarity between users on location-based social networks , 2016 .

[49]  Fabián Riquelme,et al.  Measuring user influence on Twitter: A survey , 2015, Inf. Process. Manag..