Opinion community detection and opinion leader detection based on text information and network topology in cloud environment

Abstract With the rapid development of web technology, the social networks have become the largest information portals. In the social platforms, the text information can effectively reflect the user opinions or the public opinions for a certain entity, such as company, celebrity service, product and so on. Therefore, mining user opinions from social networks have become an imperative requirement for the service groups. In this paper, an opinion community detection method is proposed by considering the content similarity, the time similarity and the topology structure of users. The integrated similarity between two users, which includes the content similarity, the time similarity and the topology structure of users, is achieved. Then, based on the integrated similarities, the opinion communities are detected. Furthermore, in order to identify the opinion leader, an opinion leader detection method is proposed based on the user influence and emotional analysis. The users with the same topic form the opinion community. Meanwhile, a directed graph is created to formulate the interaction relationship between users in the opinion community. Then, the user influence model and emotional analysis model are presented. Moreover, the occurrence frequencies of the negative words are also considered in the emotional analysis model. Then, a model of influence value for each user in the opinion community is built. The user with the highest influence value is considered as the opinion leader. Finally, the performances of the proposed algorithms are evaluated in a distributed computing environment. Meanwhile, the extensive experiments are conducted. The results indicate that our proposed opinion community detection algorithm can effectively detect the opinion communities. Also, the proposed opinion leader detection algorithm can significantly identify the opinion leader in the social networks.

[1]  Roberto V. Zicari,et al.  PoliTwi: Early detection of emerging political topics on twitter and the impact on concept-level sentiment analysis , 2014, Knowl. Based Syst..

[2]  Jianfang Wang,et al.  Identifying and Evaluating the Internet Opinion Leader Community Through k-clique Clustering , 2013, J. Comput..

[3]  Ellis Solaiman,et al.  A multi-layered performance analysis for cloud-based topic detection and tracking in Big Data applications , 2018, Future Gener. Comput. Syst..

[4]  Ana M. García-Serrano,et al.  A step forward for Topic Detection in Twitter: An FCA-based approach , 2016, Expert Syst. Appl..

[5]  Unil Yun,et al.  Single-pass based efficient erasable pattern mining using list data structure on dynamic incremental databases , 2018, Future Gener. Comput. Syst..

[6]  Yuefeng Li,et al.  Extracting news blog hot topics based on the W2T Methodology , 2013, World Wide Web.

[7]  Xiaojun Wan,et al.  CMiner: Opinion Extraction and Summarization for Chinese Microblogs , 2016, IEEE Transactions on Knowledge and Data Engineering.

[8]  Milos Pavkovic,et al.  Intelligent crawler for web forums based on improved regular expressions , 2013, 2013 21st Telecommunications Forum Telfor (TELFOR).

[9]  Qing Li,et al.  Forum topic detection based on hierarchical clustering , 2016, 2016 International Conference on Audio, Language and Image Processing (ICALIP).

[10]  Junping Du,et al.  Opinion Leader Mining of Social Network Combined with Hierarchical Sentiment Analysis , 2017 .

[11]  Ya Jun Du,et al.  Opinion leader mining algorithm in microblog platform based on topic similarity , 2016, 2016 2nd IEEE International Conference on Computer and Communications (ICCC).

[12]  Haitao Xiong,et al.  Topic Detection from Short Text: A Term-based Consensus Clustering method , 2016, 2016 13th International Conference on Service Systems and Service Management (ICSSSM).

[13]  Xiaolong Wang,et al.  Identifying Opinion Leaders from Online Comments , 2014, SMP.

[14]  Yijun Liu,et al.  Superedge coupling algorithm and its application in coupling mechanism analysis of online public opinion supernetwork , 2015, Expert Syst. Appl..

[15]  Youlong Luo,et al.  Collaborative cache allocation and task scheduling for data-intensive applications in edge computing environment , 2019, Future Gener. Comput. Syst..

[16]  Ling-Jyh Chen,et al.  Automatic opinion leader recognition in group discussions , 2016, 2016 Conference on Technologies and Applications of Artificial Intelligence (TAAI).

[17]  Haoran Xie,et al.  Supervised Intensive Topic Models for Emotion Detection over Short Text , 2017, DASFAA.

[18]  Hendri Murfi,et al.  Combination of singular value decomposition and K-means clustering methods for topic detection on Twitter , 2015, 2015 International Conference on Advanced Computer Science and Information Systems (ICACSIS).

[19]  Wei Ai,et al.  Parallelizing hot topic detection of microblog on spark , 2016, 2016 12th International Conference on Natural Computation, Fuzzy Systems and Knowledge Discovery (ICNC-FSKD).

[20]  Wenjun Zhao,et al.  Community detection using hierarchical clustering based on edge-weighted similarity in cloud environment , 2019, Inf. Process. Manag..

[21]  Saeed Jalili,et al.  Single-pass and linear-time k-means clustering based on MapReduce , 2016, Inf. Syst..

[22]  Tang Jianhang,et al.  Joint optimization of data placement and scheduling for improving user experience in edge computing , 2019, J. Parallel Distributed Comput..

[23]  Chen Zhang,et al.  A hybrid term-term relations analysis approach for topic detection , 2016, Knowl. Based Syst..

[24]  Weidong Xiao,et al.  BBS opinion leader mining based on an improved PageRank algorithm using MapReduce , 2013, 2013 Chinese Automation Congress.

[25]  Bo Hu,et al.  An Improved Single-Pass Algorithm for Chinese Microblog Topic Detection and Tracking , 2016, 2016 IEEE International Congress on Big Data (BigData Congress).

[26]  Lei Zhang,et al.  Data locality optimization based on data migration and hotspots prediction in geo-distributed cloud environment , 2019, Knowl. Based Syst..