An intelligent textual corpus big data computing approach for lexicons construction and sentiment classification of public emergency events

Considering the deficiencies in the existing emotional lexicons like too many manual interventions, lack of scalability and ignorance of dependency parsing in emotional computing, this paper first uses Word2Vec, cosine word vector similarity calculation and SO-PMI algorithms to build a public event-oriented Weibo emotional lexicon; then, it proposes a Weibo emotion computing method based on dependency parsing and designs an emotion binary tree based on dependency parsing, and dependency-based emotion calculation rules; and at last, through an experiment, it shows that this emotional lexicon has a wider coverage and higher accuracy than the existing ones, and it also performs a public opinion evolution analysis on an actual public event and the empirical results show that the algorithm is feasible and effective.

[1]  Mark Strembeck,et al.  An Analysis of the Twitter Discussion on the 2016 Austrian Presidential Elections , 2017, Online Soc. Networks Media.

[2]  Wanxiang Che,et al.  LTP: A Chinese Language Technology Platform , 2010, COLING.

[3]  Erik Cambria,et al.  A Review of Sentiment Analysis Research in Chinese Language , 2017, Cognitive Computation.

[4]  Xinmiao Li,et al.  A Global Optimization Approach to Multi-Polarity Sentiment Analysis , 2015, PloS one.

[5]  Avi Arampatzis,et al.  Towards the creation of an emotion lexicon for microblogging , 2018, J. Syst. Inf. Technol..

[6]  Stewart Massie,et al.  Emotion-Corpus Guided Lexicons for Sentiment Analysis on Twitter , 2016, SGAI Conf..

[7]  Lei Zhang,et al.  A Survey of Opinion Mining and Sentiment Analysis , 2012, Mining Text Data.

[8]  David Zimbra,et al.  Twitter brand sentiment analysis: A hybrid system using n-gram analysis and dynamic artificial neural network , 2013, Expert Syst. Appl..

[9]  Zhen Lin,et al.  ESIS: Emotion-based spreader-ignorant-stifler model for information diffusion , 2015, Knowl. Based Syst..

[10]  Yves Bestgen,et al.  Building Affective Lexicons from Specific Corpora for Automatic Sentiment Analysis , 2008, LREC.

[11]  Rens Vliegenthart,et al.  The Mediating Role of Emotions , 2015 .

[12]  Mohammad Tubishat,et al.  Implicit aspect extraction in sentiment analysis: Review, taxonomy, oppportunities, and open challenges , 2018, Inf. Process. Manag..

[13]  G. Rajesh,et al.  Sentiment Analysis on Movie Reviews: A Comparative Study of Machine Learning Algorithms and Open Source Technologies , 2016 .

[14]  Xiao Ke,et al.  Multi-modality weakly labeled sentiment learning based on Explicit Emotion Signal for Chinese microblog , 2018, Neurocomputing.

[15]  Gjorgji Madjarov,et al.  Deep neural network architecture for sentiment analysis and emotion identification of Twitter messages , 2018, Multimedia Tools and Applications.

[16]  Udo Hahn,et al.  Representation Mapping: A Novel Approach to Generate High-Quality Multi-Lingual Emotion Lexicons , 2018, LREC.

[17]  万常选,et al.  Sentiment Computing of Web Financial Information Based on the Part-of-speech Tagging and Dependency Parsing , 2013 .

[18]  Hazem M. Hajj,et al.  EmoWordNet: Automatic Expansion of Emotion Lexicon Using English WordNet , 2018, *SEMEVAL.

[19]  Kiran Bhowmick,et al.  A Survey of Opinion Mining and Sentiment Analysis , 2015 .

[20]  Yun Jung Choi,et al.  Understanding social viewing through discussion network and emotion: A focus on South Korean presidential debates , 2018, Telematics Informatics.

[21]  Zhongxia Zhang,et al.  基于Word2Vec的情感词典自动构建与优化 (Automatic Construction and Optimization of Sentiment Lexicon Based on Word2Vec) , 2017, 计算机科学.

[22]  Mark Strembeck,et al.  On the Influence of Emotional Valence Shifts on the Spread of Information in Social Networks , 2017, ASONAM.

[23]  Albert Y. Zomaya,et al.  Detecting users' anomalous emotion using social media for business intelligence , 2017, J. Comput. Sci..

[24]  Heidi Vandebosch,et al.  #Smiling, #venting, or both? Adolescents' social sharing of emotions on social media , 2018, Comput. Hum. Behav..

[25]  Bo Pang,et al.  Thumbs up? Sentiment Classification using Machine Learning Techniques , 2002, EMNLP.

[26]  Erik Cambria,et al.  Ensemble application of convolutional neural networks and multiple kernel learning for multimodal sentiment analysis , 2017, Neurocomputing.

[27]  V. Srividhya,et al.  Emotion Classification of Twitter Data using Lexicon based Approach , 2018 .

[28]  Luis Alfonso Ureña López,et al.  Ranked WordNet graph for Sentiment Polarity Classification in Twitter , 2014, Comput. Speech Lang..

[29]  Stewart Massie,et al.  Emotion‐aware polarity lexicons for Twitter sentiment analysis , 2018, Expert Syst. J. Knowl. Eng..

[30]  Yang Li,et al.  Learning multi-grained aspect target sequence for Chinese sentiment analysis , 2018, Knowl. Based Syst..

[31]  Mark Strembeck,et al.  Identifying Emotions in Social Media: Comparison of Word-Emotion Lexicons , 2017, 2017 5th International Conference on Future Internet of Things and Cloud Workshops (FiCloudW).

[32]  Osmar R. Zaïane,et al.  Current State of Text Sentiment Analysis from Opinion to Emotion Mining , 2017, ACM Comput. Surv..

[33]  Stewart Massie,et al.  Lexicon based feature extraction for emotion text classification , 2017, Pattern Recognit. Lett..

[34]  Sylvia Jiankun Guo,et al.  The 2013 Boston marathon bombing: Publics’ emotions, coping, and organizational engagement , 2017 .

[35]  Lean Yu,et al.  What can mass media do to control public panic in accidents of hazardous chemical leakage into rivers? A multi-agent-based online opinion dissemination model , 2017 .

[36]  King-Wa Fu,et al.  Challenging Official Propaganda? Public Opinion Leaders on Sina Weibo* , 2016, The China Quarterly.

[37]  Tianru Guan Framing the Boundary of Sino-Japanese Conflicts in China’s Communication Sphere: a Content Analysis of the News Coverage of Japan and Sino-Japanese Controversies by the People’s Daily between 2001 and 2015 , 2018 .

[38]  Wenbo Kuang Empirical Studies on New-Media Public Opinion , 2018 .