Detecting users' anomalous emotion using social media for business intelligence

Abstract Anomaly detection in sentiment analysis refers to detecting users’ abnormal opinions, sentiment patterns or special temporal aspects of such patterns. Users’ emotional state extracted from social media contains business information and business value for decision making. Social media platforms, such as Sina Weibo or Twitter, provide a vast source of information, which include user feedbacks, opinions and information on most issues. Many organizations also leverage social media platforms to publish information about events, products, services, policies and other topics frequently, analyzing social media data to identify abnormal events and make decisions in a timely manner is a beneficial topic. This paper adopts the multivariate Gauss distribution with the power-law distribution to model and analyze the users’ emotion of micro-blogs and detect abnormal emotion state. With the measure of joint probability density value and the validation of the corpus, anomaly detection accuracy of individual user is 83.49% and of different month is 87.84% by this method. Through the distribution test, the results show that individual users’ neutral, happy and sad emotions obey the normal distribution, but the surprised and angry emotions do not. Besides, emotions of micro-blogs released by groups obey power-law distribution, but the individual emotions do not. This paper proposes a quantitative method for abnormal emotion detection on social media, which automatically captures the correlation between different features of the emotions, and saves a certain amount of time by batch calculation of the joint probability density of data sets. The method can help the businesses and government organizations to make decisions according to the user's affective disposition, intervene early or adopt proper strategies if needed.

[1]  Cheng Wu,et al.  Semi-Supervised and Unsupervised Extreme Learning Machines , 2014, IEEE Transactions on Cybernetics.

[2]  Zhang Jin Recognition and Classification of Emotions in the Chinese Microblog Based on Emotional Factor , 2014 .

[3]  John J. Leonard,et al.  An incremental trust-region method for Robust online sparse least-squares estimation , 2012, 2012 IEEE International Conference on Robotics and Automation.

[4]  Lambert Schomaker,et al.  Indoor localization by denoising autoencoders and semi-supervised learning in 3D simulated environment , 2015, 2015 International Joint Conference on Neural Networks (IJCNN).

[5]  Naoki Abe,et al.  Proximity-Based Anomaly Detection Using Sparse Structure Learning , 2009, SDM.

[6]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[7]  Andrew T. A. Cheng,et al.  Invited commentaries on: International representation in psychiatric literature. Survey of six leading journals , 2001, British Journal of Psychiatry.

[8]  Zhang Ya A boost factor based detection method for abnormal rank of microblogging , 2013 .

[9]  Barbara Poblete,et al.  On-line relevant anomaly detection in the Twitter stream: an efficient bursty keyword detection model , 2013, ODD '13.

[10]  Mark E. J. Newman,et al.  Power-Law Distributions in Empirical Data , 2007, SIAM Rev..

[11]  John J. Leonard,et al.  RISE: An Incremental Trust-Region Method for Robust Online Sparse Least-Squares Estimation , 2014, IEEE Transactions on Robotics.

[12]  Fan Yang,et al.  Automatic detection of rumor on Sina Weibo , 2012, MDS '12.

[13]  Tat-Seng Chua,et al.  Detecting Stress Based on Social Interactions in Social Networks , 2017, IEEE Transactions on Knowledge and Data Engineering.

[14]  Jing Lei,et al.  Network Cross-Validation for Determining the Number of Communities in Network Data , 2014, 1411.1715.

[15]  Wang Shi-tong Multi-classification method applied to face recognition based on mixed Gaussian distribution , 2013 .

[16]  Zhaoxia Wang,et al.  Anomaly Detection through Enhanced Sentiment Analysis on Social Media Data , 2014, 2014 IEEE 6th International Conference on Cloud Computing Technology and Science.

[17]  Jiawei Han,et al.  Making SVMs Scalable to Large Data Sets using Hierarchical Cluster Indexing , 2005, Data Mining and Knowledge Discovery.

[18]  Emmanuel Müller,et al.  Proceedings of the ACM SIGKDD Workshop on Outlier Detection and Description , 2013, KDD 2013.

[19]  Ruxu Du,et al.  Model-based Fault Detection and Diagnosis of HVAC systems using Support Vector Machine method , 2007 .

[20]  R. Serfling,et al.  Nonparametric depth-based multivariate outlier identifiers, and masking robustness properties , 2010 .

[21]  Amparo Albalate,et al.  Semi‐Supervised Classification Using Pattern Clustering , 2013 .

[22]  Hassan J. Eghbali,et al.  K-S Test for Detecting Changes from Landsat Imagery Data , 1979, IEEE Transactions on Systems, Man, and Cybernetics.

[23]  Jose M. Such,et al.  International Joint Conference on Artificial Intelligence (IJCAI) , 2016 .

[24]  Zhi Gang Liu,et al.  Density-Based Distributed Elliptical Anomaly Detection in Wireless Sensor Networks , 2012 .

[25]  Tsuyoshi Murata,et al.  {m , 1934, ACML.

[26]  Matthew Cook,et al.  Fast-classifying, high-accuracy spiking deep networks through weight and threshold balancing , 2015, 2015 International Joint Conference on Neural Networks (IJCNN).