A lightweight clustering–based approach to discover different emotional shades from social message streams

With the explosion of social media, automatic analysis of sentiment and emotion from user‐generated content has attracted the attention of many research areas and commercial‐marketing domains targeted at studying the social behavior of web users and their public attitudes toward brands, social events, and political actions. Capturing the emotions expressed in the written language could be crucial to support the decision‐making processes: the emotion resulting from a tweet or a review about an item could affect the way to advertise or to trade on the web and then to make predictions about future changes in popularity or market behavior. This paper presents an experience with the emotion‐based classification of textual data from a social network by using an extended version of the fuzzy C‐means algorithm called extension of fuzzy C‐means. The algorithm shows interesting results due to its intrinsic fuzzy nature that reflects the human feeling expressed in the text, often composed of a mix of blurred emotions, and at the same time, the benefits of the extended version yield better classification results.

[1]  P. Wilson,et al.  The Nature of Emotions , 2012 .

[2]  Salvatore Sessa,et al.  Spatio-temporal hotspots and application on a disease analysis case via GIS , 2014, Soft Comput..

[3]  Shenghuo Zhu,et al.  SumView: A Web-based engine for summarizing product reviews and customer opinions , 2013, Expert Syst. Appl..

[4]  J. Bezdek,et al.  FCM: The fuzzy c-means clustering algorithm , 1984 .

[5]  T. Velmurugan,et al.  A Survey of Partition based Clustering Algorithms in Data Mining: An Experimental Approach , 2011 .

[6]  Liu Rui,et al.  Fuzzy c-Means Clustering Algorithm , 2008 .

[7]  Lei Zhang,et al.  Sentiment Analysis and Opinion Mining , 2017, Encyclopedia of Machine Learning and Data Mining.

[8]  Jeonghee Yi,et al.  Sentiment analysis: capturing favorability using natural language processing , 2003, K-CAP '03.

[9]  Hsinchun Chen,et al.  Sentiment analysis in multiple languages: Feature selection for opinion classification in Web forums , 2008, TOIS.

[10]  Sumbal Riaz,et al.  Opinion mining on large scale data using sentiment analysis and k-means clustering , 2019, Cluster Computing.

[11]  Andrea Esuli,et al.  SENTIWORDNET: A Publicly Available Lexical Resource for Opinion Mining , 2006, LREC.

[12]  Qiyong Guo,et al.  Comparison of K-Means and Fuzzy c-Means Algorithm Performance for Automated Determination of the Arterial Input Function , 2014, PloS one.

[13]  Salvatore Sessa,et al.  Hotspots Detection in Spatial Analysis via the Extended Gustafson-Kessel Algorithm , 2013, Adv. Fuzzy Syst..

[14]  James C. Bezdek,et al.  Pattern Recognition with Fuzzy Objective Function Algorithms , 1981, Advanced Applications in Pattern Recognition.

[15]  Erik Cambria,et al.  The Hourglass of Emotions , 2011, COST 2102 Training School.

[16]  Salvatore Sessa,et al.  Extended Fuzzy C-Means hotspot detection method for large and very large event datasets , 2018, Inf. Sci..

[17]  Vincenzo Loia,et al.  A fuzzy-oriented sentic analysis to capture the human emotion in Web-based content , 2014, Knowl. Based Syst..

[18]  Weitong Chen,et al.  A survey of sentiment analysis in social media , 2018, Knowledge and Information Systems.

[19]  Sanjay Kumar Dubey,et al.  Comparative Analysis of K-Means and Fuzzy C- Means Algorithms , 2013 .

[20]  Erik Cambria,et al.  Affective Computing and Sentiment Analysis , 2016, IEEE Intelligent Systems.

[21]  Salvatore Sessa,et al.  WebGIS based on spatio-temporal hot spots: an application to oto-laryngo-pharyngeal diseases , 2016, Soft Comput..

[22]  Björn W. Schuller,et al.  New Avenues in Opinion Mining and Sentiment Analysis , 2013, IEEE Intelligent Systems.

[23]  Laizhong Cui,et al.  Combine HowNet lexicon to train phrase recursive autoencoder for sentence-level sentiment analysis , 2017, Neurocomputing.

[24]  Uzay Kaymak,et al.  Fuzzy clustering with volume prototypes and adaptive cluster merging , 2002, IEEE Trans. Fuzzy Syst..

[25]  Lawrence O. Hall,et al.  Single Pass Fuzzy C Means , 2007, 2007 IEEE International Fuzzy Systems Conference.

[26]  Philipp Koehn,et al.  Synthesis Lectures on Human Language Technologies , 2016 .

[27]  Vincenzo Loia,et al.  Context-aware profiling of concepts from a semantic topological space , 2017, Knowl. Based Syst..

[28]  Yang Liu,et al.  A method for multi-class sentiment classification based on an improved one-vs-one (OVO) strategy and the support vector machine (SVM) algorithm , 2017, Inf. Sci..

[29]  Yiqiang Chen,et al.  ASELM: Adaptive semi-supervised ELM with application in question subjectivity identification , 2016, Neurocomputing.

[30]  SaltonGerard,et al.  Term-weighting approaches in automatic text retrieval , 1988 .

[31]  Salvatore Sessa,et al.  The extended fuzzy C-means algorithm for hotspots in spatio-temporal GIS , 2011, Expert Syst. Appl..

[32]  A. Govardhan,et al.  Experiments on Hypothesis "Fuzzy K-Means is Better than K-Means for Clustering" , 2014 .

[33]  Marimuthu Palaniswami,et al.  Fuzzy c-Means Algorithms for Very Large Data , 2012, IEEE Transactions on Fuzzy Systems.