A locally weighted method to improve linear regression for lexical-based valence-arousal prediction

Text-based sentiment analysis is a growing research field in affective computing, driven by both commercial applications and academic interest. Continuous dimensional representations, such as valence-arousal (VA) space, can represent the affective state more precisely than discrete effective representations. In building dimensional sentiment applications, affective lexicons with valence-arousal ratings are useful resources but are still very rare. Therefore, recent studies have investigated the automatic development of VA lexicons using linear regression techniques. One of the major limitations of linear regression is the under-fitting problem which can cause a poor fit between the algorithm and the training data. To tackle this problem, this study proposes the use of a locally weighted linear regression (LWLR) model to predict the valence-arousal ratings of affective words. The locally weighted method performs a regression around the point of interest using only training data that are “local” to that point, and thus can reduce the impact of noise from unrelated training data. Experimental results show that the proposed method achieved better performance for VA word prediction.

[1]  Sunghwan Mac Kim,et al.  EMOTIONS IN TEXT: DIMENSIONAL AND CATEGORICAL MODELS , 2013, Comput. Intell..

[2]  Qing Zhou,et al.  Parents’ Expression and Discussion of Emotion in the Multilingual Family , 2012, Perspectives on psychological science : a journal of the Association for Psychological Science.

[3]  Christopher D. Manning,et al.  Baselines and Bigrams: Simple, Good Sentiment and Topic Classification , 2012, ACL.

[4]  Chung-Hsien Wu,et al.  A Regression Approach to Affective Rating of Chinese Words from ANEW , 2011, ACII.

[5]  Navneet Kaur,et al.  Opinion mining and sentiment analysis , 2016, 2016 3rd International Conference on Computing for Sustainable Global Development (INDIACom).

[6]  M. Bradley,et al.  Measuring emotion: the Self-Assessment Manikin and the Semantic Differential. , 1994, Journal of behavior therapy and experimental psychiatry.

[7]  Yi-Hsuan Yang,et al.  Ranking-Based Emotion Recognition for Music Organization and Retrieval , 2011, IEEE Transactions on Audio, Speech, and Language Processing.

[8]  Claire Cardie,et al.  Towards a General Rule for Identifying Deceptive Opinion Spam , 2014, ACL.

[9]  W. Cleveland,et al.  Locally Weighted Regression: An Approach to Regression Analysis by Local Fitting , 1988 .

[10]  E. Vesterinen,et al.  Affective Computing , 2009, Encyclopedia of Biometrics.

[11]  P. Ekman An argument for basic emotions , 1992 .

[12]  Ronen Feldman,et al.  Techniques and applications for sentiment analysis , 2013, CACM.

[13]  J. Russell A circumplex model of affect. , 1980 .

[14]  Thierry Pun,et al.  Valence-arousal evaluation using physiological signals in an emotion recall paradigm , 2007, 2007 IEEE International Conference on Systems, Man and Cybernetics.

[15]  Chung-Hsien Wu,et al.  Error Weighted Semi-Coupled Hidden Markov Model for Audio-Visual Emotion Recognition , 2012, IEEE Transactions on Multimedia.

[16]  Amy Beth Warriner,et al.  Norms of valence, arousal, and dominance for 13,915 English lemmas , 2013, Behavior Research Methods.

[17]  Shrikanth S. Narayanan,et al.  Kernel Models for Affective Lexicon Creation , 2011, INTERSPEECH.

[18]  Jaime Redondo,et al.  The Spanish adaptation of ANEW (Affective Norms for English Words) , 2007, Behavior research methods.

[19]  Muni S. Srivastava,et al.  Regression Analysis: Theory, Methods, and Applications , 1991 .

[20]  Rafael A. Calvo,et al.  Affect Detection: An Interdisciplinary Review of Models, Methods, and Their Applications , 2010, IEEE Transactions on Affective Computing.

[21]  Rada Mihalcea,et al.  Porting Multilingual Subjectivity Resources across Languages , 2013, IEEE Transactions on Affective Computing.

[22]  Philipp Koehn,et al.  Synthesis Lectures on Human Language Technologies , 2016 .

[23]  Adam Pease,et al.  The Suggested Upper Merged Ontology: A Large Ontology for the Semantic Web and its Applic ations , 2002 .

[24]  Mike Thelwall,et al.  Seeing Stars of Valence and Arousal in Blog Posts , 2013, IEEE Transactions on Affective Computing.

[25]  M. Bradley,et al.  Affective Norms for English Words (ANEW): Instruction Manual and Affective Ratings , 1999 .

[26]  Bo Pang,et al.  Thumbs up? Sentiment Classification using Machine Learning Techniques , 2002, EMNLP.

[27]  J. Leu,et al.  Are positive emotions just as "positive" across cultures? , 2011, Emotion.

[28]  Arvid Kappas,et al.  Predicting Emotional Responses to Long Informal Text , 2013, IEEE Transactions on Affective Computing.

[29]  Lei Zhang,et al.  Sentiment Analysis and Opinion Mining , 2017, Encyclopedia of Machine Learning and Data Mining.

[30]  Kam-Fai Wong,et al.  Cross lingual opinion holder extraction based on multi-kernel SVMs and transfer learning , 2013, World Wide Web.

[31]  Jan Larsen,et al.  Combining semantic and acoustic features for valence and arousal recognition in speech , 2012, 2012 3rd International Workshop on Cognitive Information Processing (CIP).

[32]  Arjun Mukherjee,et al.  Aspect Extraction through Semi-Supervised Modeling , 2012, ACL.

[33]  E. Ambrosini,et al.  The adaptation of the Affective Norms for English Words (ANEW) for Italian , 2014, Behavior research methods.