Weighted multi-label classification model for sentiment analysis of online news

With the extensive growth of social media services, many users express their feelings and opinions through news articles, blogs and tweets/microblogs. To discover the connections between emotions evoked in a user by varied-scale documents effectively, the paper is concerned with the problem of sentiment analysis over online news. Different from previous models which treat training documents uniformly, a weighted multi-label classification model (WMCM) is proposed by introducing the concept of “emotional concentration” to estimate the weight of training documents, in addition to tackle the issue of noisy samples for each emotion. The topic assignment is also used to distinguish different emotional senses of the same word at the semantic level. Experimental evaluations using short news headlines and long documents validate the effectiveness of the proposed WMCM for sentiment prediction.

[1]  Rong Yan,et al.  Joint Emotion-Topic Modeling for Social Affective Text Mining , 2009, 2009 Ninth IEEE International Conference on Data Mining.

[2]  Wenyin Liu,et al.  Affective topic model for social emotion detection , 2014, Neural Networks.

[3]  Long Jiang,et al.  User-level sentiment analysis incorporating social networks , 2011, KDD.

[4]  Min-Ling Zhang,et al.  A Review on Multi-Label Learning Algorithms , 2014, IEEE Transactions on Knowledge and Data Engineering.

[5]  Rong Yan,et al.  Mining Social Emotions from Affective Text , 2012, IEEE Transactions on Knowledge and Data Engineering.

[6]  Saso Dzeroski,et al.  Decision trees for hierarchical multi-label classification , 2008, Machine Learning.

[7]  Haoran Xie,et al.  Does Summarization Help Stock Prediction? A News Impact Analysis , 2015, IEEE Intelligent Systems.

[8]  Brendan T. O'Connor,et al.  Cheap and Fast – But is it Good? Evaluating Non-Expert Annotations for Natural Language Tasks , 2008, EMNLP.

[9]  Andrew McCallum,et al.  Collective multi-label classification , 2005, CIKM '05.

[10]  Li Chen,et al.  News impact on stock price return via sentiment analysis , 2014, Knowl. Based Syst..

[11]  Jiafeng Guo,et al.  BTM: Topic Modeling over Short Texts , 2014, IEEE Transactions on Knowledge and Data Engineering.

[12]  Carlo Strapparava,et al.  SemEval-2007 Task 14: Affective Text , 2007, Fourth International Workshop on Semantic Evaluations (SemEval-2007).

[13]  Bing Liu,et al.  Sentiment Analysis and Opinion Mining , 2012, Synthesis Lectures on Human Language Technologies.

[14]  Bo Pang,et al.  Thumbs up? Sentiment Classification using Machine Learning Techniques , 2002, EMNLP.

[15]  Michael I. Jordan,et al.  Latent Dirichlet Allocation , 2001, J. Mach. Learn. Res..

[16]  Zhi-Hua Zhou,et al.  ML-KNN: A lazy learning approach to multi-label learning , 2007, Pattern Recognit..

[17]  Richard Wicentowski,et al.  SWAT-MP:The SemEval-2007 Systems for Task 5 and Task 14 , 2007, Fourth International Workshop on Semantic Evaluations (SemEval-2007).

[18]  Milos Hauskrecht,et al.  A Generalized Mixture Framework for Multi-label Classification , 2015, SDM.

[19]  Eyke Hüllermeier,et al.  Dependent binary relevance models for multi-label classification , 2014, Pattern Recognit..

[20]  Lei Tang,et al.  Large scale multi-label classification via metalabeler , 2009, WWW '09.

[21]  Paolo Giudici,et al.  Applied Data Mining: Statistical Methods for Business and Industry , 2003 .

[22]  Yulan He,et al.  Joint sentiment/topic model for sentiment analysis , 2009, CIKM.

[23]  Claire Cardie,et al.  Annotating Topics of Opinions , 2008, LREC.

[24]  David Bell,et al.  Microblogging as a mechanism for human-robot interaction , 2014, Knowl. Based Syst..

[25]  Thomas Hofmann,et al.  Probabilistic Latent Semantic Analysis , 1999, UAI.

[26]  Changqin Quan,et al.  An Exploration of Features for Recognizing Word Emotion , 2010, COLING.

[27]  Vipin Kumar,et al.  Introduction to Data Mining , 2022, Data Mining and Machine Learning Applications.

[28]  Geoff Holmes,et al.  Classifier chains for multi-label classification , 2009, Machine Learning.

[29]  Luis Alfonso Ureña López,et al.  Crowd explicit sentiment analysis , 2014, Knowl. Based Syst..

[30]  Daniel T. Larose,et al.  Discovering Knowledge in Data: An Introduction to Data Mining , 2005 .

[31]  Razvan C. Bunescu,et al.  Sentiment analyzer: extracting sentiments about a given topic using natural language processing techniques , 2003, Third IEEE International Conference on Data Mining.

[32]  Kyoungok Kim,et al.  Sentiment visualization and classification via semi-supervised nonlinear dimensionality reduction , 2014, Pattern Recognit..

[33]  Eyke Hüllermeier,et al.  On label dependence and loss minimization in multi-label classification , 2012, Machine Learning.

[34]  Mingliang Chen,et al.  Building emotional dictionary for sentiment analysis of online news , 2014, World Wide Web.

[35]  Mark Steyvers,et al.  Finding scientific topics , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[36]  Huan Liu,et al.  Exploiting social relations for sentiment analysis in microblogging , 2013, WSDM.

[37]  Yanghui Rao,et al.  Sentiment topic models for social emotion mining , 2014, Inf. Sci..

[38]  Xin Li,et al.  Social Emotion Classification via Reader Perspective Weighted Model , 2016, AAAI.

[39]  Mike Y. Chen,et al.  Yahoo! for Amazon: Sentiment Extraction from Small Talk on the Web , 2001 .

[40]  Sheng Wang,et al.  SUIT: A Supervised User-Item Based Topic Model for Sentiment Analysis , 2014, AAAI.

[41]  Roberto V. Zicari,et al.  PoliTwi: Early detection of emerging political topics on twitter and the impact on concept-level sentiment analysis , 2014, Knowl. Based Syst..

[42]  Corinna Cortes,et al.  Support-Vector Networks , 1995, Machine Learning.

[43]  Lei Huang,et al.  Sentence-level Emotion Classification with Label and Context Dependence , 2015, ACL.

[44]  Sebastián Ventura,et al.  A Tutorial on Multilabel Learning , 2015, ACM Comput. Surv..

[45]  Hsin-Hsi Chen,et al.  Ranking Reader Emotions Using Pairwise Loss Minimization and Emotional Distribution Regression , 2008, EMNLP.

[46]  Hinrich Schütze,et al.  Introduction to information retrieval , 2008 .