CENNLP at SemEval-2018 Task 1: Constrained Vector Space Model in Affects in Tweets

This paper discusses Task 1, the “Affect in Tweets” shared task, conducted at SemEval-2018. The task comprises several subtasks that required participants to analyse different emotions and sentiments in the provided tweet data and to measure the intensity of these emotions. Our approach was to build a model on a count-based representation and apply machine learning techniques to the regression and classification subtasks. In this work, we use a simple bag-of-words technique for supervised text classification to show that, even when compared with more advanced distributed representation models, it can still achieve significant accuracy. Further, by fine-tuning various parameters of the bag-of-words representation model, we obtained better scores than various other baseline models (Vinayan et al.) that participated in the shared task.
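As an illustration of the count-based representation described above, the following is a minimal sketch of a bag-of-words vectorizer with two tunable constraints. It is not the authors' implementation: the function names `build_vocabulary` and `vectorize`, the parameters `min_df` and `max_features`, the whitespace tokenization, and the toy tweets are all assumptions made for this example.

```python
from collections import Counter


def build_vocabulary(tweets, min_df=1, max_features=None):
    """Map each retained token to a column index.

    min_df and max_features are hypothetical tuning knobs for this sketch,
    not settings reported in the paper.
    """
    # Document frequency: the number of tweets in which each token appears.
    doc_freq = Counter()
    for tweet in tweets:
        doc_freq.update(set(tweet.lower().split()))

    # Keep only tokens that appear in at least min_df tweets.
    kept = [(tok, df) for tok, df in doc_freq.items() if df >= min_df]

    # Most frequent first; ties broken alphabetically so the result is deterministic.
    kept.sort(key=lambda item: (-item[1], item[0]))
    if max_features is not None:
        kept = kept[:max_features]

    # Assign column indices in alphabetical order of the retained tokens.
    return {tok: idx for idx, tok in enumerate(sorted(tok for tok, _ in kept))}


def vectorize(tweet, vocabulary):
    """Turn one tweet into a vector of raw token counts over the vocabulary."""
    counts = Counter(tweet.lower().split())
    vector = [0] * len(vocabulary)
    for tok, n in counts.items():
        if tok in vocabulary:
            vector[vocabulary[tok]] = n
    return vector


# Toy data, invented for illustration only.
tweets = [
    "so happy happy today",
    "so angry today",
    "happy and calm",
]
vocabulary = build_vocabulary(tweets, min_df=2)
vectors = [vectorize(t, vocabulary) for t in tweets]
```

Vectors of this kind could then be passed to any standard regressor or classifier. Raising `min_df` or lowering `max_features` shrinks the vocabulary, which is one plausible reading of the parameter tuning the abstract mentions.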
