Stance Detection on Microblog Focusing on Syntactic Tree Representation

Microblog, especially Twitter, has become an integral part of our daily life, where millions of users expressing their opinions daily towards various target entities. Detecting and analyzing user stances from such massive opinion-oriented twitter posts provide enormous opportunities to journalists, governments, companies, and other organizations. However, the short length characteristics and frequent use of idiosyncratic abbreviations in tweets make this task challenging to infer the users’ stance automatically towards a particular target. In this paper, we leverage the syntactic tree representation of tweets to detect the tweet stance. We devise a new parts-of-speech (POS) generalization technique and employ the hashtag segmentation for effective tree representation. Then, we make use of support vector machine (SVM) classifier with three different tree kernel functions including subtree (ST) kernel, subset tree (SST) kernel, and partial tree (PT) kernel as the base-classifiers. Finally, a majority voting count based prediction scheme is employed to identify the tweet stance. We conducted our experiments using SemEval-2016 twitter stance detection dataset. Experimental results demonstrate the effectiveness of our proposed method over the baseline and the state-of-the-art related works.

[1]  Stan Matwin,et al.  From Argumentation Mining to Stance Classification , 2015, ArgMining@HLT-NAACL.

[2]  Alberto Barrón-Cedeño,et al.  Selecting Sentences versus Selecting Tree Constituents for Automatic Question Ranking , 2016, COLING.

[3]  M. de Rijke,et al.  A syntax-aware re-ranker for microblog retrieval , 2014, SIGIR.

[4]  Josep L. Rosselló,et al.  FPGA-Based Stochastic Echo State Networks for Time-Series Forecasting , 2015, Comput. Intell. Neurosci..

[5]  Dan Klein,et al.  Accurate Unlexicalized Parsing , 2003, ACL.

[6]  Guido Zarrella,et al.  MITRE at SemEval-2016 Task 6: Transfer Learning for Stance Detection , 2016, *SEMEVAL.

[7]  Beatrice Santorini,et al.  The Penn Treebank: An Overview , 2003 .

[8]  Xiao Zhang,et al.  pkudblab at SemEval-2016 Task 6 : A Specific Convolutional Neural Network System for Effective Stance Detection , 2016, *SEMEVAL.

[9]  Marilyn A. Walker,et al.  Cats Rule and Dogs Drool!: Classifying Stance in Online Debate , 2011, WASSA@ACL.

[10]  Lior Rokach,et al.  Ensemble-based classifiers , 2010, Artificial Intelligence Review.

[11]  Nikos Pelekis,et al.  DataStories at SemEval-2017 Task 4: Deep LSTM with Attention for Message-level and Topic-based Sentiment Analysis , 2017, *SEMEVAL.

[12]  Swapna Somasundaran,et al.  Recognizing Stances in Ideological On-Line Debates , 2010, HLT-NAACL 2010.

[13]  Huan Liu,et al.  Identifying Users with Opposing Opinions in Twitter Debates , 2014, SBP.

[14]  Ruifeng Xu,et al.  Stance Classification with Target-specific Neural Attention , 2017, IJCAI.

[15]  Alessandro Moschitti,et al.  Making Tree Kernels Practical for Natural Language Learning , 2006, EACL.

[16]  Dimitris Kotzinos,et al.  What does Twitter have to say about ideology , 2014 .

[17]  Martin Tutek,et al.  TakeLab at SemEval-2016 Task 6: Stance Classification in Tweets Using a Genetic Algorithm Based Ensemble , 2016, *SEMEVAL.

[18]  Xiaoyong Liu,et al.  Support Vector Machine with Ensemble Tree Kernel for Relation Extraction , 2016, Comput. Intell. Neurosci..

[19]  Saif Mohammad,et al.  SemEval-2016 Task 6: Detecting Stance in Tweets , 2016, *SEMEVAL.

[20]  Lei Shi,et al.  Connecting Targets to Tweets: Semantic Attention-Based Model for Target-Specific Stance Detection , 2017, WISE.