Exploring Deep Recurrent Convolution Neural Networks for Subjectivity Classification

Subjectivity classification aims to discriminate whether the given text presents opinions or states facts, which is an important foundation and component of sentiment analysis task. The characteristics of microblog (e.g., short text, full noise, casual expression, and incomplete grammatical structure) bring new challenges to microblog text subjectivity classification task. The existing subjectivity classification approaches of microblog text mainly depend on the techniques of artificially constructing emotional features that are commonly based on the bag-of-words model, which will lead to the curse of dimensionality and data sparsity problem. Furthermore, microblog text usually contains the left-handed compliment or implicit representation. It is difficult to find and describe this implicit semantic representation by artificially constructing emotional features, which seriously affects the subjectivity classification accuracy. To address these problems, we introduce a deep recurrent convolution neural network model for subjectivity classification of microblog text with the multi-features combination. In our model, we apply the convolution neural networks to automatically learn classification features and use the recurrent neural networks to abstract and memorize the sequence semantic relations of microblog text. We also employ pre-trained word embedding to reduce the representation dimension of the text, which avoids dimension disasters and data sparsity issues. We perform experiments on six commonly used Twitter datasets. The experimental results demonstrate that our proposed model outperforms the state-of-the-art approaches on several datasets. Compared with the baseline methods, the accuracy and the average F1-measure of our proposed model for subjectivity classification are improved by a minimum of 1.21% and 2.82%, respectively.

[1]  Mohammad Saniee Abadeh,et al.  SubLex: Generating subjectivity lexicons using genetic algorithm for subjectivity classification of big social data , 2016, 2016 1st Conference on Swarm Intelligence and Evolutionary Computation (CSIEC).

[2]  Rabab Kreidieh Ward,et al.  Deep Sentence Embedding Using Long Short-Term Memory Networks: Analysis and Application to Information Retrieval , 2015, IEEE/ACM Transactions on Audio, Speech, and Language Processing.

[3]  Xiaolin Gui,et al.  Comparison Research on Text Pre-processing Methods on Twitter Sentiment Analysis , 2017, IEEE Access.

[4]  Jason Weston,et al.  A Neural Attention Model for Abstractive Sentence Summarization , 2015, EMNLP.

[5]  Christopher D. Manning,et al.  Improved Semantic Representations From Tree-Structured Long Short-Term Memory Networks , 2015, ACL.

[6]  Xuanjing Huang,et al.  Recurrent Neural Network for Text Classification with Multi-Task Learning , 2016, IJCAI.

[7]  Christopher Potts,et al.  Recursive Deep Models for Semantic Compositionality Over a Sentiment Treebank , 2013, EMNLP.

[8]  Donald E. Brown,et al.  RMDL: Random Multimodel Deep Learning for Classification , 2018, ICISDM '18.

[9]  Mark A. Clements,et al.  Sentence-Level Subjectivity Detection Using Neuro-Fuzzy Models , 2013, WASSA@NAACL-HLT.

[10]  Ari Rappoport,et al.  Enhanced Sentiment Learning Using Twitter Hashtags and Smileys , 2010, COLING.

[11]  S. Albayrak,et al.  Language-Independent Twitter Sentiment Analysis , 2012 .

[12]  Lillian Lee,et al.  Opinion Mining and Sentiment Analysis , 2008, Found. Trends Inf. Retr..

[13]  Jun Zhao,et al.  Recurrent Convolutional Neural Networks for Text Classification , 2015, AAAI.

[14]  Christopher D. Manning,et al.  Baselines and Bigrams: Simple, Good Sentiment and Topic Classification , 2012, ACL.

[15]  Mike Thelwall,et al.  Sentiment strength detection for the social web , 2012, J. Assoc. Inf. Sci. Technol..

[16]  Yiqun Liu,et al.  Microblog Sentiment Analysis with Emoticon Space Model , 2014, Journal of Computer Science and Technology.

[17]  Yee Whye Teh,et al.  A Fast Learning Algorithm for Deep Belief Nets , 2006, Neural Computation.

[18]  Donald E. Brown,et al.  HDLTex: Hierarchical Deep Learning for Text Classification , 2017, 2017 16th IEEE International Conference on Machine Learning and Applications (ICMLA).

[19]  Tong Zhang,et al.  Effective Use of Word Order for Text Categorization with Convolutional Neural Networks , 2014, NAACL.

[20]  Aitor Almeida,et al.  An Approach to Subjectivity Detection on Twitter Using the Structured Information , 2016, ICCCI.

[21]  Hong Yu,et al.  Towards Answering Opinion Questions: Separating Facts from Opinions and Identifying the Polarity of Opinion Sentences , 2003, EMNLP.

[22]  Yang Liu,et al.  An Efficient Cross-lingual Model for Sentence Classification Using Convolutional Neural Network , 2016, ACL.

[23]  Yulan He,et al.  Sentence Subjectivity Detection with Weakly-Supervised Learning , 2011, IJCNLP.

[24]  Quoc V. Le,et al.  Distributed Representations of Sentences and Documents , 2014, ICML.

[25]  Andrea Esuli,et al.  SentiWordNet 3.0: An Enhanced Lexical Resource for Sentiment Analysis and Opinion Mining , 2010, LREC.

[26]  Preslav Nakov,et al.  SemEval-2015 Task 10: Sentiment Analysis in Twitter , 2015, *SEMEVAL.

[27]  John G. Breslin,et al.  INSIGHT-1 at SemEval-2016 Task 4: Convolutional Neural Networks for Sentiment Classification and Quantification , 2016, SemEval@NAACL-HLT.

[28]  Bernard J. Jansen,et al.  Micro-blogging as online word of mouth branding , 2009, CHI Extended Abstracts.

[29]  Junlan Feng,et al.  Robust Sentiment Detection on Twitter from Biased and Noisy Data , 2010, COLING.

[30]  Brendan T. O'Connor,et al.  From Tweets to Polls: Linking Text Sentiment to Public Opinion Time Series , 2010, ICWSM.

[31]  Fermín L. Cruz,et al.  Tweet categorization by combining content and structural knowledge , 2016, Inf. Fusion.

[32]  H. Keshavarz,et al.  MHSubLex: Using Metaheuristic Methods for Subjectivity Classification of Microblogs , 2018 .

[33]  Erik Cambria,et al.  Bayesian Deep Convolution Belief Networks for Subjectivity Detection , 2016, 2016 IEEE 16th International Conference on Data Mining Workshops (ICDMW).

[34]  Jeffrey Dean,et al.  Distributed Representations of Words and Phrases and their Compositionality , 2013, NIPS.

[35]  Zhao Jianqiang,et al.  Combining Semantic and Prior Polarity for Boosting Twitter Sentiment Analysis , 2015, 2015 IEEE International Conference on Smart City/SocialCom/SustainCom (SmartCity).