Sentiment Classification Based on Syntax Tree Pruning and Tree Kernel

Sentiment classification is a way to analyze the subjective information in the text and then mine the opinion. We focus on the sentence-level sentiment classification. On the systematically analyzing the importance and difficulties of the sentence-level sentiment classification, this paper proposes a syntax tree pruning and tree kernel-based approach to sentiment classification. In our method, the convolution kernel of SVM is first used to obtain structured information, and then apply syntax tree as a feature in Sentiment Classification. Firstly, we focus on how to apply the structured features from the syntax tree to the sentiment classification and propose a novel approach of sentence-level sentiment classification which apply the tree kernel and composite kernel to the SVM classifier. Secondly, we provide two kinds of syntax tree pruning strategies: adjectives-based and sentiment words-based. The experimental results show that our method can achieve better performance in sentence level Sentiment Classification.

[1]  Bo Pang,et al.  Thumbs up? Sentiment Classification using Machine Learning Techniques , 2002, EMNLP.

[2]  Tianfang Yao,et al.  A Kernel-Based Sentiment Classification Approach for Chinese Sentences , 2009, 2009 WRI World Congress on Computer Science and Information Engineering.

[3]  Hiroya Takamura,et al.  Sentiment Classification Using Word Sub-sequences and Dependency Sub-trees , 2005, PAKDD.

[4]  Bing Liu,et al.  Opinion Extraction and Summarization on the Web , 2006, AAAI.

[5]  Peter D. Turney Thumbs Up or Thumbs Down? Semantic Orientation Applied to Unsupervised Classification of Reviews , 2002, ACL.

[6]  Nigel Collier,et al.  Sentiment Analysis using Support Vector Machines with Diverse Information Sources , 2004, EMNLP.

[7]  J. Kamps,et al.  Words with attitude , 2002 .

[8]  Michael Collins,et al.  Convolution Kernels for Natural Language , 2001, NIPS.

[9]  Soo-Min Kim,et al.  Determining the Sentiment of Opinions , 2004, COLING.

[10]  Christopher D. Manning,et al.  Exploring Sentiment Summarization , 2004 .

[11]  Michael Collins,et al.  New Ranking Algorithms for Parsing and Tagging: Kernels over Discrete Structures, and the Voted Perceptron , 2002, ACL.

[12]  Jian Su,et al.  A Composite Kernel to Extract Relations between Entities with Both Flat and Structured Features , 2006, ACL.

[13]  Alok N. Choudhary,et al.  Sentiment Analysis of Conditional Sentences , 2009, EMNLP.

[14]  Bo Pang,et al.  A Sentimental Education: Sentiment Analysis Using Subjectivity Summarization Based on Minimum Cuts , 2004, ACL.

[15]  Vasileios Hatzivassiloglou,et al.  Predicting the Semantic Orientation of Adjectives , 1997, ACL.

[16]  David M. Pennock,et al.  Mining the peanut gallery: opinion extraction and semantic classification of product reviews , 2003, WWW '03.

[17]  Alison Huettner,et al.  Fuzzy Typing for Document Management , 2000 .