A grammatical dependency improved CRF learning approach for integrated product extraction

The aspect-based opinion mining aims to provide finegrained product feature analysis from the product reviews. Nowadays, newer works based on probabilistic models have achieved satisfactory result on product entities extraction, but these works didn't take the multi-words feature expression problems into consideration which lead to inaccurate match between the feature and the sentimental orientation. In this paper, we propose an approach, based CRF learning model, to extract integrated feature expression from Chinese product reviews, and improve it by grammatical dependency generated by Stanford Parser. The experiment results based on actual data, demonstrate that the proposed approach is effective and domain-independent. Also we find introducing grammatical dependency into CRF model can improve the precision and recall in varying degrees.

[1]  Bo Pang,et al.  Thumbs up? Sentiment Classification using Machine Learning Techniques , 2002, EMNLP.

[2]  Daniel Dajun Zeng,et al.  Mining Fine Grained Opinions by Using Probabilistic Models and Domain Knowledge , 2010, 2010 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology.

[3]  Ming Xu,et al.  Feature-level sentiment analysis for Chinese product reviews , 2011, 2011 3rd International Conference on Computer Research and Development.

[4]  Bing Liu,et al.  Mining and summarizing customer reviews , 2004, KDD.

[5]  Li Chen,et al.  Comparison of Model-Based Learning Methods for Feature-Level Opinion Mining , 2011, 2011 IEEE/WIC/ACM International Conferences on Web Intelligence and Intelligent Agent Technology.

[6]  Bing Liu,et al.  Sentiment Analysis and Subjectivity , 2010, Handbook of Natural Language Processing.

[7]  Jochen L. Leidner,et al.  Handbook of Natural Language Processing (second edition) , 2011 .

[8]  Peter D. Turney Thumbs Up or Thumbs Down? Semantic Orientation Applied to Unsupervised Classification of Reviews , 2002, ACL.

[9]  Rohini K. Srihari,et al.  OpinionMiner: a novel machine learning system for web opinion mining and extraction , 2009, KDD.

[10]  Oren Etzioni,et al.  Extracting Product Features and Opinions from Reviews , 2005, HLT.

[11]  Andrew McCallum,et al.  Efficiently Inducing Features of Conditional Random Fields , 2002, UAI.

[12]  Andrew McCallum,et al.  Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data , 2001, ICML.

[13]  Suk Hwan Lim,et al.  Extracting and Ranking Product Features in Opinion Documents , 2010, COLING.