Extracting Implicit Features Based on Association Rules

Product reviews in the network shopping platform provide references to customs' purchase decision. However, existing researches on opinion objects mainly focus on explicit features, and few of scholars take implicit features into consideration. In this paper, based on Chinese online comments data preprocessing. We proposed a Fuzzy C-means algorithm based on Simulated Annealing (SA-FCM) to cluster the explicit comment sentences into 9 classes. And put each class of comment sentences into a document set. Then association rules between opinion words and opinion objects in every document set are mined and build an association rules table among classes, opinion targets and opinion words. The implicit features are discovered according to the opinion words in the association rule table. Finally, the implicit features excavate method proposed in this paper can effectively improve the accuracy of the extraction effect through an experiment verification.

[1]  Bela Gipp,et al.  TF-IDuF : A Novel Term-Weighting Scheme for User Modeling based on Users’ Personal Document Collections , 2017 .

[2]  Zhen Hai,et al.  Implicit Feature Identification via Co-occurrence Association Rule Mining , 2011, CICLing.

[3]  Zhao Ming Study on Feature Selection in Chinese Text Categorization , 2004 .

[4]  Eric Chang,et al.  Red Opal: product-feature scoring from reviews , 2007, EC '07.

[5]  Tomasz Imielinski,et al.  Mining association rules between sets of items in large databases , 1993, SIGMOD Conference.

[6]  Oren Etzioni,et al.  Extracting Product Features and Opinions from Reviews , 2005, HLT.

[7]  Yu Zhang,et al.  Extracting implicit features in online customer reviews for opinion mining , 2013, WWW '13 Companion.

[8]  Hua Xu,et al.  Implicit feature identification in Chinese reviews using explicit topic mining model , 2015, Knowl. Based Syst..

[9]  Xu Yang,et al.  Transaction rating credibility based on user group preference , 2015 .

[10]  Iryna Gurevych,et al.  Extracting Opinion Targets in a Single and Cross-Domain Setting with Conditional Random Fields , 2010, EMNLP.

[11]  Xindong Wu,et al.  The Top Ten Algorithms in Data Mining , 2009 .

[12]  Bo Pang,et al.  Thumbs up? Sentiment Classification using Machine Learning Techniques , 2002, EMNLP.

[13]  James C. Bezdek,et al.  Pattern Recognition with Fuzzy Objective Function Algorithms , 1981, Advanced Applications in Pattern Recognition.

[14]  Yue Shi,et al.  A modified particle swarm optimizer , 1998, 1998 IEEE International Conference on Evolutionary Computation Proceedings. IEEE World Congress on Computational Intelligence (Cat. No.98TH8360).

[15]  Ramakrishnan Srikant,et al.  Fast algorithms for mining association rules , 1998, VLDB 1998.

[16]  Hua Xu,et al.  Implicit feature identification via hybrid association rule mining , 2013, Expert Syst. Appl..

[17]  David L. Olson,et al.  Advanced Data Mining Techniques , 2008 .

[18]  Xueming Luo,et al.  Quantifying the Long-Term Impact of Negative Word of Mouth on Cash Flows and Stock Prices , 2009, Mark. Sci..

[19]  James C. Bezdek Selected Applications in Classifier Design , 1981 .

[20]  Oren Etzioni,et al.  OPINE: Extracting Product Features and Opinions from Reviews , 2005, HLT/EMNLP.

[21]  Ramakrishnan Srikant,et al.  Fast Algorithms for Mining Association Rules in Large Databases , 1994, VLDB.

[22]  Qing Ye,et al.  An Improved Association Rule Mining Approach to Identification ofImplicit Product Aspects , 2014 .

[23]  P. Herr,et al.  Effects of Word-of-Mouth and Product-Attribute Information on Persuasion: An Accessibility-Diagnosticity Perspective , 1991 .

[24]  W. K. Hastings,et al.  Monte Carlo Sampling Methods Using Markov Chains and Their Applications , 1970 .

[25]  Chen Yong-sheng Particle Swarm Algorithm Modified Based on Ideal of Simulated Annealing for Knapsack Problem , 2010 .