Opinion Mining of Online Product Reviews from Traditional LDA Topic Clusters using Feature Ontology Tree and Sentiwordnet

Online product reviews provide data about the user‟s perspective on the features that were experienced by them. Product features and corresponding opinions form a major part in analyzing the online product reviews. Extracting features from a huge number of reviews is classified into three major categories such as utilizing language rules, sequence labeling as well as the topic modeling. Latent Dirichlet Allocation (LDA) is one such topic model which clusters the document words into unsupervised learned topics using Dirichlet priors. The words so clustered are the features and opinion words in the product reviews domain. To identify appropriate product features from these clusters a hierarchical, domain independent Feature Ontology Tree (FOT) is applied to LDA clusters. The opinion bearing words of obtained product features are identified by utilizing the document indicators available from topic matrix of LDA. These indicators are useful to backtrack to the corresponding online review in which the product feature is present. The polarity of the opinion bearing word is calculated with the help of SentiWordNet. This improves the accuracy of the features using extracted LDA topic clusters and machine interpretation of polarity of opinion word is satisfactory.

[1]  Michael L. Littman,et al.  Measuring praise and criticism: Inference of semantic orientation from association , 2003, TOIS.

[2]  Mark Steyvers,et al.  Finding scientific topics , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[3]  Giuseppe Carenini,et al.  Extracting knowledge from evaluative text , 2005, K-CAP '05.

[4]  Satoshi Sekine,et al.  SurfShop: combing a product ontology with topic model results for online window-shopping , 2012, HLT-NAACL.

[5]  Ian Horrocks,et al.  Ontologies and the semantic web , 2008, CACM.

[6]  D. Teja Santosh,et al.  OBTAINING FEATURE- AND SENTIMENT-BASED LINKED INSTANCE RDF DATA FROM UNSTRUCTURED REVIEWS USING ONTOLOGY-BASED MACHINE LEARNING , 2015 .

[7]  Maya R. Gupta,et al.  Introduction to the Dirichlet Distribution and Related Processes , 2010 .

[8]  W. Hall,et al.  Using Protege for automatic ontology instantiation , 2004 .

[9]  Bing Liu,et al.  Mining and summarizing customer reviews , 2004, KDD.

[10]  Dan Klein,et al.  Feature-Rich Part-of-Speech Tagging with a Cyclic Dependency Network , 2003, NAACL.

[11]  Oren Etzioni,et al.  Extracting Product Features and Opinions from Reviews , 2005, HLT.

[12]  Bo Pang,et al.  Thumbs up? Sentiment Classification using Machine Learning Techniques , 2002, EMNLP.

[13]  Thomas Hofmann,et al.  Probabilistic latent semantic indexing , 1999, SIGIR '99.

[14]  Masoud Rahgozar,et al.  An ontology-based semantic extraction approach for B2C ecommerce , 2011, Int. Arab J. Inf. Technol..

[15]  B. Vishnu Vardhan,et al.  Extracting Product Features from Reviews Using Feature Ontology Tree Applied on LDA Topic Clusters , 2016, 2016 IEEE 6th International Conference on Advanced Computing (IACC).

[16]  Vishnu Vardhan Bulusu,et al.  Obtaining Feature- and Sentiment-Based Linked Instance RDF Data from Unstructured Reviews using Ontology-Based Machine Learning , 2015 .

[17]  Noémie Elhadad,et al.  An Unsupervised Aspect-Sentiment Model for Online Reviews , 2010, NAACL.

[18]  Iryna Gurevych,et al.  A Comparative Study of Feature Extraction Algorithms in Customer Reviews , 2008, 2008 IEEE International Conference on Semantic Computing.

[19]  Razvan C. Bunescu,et al.  Sentiment analyzer: extracting sentiments about a given topic using natural language processing techniques , 2003, Third IEEE International Conference on Data Mining.