Unsupervised Topic-Specific Domain Dependency Graphs for Aspect Identification in Sentiment Analysis

We propose to model a collection of documents by means of topic-specific domain dependency graphs (DDGs). We use LDA topic modeling to detect topics underlying a mixed-domain dataset and select topically pure documents from the collection. We aggregate counts of words and their dependency relations per topic, weigh them with Tf-Idf and produce a DDG by selecting the highest-ranked words and their dependency relations. We demonstrate an implementation of the approach on the task of identifying product aspects for aspect-oriented sentiment analysis. A large corpus of Amazon reviews is used to identify product aspects by applying syntactic filtering to the DDG. Evaluation on a small set of cameras reviews demonstrate a good precision of our method. To our knowledge, this is the first method that finds product-class specific aspects in mutli-domain collections in an unsupervised fashion.

[1]  Sergey Brin,et al.  The Anatomy of a Large-Scale Hypertextual Web Search Engine , 1998, Comput. Networks.

[2]  Bing Liu,et al.  Mining and summarizing customer reviews , 2004, KDD.

[3]  Sasha Blair-Goldensohn,et al.  Building a Sentiment Summarizer for Local Service Reviews , 2008 .

[4]  Bing Liu,et al.  Sentiment Analysis and Opinion Mining , 2012, Synthesis Lectures on Human Language Technologies.

[5]  Christopher D. Manning,et al.  Generating Typed Dependency Parses from Phrase Structure Parses , 2006, LREC.

[6]  Iryna Gurevych,et al.  Extracting Opinion Targets in a Single and Cross-Domain Setting with Conditional Random Fields , 2010, EMNLP.

[7]  Michael I. Jordan,et al.  Latent Dirichlet Allocation , 2001, J. Mach. Learn. Res..

[8]  Núria Bel,et al.  A Classification of Adjectives for Polarity Lexicons Enhancement , 2012, LREC.

[9]  Dragomir R. Radev,et al.  Book Review: Graph-Based Natural Language Processing and Information Retrieval by Rada Mihalcea and Dragomir Radev , 2011, CL.

[10]  Mark Newman,et al.  Networks: An Introduction , 2010 .

[11]  Saif Mohammad,et al.  NRC-Canada-2014: Detecting Aspects and Sentiment in Customer Reviews , 2014, *SEMEVAL.

[12]  Xiaojin Zhu,et al.  Seeing stars when there aren’t many stars: Graph-based semi-supervised learning for sentiment categorization , 2006 .

[13]  Joakim Nivre,et al.  Integrating Graph-Based and Transition-Based Dependency Parsers , 2008, ACL.

[14]  Guido Zuccon,et al.  Graph-based concept weighting for medical information retrieval , 2012, ADCS.

[15]  Juan Carlos Vidal,et al.  Graph-based semantic annotation for enriching educational content with linked data , 2014, Knowl. Based Syst..

[16]  Andrea Esuli,et al.  PageRanking WordNet Synsets: An Application to Opinion Mining , 2007, ACL.

[17]  Michael Halliday,et al.  Cohesion in English , 1976 .

[18]  Rada Mihalcea,et al.  Word Sense and Subjectivity , 2006, ACL.

[19]  Philipp Koehn,et al.  Synthesis Lectures on Human Language Technologies , 2016 .

[20]  Dragomir R. Radev,et al.  Networks and Natural Language Processing , 2008, AI Mag..

[21]  Robert Remus,et al.  Domain Adaptation Using Domain Similarity- and Domain Complexity-Based Instance Selection for Cross-Domain Sentiment Analysis , 2012, 2012 IEEE 12th International Conference on Data Mining Workshops.

[22]  Omer Levy,et al.  Proposition Knowledge Graphs , 2014 .

[23]  Jure Leskovec,et al.  Hidden factors and hidden topics: understanding rating dimensions with review text , 2013, RecSys.

[24]  Christina Lioma,et al.  Graph-based term weighting for information retrieval , 2011, Information Retrieval.

[25]  Andrew McCallum,et al.  Optimizing Semantic Coherence in Topic Models , 2011, EMNLP.

[26]  Nathanael Chambers,et al.  Template-Based Information Extraction without the Templates , 2011, ACL.

[27]  Qiang Yang,et al.  Cross-Domain Co-Extraction of Sentiment and Topic Lexicons , 2012, ACL.

[28]  Elena Tutubalina Target-Based Topic Model for Problem Phrase Extraction , 2015, ECIR.

[29]  Christian Biemann Structure Discovery in Natural Language , 2012, Theory and Applications of Natural Language Processing.

[30]  Meng Wang,et al.  Aspect Ranking: Identifying Important Product Aspects from Online Consumer Reviews , 2011, ACL.