An Unsupervised Approach for Aspect Category Detection Using Soft Cosine Similarity Measure

Aspect category detection is an important and challenging subtask of aspect-based sentiment analysis. Given a set of pre-defined categories, the task is to detect the categories that a review sentence indicates, whether implicitly or explicitly. Supervised machine learning approaches perform well on this subtask. However, their performance depends on the availability of labeled training data, which is often difficult and costly to obtain, and most of them also require feature engineering to perform well. In this paper, we propose an unsupervised method that addresses aspect category detection without any feature engineering. Our method uses clusters of unlabeled reviews together with the soft cosine similarity measure. Experimental results on the SemEval-2014 restaurant dataset show that the proposed unsupervised approach outperforms several baselines by a substantial margin.
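To make the central measure concrete, the following minimal sketch computes soft cosine similarity in its standard form, soft_cos(a, b) = (a^T S b) / (sqrt(a^T S a) * sqrt(b^T S b)), where S holds pairwise word similarities. It is an illustration only, not the pipeline evaluated in the paper: the three-word vocabulary, the matrix values, the category vectors, and the function names `bilinear` and `soft_cosine` are all assumptions made for this example. In practice S would typically be derived from word embeddings.

```python
import math

def bilinear(a, b, S):
    """Return a^T S b for vectors a, b and a square similarity matrix S."""
    return sum(a[i] * S[i][j] * b[j]
               for i in range(len(a))
               for j in range(len(b)))

def soft_cosine(a, b, S):
    """Soft cosine similarity of a and b under word-similarity matrix S."""
    denom = math.sqrt(bilinear(a, a, S)) * math.sqrt(bilinear(b, b, S))
    if denom == 0:
        return 0.0  # at least one vector is empty
    return bilinear(a, b, S) / denom

# Hypothetical toy vocabulary: index 0 = "pizza", 1 = "pasta", 2 = "waiter".
# S[i][j] is an assumed similarity between word i and word j (1.0 on the diagonal).
S = [
    [1.0, 0.8, 0.1],
    [0.8, 1.0, 0.1],
    [0.1, 0.1, 1.0],
]
# Identity matrix: every word is similar only to itself (ordinary cosine).
I = [[1.0 if i == j else 0.0 for j in range(3)] for i in range(3)]

sentence = [1, 0, 0]  # bag-of-words vector for a sentence mentioning "pizza"
categories = {        # hypothetical category representations
    "food": [0, 1, 0],     # represented by "pasta"
    "service": [0, 0, 1],  # represented by "waiter"
}

scores = {name: soft_cosine(sentence, vec, S) for name, vec in categories.items()}
best = max(scores, key=scores.get)
print(scores, best)
```

With the identity matrix `I` in place of `S`, the measure reduces to ordinary cosine similarity, and the sentence would score zero against both categories because it shares no word with either. The soft variant still relates "pizza" to "pasta" through the similarity matrix, which is why such a measure can pick up categories that are only implied by related vocabulary.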
