Hierarchical features-based targeted aspect extraction from online reviews

With the prevalence of online review websites, large-scale data promote the necessity of focused analysis. This task aims to capture the information that is highly relevant to a specific aspect. However, the broad scope of the aspects of the various products makes this task overarching but challenging. A commonly used solution is to modify the topic models with additional information to capture the features for a specific aspect (referred to as a targeted aspect). However, the existing topic models, either perform the full analysis to capture features as many as possible or estimate the similarity to capture features as coherent as possible, overlook the fine-grained semantic relations between the features, resulting in the captured features coarse and confusing. In this paper, we propose a novel Hierarchical Features-based Topic Model (HFTM) to extract targeted aspects from online reviews, then to capture the aspect-specific features. Specifically, our model can not only capture the direct features posing target-to-feature semantics but also capture the latent features posing feature-to-feature semantics. The experiments conducted on real-world datasets demonstrate that HFTMl outperforms the state-of-the-art baselines in terms of both aspect extraction and document classification.

[1]  Hassan Dawood,et al.  A Multi-Layer Dual Attention Deep Learning Model With Refined Word Embeddings for Aspect-Based Sentiment Analysis , 2019, IEEE Access.

[2]  K. Nimala,et al.  A Robust User Sentiment Biterm Topic Mixture Model Based on User Aggregation Strategy to Avoid Data Sparsity for Short Text , 2019, Journal of Medical Systems.

[3]  Mohammad-Reza Feizi-Derakhshi,et al.  Overlapping community detection in rating-based social networks through analyzing topics, ratings and links , 2018, Pattern Recognit..

[4]  Mingming Jiang,et al.  A novel temporal and topic-aware recommender model , 2018, World Wide Web.

[5]  Hadi Tabatabaee Malazi,et al.  Dynamic windowing mechanism to combine sentiment and N-gram analysis in detecting events from social media , 2018, Knowledge and Information Systems.

[6]  Chuhan Wu,et al.  A hybrid unsupervised method for aspect term and opinion target extraction , 2018, Knowl. Based Syst..

[7]  Stefan Kramer,et al.  Online multi-label dependency topic models for text classification , 2018, Machine Learning.

[8]  Ji Zhang,et al.  Locally weighted embedding topic modeling by markov random walk structure approximation and sparse regularization , 2018, Neurocomputing.

[9]  Dimitrios Gunopulos,et al.  Learning patterns for discovering domain-oriented opinion words , 2017, Knowledge and Information Systems.

[10]  Vili Podgorelec,et al.  Text classification method based on self-training and LDA topic models , 2017, Expert Syst. Appl..

[11]  Xiaohui Yu,et al.  Collaborative filtering with weighted opinion aspects , 2016, Neurocomputing.

[12]  P. Senkul,et al.  Context-aware location recommendation by using a random walk-based approach , 2016, Knowledge and Information Systems.

[13]  Dat Quoc Nguyen,et al.  Improving Topic Models with Latent Feature Word Representations , 2015, TACL.

[14]  Tianqing Zhu,et al.  Privacy-preserving topic model for tagging recommender systems , 2015, Knowledge and Information Systems.

[15]  Jiafeng Guo,et al.  BTM: Topic Modeling over Short Texts , 2014, IEEE Transactions on Knowledge and Data Engineering.

[16]  Mark Steyvers,et al.  Finding scientific topics , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[17]  Bin Ma,et al.  Modeling Latent Topics and Temporal Distance for Story Segmentation of Broadcast News , 2017, IEEE/ACM Transactions on Audio, Speech, and Language Processing.

[18]  Michael I. Jordan,et al.  Latent Dirichlet Allocation , 2009 .