A Sentiment-Aware Topic Model for Extracting Failures from Product Reviews

This paper describes a probabilistic model that aims to extract different kinds of product difficulties conditioned on users’ dissatisfaction through the use of sentiment information. The proposed model learns a distribution over words, associated with topics, sentiment and problem labels. The results were evaluated on reviews of products, randomly sampled from several domains (automobiles, home tools, electronics, and baby products), and user comments about mobile applications, in English and Russian. The model obtains a better performance than several state-of-the-art models in terms of the likelihood of a held-out test and outperforms these models in a classification task.