A hybrid of XGBoost and aspect-based review mining with attention neural network for user preference prediction

With the rapid development of the internet, users tend to refer to the rating scores or review opinions on social platforms. Most recommendation systems use collaborative filtering (CF) methods to recommend items based on users’ ratings. The rating-based CF methods do not consider users’ review opinions on different aspects of items. The accuracy of the rating predictions can be effectively improved by considering the latent semantics and various aspects of user reviews. In this paper, a novel rating prediction method is proposed according to an attention-based gated recurrent unit (GRU) deep learning model with semantic aspects. A two-phase method is proposed herein; it combines the word attention mechanism and review semantics to extract aspect features from user preferences. In the first phase, a bidirectional GRU neural network is adopted according to word attention in order to extract important words from users’ reviews. In the second phase, we split users’ reviews into words, and generate the aspect-based attention semantic vectors from these reviews based on Latent Dirichlet Allocation and the attention weights of the chosen words. The XGBoost method is then adopted to predict user preference ratings based on the aspect-based attention semantic vectors. The experimental results show that the proposed method outperforms traditional prediction methods and effectively improves the accuracy of predictions.

[1]  Tianqi Chen,et al.  XGBoost: A Scalable Tree Boosting System , 2016, KDD.

[2]  Pasquale Lops,et al.  Content-based Recommender Systems: State of the Art and Trends , 2011, Recommender Systems Handbook.

[3]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[4]  Gediminas Adomavicius,et al.  Toward the next generation of recommender systems: a survey of the state-of-the-art and possible extensions , 2005, IEEE Transactions on Knowledge and Data Engineering.

[5]  Markus Schaal,et al.  Topic Extraction from Online Reviews for Classification and Recommendation , 2013, IJCAI.

[6]  Claire Cardie,et al.  Multi-aspect Sentiment Analysis with Topic Models , 2011, 2011 IEEE 11th International Conference on Data Mining Workshops.

[7]  Yun-Cheng Chou,et al.  Online recommendations based on dynamic adjustment of recommendation lists , 2018, Knowl. Based Syst..

[8]  D.M. Mount,et al.  An Efficient k-Means Clustering Algorithm: Analysis and Implementation , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[9]  Erik Cambria,et al.  Recent Trends in Deep Learning Based Natural Language Processing , 2017, IEEE Comput. Intell. Mag..

[10]  Martin Ester,et al.  FLAME: A Probabilistic Model Combining Aspect Based Opinion Mining and Collaborative Filtering , 2015, WSDM.

[11]  Jürgen Schmidhuber,et al.  Long Short-Term Memory , 1997, Neural Computation.

[12]  Yoshua Bengio,et al.  Learning Phrase Representations using RNN Encoder–Decoder for Statistical Machine Translation , 2014, EMNLP.

[13]  Léon Bottou,et al.  Large-Scale Machine Learning with Stochastic Gradient Descent , 2010, COMPSTAT.

[14]  Amélie Marian,et al.  Beyond the Stars: Improving Rating Predictions using Review Text Content , 2009, WebDB.

[15]  Songjie Gong A Collaborative Filtering Recommendation Algorithm Based on User Clustering and Item Clustering , 2010, J. Softw..

[16]  Ivan Titov,et al.  Modeling online reviews with multi-grain topic models , 2008, WWW.

[17]  Bing Liu,et al.  Opinion observer: analyzing and comparing opinions on the Web , 2005, WWW '05.

[18]  Stephanie Rogers,et al.  Improving Restaurants by Extracting Subtopics from Yelp Reviews , 2014 .

[19]  J J Hopfield,et al.  Neural networks and physical systems with emergent collective computational abilities. , 1982, Proceedings of the National Academy of Sciences of the United States of America.

[20]  Mahmoud Al-Ayyoub,et al.  Using long short-term memory deep neural networks for aspect-based sentiment analysis of Arabic reviews , 2018, International Journal of Machine Learning and Cybernetics.

[21]  Enrique Herrera-Viedma,et al.  Sentiment analysis: A review and comparative analysis of web services , 2015, Inf. Sci..

[22]  Michael R. Lyu,et al.  Ratings meet reviews, a combined approach to recommend , 2014, RecSys '14.

[23]  Zachary Chase Lipton A Critical Review of Recurrent Neural Networks for Sequence Learning , 2015, ArXiv.

[24]  Yehuda Koren,et al.  Matrix Factorization Techniques for Recommender Systems , 2009, Computer.

[25]  Liu Zhen,et al.  Word-character attention model for Chinese text classification , 2019 .

[26]  Idris Rabiu,et al.  Recommendation system exploiting aspect-based opinion mining with deep learning method , 2020, Inf. Sci..

[27]  Diyi Yang,et al.  Hierarchical Attention Networks for Document Classification , 2016, NAACL.

[28]  Shiliang Sun,et al.  A review of natural language processing techniques for opinion mining systems , 2017, Inf. Fusion.

[29]  Lei Zheng,et al.  Joint Deep Modeling of Users and Items Using Reviews for Recommendation , 2017, WSDM.

[30]  Yue Lu,et al.  Latent aspect rating analysis on review text data: a rating regression approach , 2010, KDD.

[31]  Michael I. Jordan,et al.  Latent Dirichlet Allocation , 2001, J. Mach. Learn. Res..

[32]  Alice H. Oh,et al.  Aspect and sentiment unification model for online review analysis , 2011, WSDM '11.

[33]  Yoshua Bengio,et al.  Neural Machine Translation by Jointly Learning to Align and Translate , 2014, ICLR.

[34]  Franciska de Jong,et al.  Care more about customers: Unsupervised domain-independent aspect detection for sentiment analysis of customer reviews , 2013, Knowl. Based Syst..

[35]  Jure Leskovec,et al.  Hidden factors and hidden topics: understanding rating dimensions with review text , 2013, RecSys.

[36]  Christopher Potts,et al.  Recursive Deep Models for Semantic Compositionality Over a Sentiment Treebank , 2013, EMNLP.

[37]  Noémie Elhadad,et al.  An Unsupervised Aspect-Sentiment Model for Online Reviews , 2010, NAACL.

[38]  Jing Huang,et al.  Interpretable Convolutional Neural Networks with Dual Local and Global Attention for Review Rating Prediction , 2017, RecSys.

[39]  Tara N. Sainath,et al.  Deep Neural Networks for Acoustic Modeling in Speech Recognition: The Shared Views of Four Research Groups , 2012, IEEE Signal Processing Magazine.

[40]  Bing Liu,et al.  Mining and summarizing customer reviews , 2004, KDD.

[41]  Yoon Kim,et al.  Convolutional Neural Networks for Sentence Classification , 2014, EMNLP.

[42]  Jeffrey Pennington,et al.  GloVe: Global Vectors for Word Representation , 2014, EMNLP.

[43]  José A. Olivas,et al.  A T1OWA fuzzy linguistic aggregation methodology for searching feature-based opinions , 2020, Knowl. Based Syst..

[44]  Kim Schouten,et al.  SOBA: Semi-automated Ontology Builder for Aspect-based sentiment analysis , 2020, J. Web Semant..

[45]  Ye Zhang,et al.  A Sensitivity Analysis of (and Practitioners’ Guide to) Convolutional Neural Networks for Sentence Classification , 2015, IJCNLP.

[46]  John Riedl,et al.  Item-based collaborative filtering recommendation algorithms , 2001, WWW '01.

[47]  Ting Liu,et al.  Document Modeling with Gated Recurrent Neural Network for Sentiment Classification , 2015, EMNLP.

[48]  Ting Liu,et al.  User Modeling with Neural Network for Review Rating Prediction , 2015, IJCAI.