Representation Learning of Users and Items for Review Rating Prediction Using Attention-based Convolutional Neural Network

It is common nowadays for e-commerce websites to encourage their users to rate shopping items and write review text. This review text information has been proven to be very useful in understanding user preferences and item properties, and thus enhances the capability of these websites to make personalized recommendations. In this paper, we propose to model user preferences and item properties using a convolutional neural network (CNN) with attention, motivated by the huge success of CNN for many natural language processing tasks. By using aggregated review text from users and items, we aim to build vector representations of user and item using attention-based CNNs. These vector representations are then used to predict rating values for a user on an item. We train these user and item networks jointly, which enables the interaction between users and items in a way similar to the matrix factorization technique. In addition, the visualization of the attention layer gives us insight on when words are selected by the models that highlight a user’s preferences or an item’s properties. We validate the proposed models on popular review datasets, Yelp and Amazon, and compare results with matrix factorization (MF), and hidden factor and topical (HFT) models. Our experiments show improvement over HFT, which proves the effectiveness of these representations learned from our networks on review text for rating prediction.

[1]  Eduard H. Hovy,et al.  When Are Tree Structures Necessary for Deep Learning of Representations? , 2015, EMNLP.

[2]  Jure Leskovec,et al.  Inferring Networks of Substitutable and Complementary Products , 2015, KDD.

[3]  Michael R. Lyu,et al.  Ratings meet reviews, a combined approach to recommend , 2014, RecSys '14.

[4]  Dit-Yan Yeung,et al.  Collaborative Deep Learning for Recommender Systems , 2014, KDD.

[5]  Yann LeCun,et al.  Very Deep Convolutional Networks for Natural Language Processing , 2016, ArXiv.

[6]  Alessandro Moschitti,et al.  Twitter Sentiment Analysis with Deep Convolutional Neural Networks , 2015, SIGIR.

[7]  Yelong Shen,et al.  A Latent Semantic Model with Convolutional-Pooling Structure for Information Retrieval , 2014, CIKM.

[8]  Xiang Zhang,et al.  Character-level Convolutional Networks for Text Classification , 2015, NIPS.

[9]  Alexander M. Rush,et al.  Character-Aware Neural Language Models , 2015, AAAI.

[10]  Christopher Potts,et al.  Recursive Deep Models for Semantic Compositionality Over a Sentiment Treebank , 2013, EMNLP.

[11]  Jure Leskovec,et al.  Hidden factors and hidden topics: understanding rating dimensions with review text , 2013, RecSys.

[12]  Aaron C. Courville,et al.  Learning Distributed Representations from Reviews for Collaborative Filtering , 2015, RecSys.

[13]  Ting Liu,et al.  Learning Semantic Representations of Users and Products for Document Level Sentiment Classification , 2015, ACL.

[14]  Diyi Yang,et al.  Hierarchical Attention Networks for Document Classification , 2016, NAACL.

[15]  Patrick Seemann,et al.  Matrix Factorization Techniques for Recommender Systems , 2014 .

[16]  Tomas Mikolov,et al.  Enriching Word Vectors with Subword Information , 2016, TACL.

[17]  Michael I. Jordan,et al.  Latent Dirichlet Allocation , 2001, J. Mach. Learn. Res..

[18]  Cícero Nogueira dos Santos,et al.  Deep Convolutional Neural Networks for Sentiment Analysis of Short Texts , 2014, COLING.

[19]  Anton van den Hengel,et al.  Image-Based Recommendations on Styles and Substitutes , 2015, SIGIR.

[20]  Jie Zhang,et al.  TopicMF: Simultaneously Exploiting Ratings and Reviews for Recommendation , 2014, AAAI.