A Pre-training Strategy for Recommendation

Item side information has been shown to be effective in building recommendation systems, and various methods have been developed to exploit it for learning users' preferences over items. Differing from previous work, this paper focuses on developing an unsupervised pre-training strategy that exploits items' multimodal side information (e.g., text and images) to learn item representations that benefit downstream applications, such as personalized item recommendation and click-through rate prediction. First, we employ a multimodal graph to describe the relationships between items and their multimodal features. We then propose a novel graph neural network, named Multimodal Graph-BERT (MG-BERT), to learn item representations from this item multimodal graph. Specifically, MG-BERT is pre-trained by solving two graph reconstruction problems: graph structure reconstruction and masked node feature reconstruction. Experimental results on real-world datasets demonstrate that MG-BERT can effectively exploit the multimodal information of items to improve downstream applications.
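To make the two pre-training objectives concrete, the following is a minimal sketch, not the authors' implementation: it substitutes a two-layer GCN encoder for the attention-based MG-BERT encoder, reconstructs the adjacency matrix with an inner-product decoder (in the style of variational graph auto-encoders), and regresses masked node features with a linear head. All names (`MaskedGraphPretrainer`, `mask_ratio`, the dense-adjacency toy setup) are illustrative assumptions.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class GCNLayer(nn.Module):
    """One graph convolution over a dense, pre-normalized adjacency (toy scale)."""
    def __init__(self, in_dim, out_dim):
        super().__init__()
        self.lin = nn.Linear(in_dim, out_dim)

    def forward(self, x, adj_norm):
        return self.lin(adj_norm @ x)

class MaskedGraphPretrainer(nn.Module):
    """Joint structure-reconstruction + masked-feature-reconstruction pre-training
    (a hypothetical stand-in for the MG-BERT objectives, not the released model)."""
    def __init__(self, feat_dim, hid_dim, mask_ratio=0.15):
        super().__init__()
        self.mask_ratio = mask_ratio
        self.mask_token = nn.Parameter(torch.zeros(feat_dim))  # learned [MASK] vector
        self.enc1 = GCNLayer(feat_dim, hid_dim)
        self.enc2 = GCNLayer(hid_dim, hid_dim)
        self.feat_head = nn.Linear(hid_dim, feat_dim)  # decodes masked node features

    def forward(self, x, adj_norm, adj_label):
        # Randomly replace a subset of node feature vectors with the mask token.
        mask = torch.rand(x.size(0)) < self.mask_ratio
        x_in = x.clone()
        x_in[mask] = self.mask_token
        # Encode the (partially masked) multimodal item graph.
        h = self.enc2(F.relu(self.enc1(x_in, adj_norm)), adj_norm)
        # Objective 1: masked node feature reconstruction.
        loss_feat = F.mse_loss(self.feat_head(h)[mask], x[mask]) if mask.any() else h.new_zeros(())
        # Objective 2: graph structure reconstruction via an inner-product decoder.
        loss_struct = F.binary_cross_entropy_with_logits(h @ h.t(), adj_label)
        return loss_feat + loss_struct

# Toy usage: 10 items, 32-dim fused multimodal features, random symmetric graph.
n, d = 10, 32
x = torch.randn(n, d)
adj_label = (torch.rand(n, n) < 0.2).float()
adj_label = ((adj_label + adj_label.t()) > 0).float().fill_diagonal_(1.0)
deg_inv_sqrt = adj_label.sum(1).clamp(min=1).pow(-0.5)
adj_norm = deg_inv_sqrt[:, None] * adj_label * deg_inv_sqrt[None, :]
model = MaskedGraphPretrainer(feat_dim=d, hid_dim=16)
loss = model(x, adj_norm, adj_label)
loss.backward()
```

In this sketch the two losses are simply summed; the relative weighting of the objectives, the masking ratio, and the choice of feature decoder are design choices the paper would tune, and the dense adjacency is workable only at toy scale.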
