Towards Low-loss 1-bit Quantization of User-item Representations for Top-K Recommendation

Due to the promising advantages in space compression and inference acceleration, quantized representation learning for recommender systems has become an emerging research direction recently. As the target is to embed latent features in the discrete embedding space, developing quantization for user-item representations with a few low-precision integers confronts the challenge of high information loss, thus leading to unsatisfactory performance in Top-K recommendation. In this work, we study the problem of representation learning for recommendation with 1-bit quantization. We propose a model named Low-loss Quantized Graph Convolutional Network (L2Q-GCN). Different from previous work that plugs quantization as the final encoder of user-item embeddings, L2Q-GCN learns the quantized representations whilst capturing the structural information of user-item interaction graphs at different semantic levels. This achieves the substantial retention of intermediate interactive information, alleviating the feature smoothing issue for ranking caused by numerical quantization. To further improve the model performance, we also present an advanced solution named LQ-GCN0=; with quantization approximation and annealing training strategy. We conduct extensive experiments on four benchmarks over Top-K recommendation task. The experimental results show that, with nearly 9× representation storage compression, LQ-GCN0=; attains about 90∼99% performance recovery compared to the state-of-the-art model.

[1]  Irwin King,et al.  Modeling Scale-free Graphs with Hyperbolic Geometry for Knowledge-aware Recommendation , 2021, WSDM.

[2]  Wolfgang Nejdl,et al.  Hashing-Accelerated Graph Neural Networks for Link Prediction , 2021, WWW.

[3]  Michael R. Lyu,et al.  BinaryBERT: Pushing the Limit of BERT Quantization , 2020, ACL.

[4]  Stefanos Zafeiriou,et al.  Binary Graph Neural Networks , 2020, 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[5]  Ed H. Chi,et al.  Learning to Embed Categorical Features without Embedding Tables for Recommendation , 2020, KDD.

[6]  Yunhong Wang,et al.  Bi-GCN: Binary Graph Convolutional Network , 2020, 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[7]  A. Bronstein,et al.  Loss aware post-training quantization , 2019, Machine Learning.

[8]  Jiande Sun,et al.  Multi-Feature Discrete Collaborative Filtering for Fast Cold-start Recommendation , 2020, AAAI.

[9]  Xing Zhao,et al.  Learning to Hash with Graph Neural Networks for Recommender Systems , 2020, WWW.

[10]  Xiangnan He,et al.  LightGCN: Simplifying and Powering Graph Convolution Network for Recommendation , 2020, SIGIR.

[11]  Jingkuan Song,et al.  Forward and Backward Information Retention for Accurate Binary Neural Networks , 2019, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[12]  Jiyan Yang,et al.  Compositional Embeddings Using Complementary Partitions for Memory-Efficient Recommendation Systems , 2019, KDD.

[13]  Mark Coates,et al.  Multi-graph Convolution Collaborative Filtering , 2019, 2019 IEEE International Conference on Data Mining (ICDM).

[14]  Yifei Zhang,et al.  Doc2hash: Learning Discrete Latent variables for Documents Retrieval , 2019, NAACL.

[15]  Tat-Seng Chua,et al.  Neural Graph Collaborative Filtering , 2019, SIGIR.

[16]  Bernard Ghanem,et al.  DeepGCNs: Can GCNs Go As Deep As CNNs? , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[17]  Minyi Guo,et al.  Knowledge Graph Convolutional Networks for Recommender Systems , 2019, WWW.

[18]  Kilian Q. Weinberger,et al.  Simplifying Graph Convolutional Networks , 2019, ICML.

[19]  Mark Coates,et al.  Bayesian graph convolutional neural networks for semi-supervised classification , 2018, AAAI.

[20]  Nicholas D. Lane,et al.  An Empirical study of Binary Neural Networks' Optimisation , 2018, ICLR.

[21]  Jure Leskovec,et al.  Graph Convolutional Neural Networks for Web-Scale Recommender Systems , 2018, KDD.

[22]  Mohan S. Kankanhalli,et al.  Aspect-Aware Latent Factor Model: Rating Prediction with Ratings and Reviews , 2018, WWW.

[23]  Xiao-Ming Wu,et al.  Deeper Insights into Graph Convolutional Networks for Semi-Supervised Learning , 2018, AAAI.

[24]  Nicu Sebe,et al.  A Survey on Learning to Hash , 2016, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[25]  Wei Pan,et al.  Towards Accurate Binary Convolutional Neural Network , 2017, NIPS.

[26]  Jure Leskovec,et al.  Inductive Representation Learning on Large Graphs , 2017, NIPS.

[27]  Max Welling,et al.  Graph Convolutional Matrix Completion , 2017, ArXiv.

[28]  Tat-Seng Chua,et al.  Neural Collaborative Filtering , 2017, WWW.

[29]  Guowu Yang,et al.  Discrete Personalized Ranking for Fast Collaborative Filtering from Implicit Feedback , 2017, AAAI.

[30]  Philip S. Yu,et al.  HashNet: Deep Learning to Hash by Continuation , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[31]  Ben Poole,et al.  Categorical Reparameterization with Gumbel-Softmax , 2016, ICLR.

[32]  Yee Whye Teh,et al.  The Concrete Distribution: A Continuous Relaxation of Discrete Random Variables , 2016, ICLR.

[33]  Max Welling,et al.  Semi-Supervised Classification with Graph Convolutional Networks , 2016, ICLR.

[34]  Paul Covington,et al.  Deep Neural Networks for YouTube Recommendations , 2016, RecSys.

[35]  Huanbo Luan,et al.  Discrete Collaborative Filtering , 2016, SIGIR.

[36]  Tat-Seng Chua,et al.  Fast Matrix Factorization for Online Recommendation with Implicit Feedback , 2016, SIGIR.

[37]  Ali Farhadi,et al.  XNOR-Net: ImageNet Classification Using Binary Convolutional Neural Networks , 2016, ECCV.

[38]  Jianmin Wang,et al.  Deep Hashing Network for Efficient Similarity Retrieval , 2016, AAAI.

[39]  Ran El-Yaniv,et al.  Binarized Neural Networks , 2016, NIPS.

[40]  Ran El-Yaniv,et al.  Binarized Neural Networks , 2016, ArXiv.

[41]  David M. Blei,et al.  Modeling User Exposure in Recommendation , 2015, WWW.

[42]  Tat-Seng Chua,et al.  Learning Image and User Features for Recommendation in Social Networks , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[43]  Jiwen Lu,et al.  Deep hashing for compact binary codes learning , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[44]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[45]  Yoshua Bengio,et al.  Estimating or Propagating Gradients Through Stochastic Neurons for Conditional Computation , 2013, ArXiv.

[46]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[47]  Lars Schmidt-Thieme,et al.  BPR: Bayesian Personalized Ranking from Implicit Feedback , 2009, UAI.

[48]  Piotr Indyk,et al.  Similarity Search in High Dimensions via Hashing , 1999, VLDB.

[49]  Allen Gersho,et al.  Vector quantization and signal compression , 1991, The Kluwer international series in engineering and computer science.

[50]  W. R. Bennett,et al.  Spectra of quantized signals , 1948, Bell Syst. Tech. J..