Lightweight Self-Attentive Sequential Recommendation

Modern deep neural networks (DNNs) have greatly facilitated the development of sequential recommender systems by achieving state-of-the-art recommendation performance on various sequential recommendation tasks. Given a sequence of interacted items, existing DNN-based sequential recommenders commonly embed each item into a unique vector to support subsequent computations of the user interest. However, due to the potentially large number of items, the over-parameterised item embedding matrix of a sequential recommender has become a memory bottleneck for efficient deployment in resource-constrained environments, e.g., smartphones and other edge devices. Furthermore, we observe that the widely-used multi-head self-attention, though being effective in modelling sequential dependencies among items, heavily relies on redundant attention units to fully capture both global and local item-item transition patterns within a sequence. In this paper, we introduce a novel lightweight self-attentive network (LSAN) for sequential recommendation. To aggressively compress the original embedding matrix, LSAN leverages the notion of compositional embeddings, where each item embedding is composed by merging a group of selected base embedding vectors derived from substantially smaller embedding matrices. Meanwhile, to account for the intrinsic dynamics of each item, we further propose a temporal context-aware embedding composition scheme. Besides, we develop an innovative twin-attention network that alleviates the redundancy of the traditional multi-head self-attention while retaining full capacity for capturing long- and short-term (i.e., global and local) item dependencies. Comprehensive experiments demonstrate that LSAN significantly advances the accuracy and memory efficiency of existing sequential recommenders.

[1]  Jure Leskovec,et al.  Predicting Dynamic Embedding Trajectory in Temporal Interaction Networks , 2019, KDD.

[2]  Edward Y. Chang,et al.  Improving Sequential Recommendation with Knowledge-Enhanced Memory Networks , 2018, SIGIR.

[3]  Weisong Shi,et al.  Edge Computing: Vision and Challenges , 2016, IEEE Internet of Things Journal.

[4]  Cordelia Schmid,et al.  Product Quantization for Nearest Neighbor Search , 2011, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[5]  Ke Wang,et al.  Personalized Top-N Sequential Recommendation via Convolutional Sequence Embedding , 2018, WSDM.

[6]  Walid Krichene,et al.  On Sampled Metrics for Item Recommendation , 2020, KDD.

[7]  Zi Huang,et al.  Collaborative Learning for Extremely Low Bit Asymmetric Hashing , 2018, IEEE Transactions on Knowledge and Data Engineering.

[8]  Wilfred Ng,et al.  SDM: Sequential Deep Matching Model for Online Large-scale Recommender System , 2019, CIKM.

[9]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[10]  Michael R. Lyu,et al.  Where You Like to Go Next: Successive Point-of-Interest Recommendation , 2013, IJCAI.

[11]  Hao Zhou,et al.  Less Is More: Towards Compact CNNs , 2016, ECCV.

[12]  Hongzhi Yin,et al.  Spatio-Temporal Recommendation in Social Media , 2016, SpringerBriefs in Computer Science.

[13]  Ji-Rong Wen,et al.  Taxonomy-Aware Multi-Hop Reasoning Networks for Sequential Recommendation , 2019, WSDM.

[14]  GAG , 2020, Proceedings of the 43rd International ACM SIGIR Conference on Research and Development in Information Retrieval.

[15]  Kevin Gimpel,et al.  Bridging Nonlinearities and Stochastic Regularizers with Gaussian Error Linear Units , 2016, ArXiv.

[16]  Zi Huang,et al.  GAG: Global Attributed Graph Neural Network for Streaming Session-based Recommendation , 2020, SIGIR.

[17]  Dik Lun Lee,et al.  Billion-scale Commodity Embedding for E-commerce Recommendation in Alibaba , 2018, KDD.

[18]  Zi Huang,et al.  Exploiting Cross-session Information for Session-based Recommendation with Graph Neural Networks , 2020, ACM Trans. Inf. Syst..

[19]  Julian J. McAuley,et al.  Translation-based Recommendation , 2017, RecSys.

[20]  Zhijian Liu,et al.  Lite Transformer with Long-Short Range Attention , 2020, ICLR.

[21]  Yoshua Bengio,et al.  Neural Machine Translation by Jointly Learning to Align and Translate , 2014, ICLR.

[22]  Tong Chen,et al.  Discovering Collaborative Signals for Next POI Recommendation with Iterative Seq2Graph Augmentation , 2021, IJCAI.

[23]  M. de Rijke,et al.  RepeatNet: A Repeat Aware Neural Recommendation Machine for Session-based Recommendation , 2018, AAAI.

[24]  Julian J. McAuley,et al.  Fusing Similarity Models with Markov Chains for Sparse Sequential Recommendation , 2016, 2016 IEEE 16th International Conference on Data Mining (ICDM).

[25]  Ji-Rong Wen,et al.  S3-Rec: Self-Supervised Learning for Sequential Recommendation with Mutual Information Maximization , 2020, CIKM.

[26]  Zheng Zhang,et al.  Context-Aware Attention-Based Data Augmentation for POI Recommendation , 2019, 2019 IEEE 35th International Conference on Data Engineering Workshops (ICDEW).

[27]  Yizhou Sun,et al.  Learning K-way D-dimensional Discrete Code For Compact Embedding Representations , 2017, ICML.

[28]  Zi Huang,et al.  SADIH: Semantic-Aware DIscrete Hashing , 2019, AAAI.

[29]  Jiliang Tang,et al.  Automated Embedding Size Search in Deep Recommender Systems , 2020, SIGIR.

[30]  Ming-Wei Chang,et al.  BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding , 2019, NAACL.

[31]  Lukasz Kaiser,et al.  Attention is All you Need , 2017, NIPS.

[32]  Xing Xie,et al.  LightRec: A Memory and Search-Efficient Recommender System , 2020, WWW.

[33]  Alex Beutel,et al.  Recurrent Recommender Networks , 2017, WSDM.

[34]  Walid Krichene,et al.  On Sampled Metrics for Item Recommendation (Extended Abstract) , 2021, IJCAI.

[35]  Hideki Nakayama,et al.  Compressing Word Embeddings via Deep Compositional Code Learning , 2017, ICLR.

[36]  Zi Huang,et al.  Rethinking the Item Order in Session-based Recommendation with Graph Neural Networks , 2019, CIKM.

[37]  Tong Chen,et al.  Exploiting Positional Information for Session-Based Recommendation , 2021, ACM Trans. Inf. Syst..

[38]  Zi Huang,et al.  Discrete Deep Learning for Fast Content-Aware Recommendation , 2018, WSDM.

[39]  Quoc Viet Hung Nguyen,et al.  Where to Go Next: Modeling Long- and Short-Term User Preferences for Point-of-Interest Recommendation , 2020, AAAI.

[40]  Zi Huang,et al.  Next Point-of-Interest Recommendation on Resource-Constrained Mobile Devices , 2020, WWW.

[41]  Chen Gao,et al.  Learnable Embedding Sizes for Recommender Systems , 2021, ICLR.

[42]  Peng Jiang,et al.  BERT4Rec: Sequential Recommendation with Bidirectional Encoder Representations from Transformer , 2019, CIKM.

[43]  Rui Yan,et al.  AIR: Attentional Intention-Aware Recommender Systems , 2019, 2019 IEEE 35th International Conference on Data Engineering (ICDE).

[44]  Yongfeng Zhang,et al.  Sequential Recommendation with User Memory Networks , 2018, WSDM.

[45]  Lars Schmidt-Thieme,et al.  Factorizing personalized Markov chains for next-basket recommendation , 2010, WWW '10.

[46]  SDM , 2019, Proceedings of the 28th ACM International Conference on Information and Knowledge Management.

[47]  Wenwu Ou,et al.  BERT4Rec , 2019, Proceedings of the 28th ACM International Conference on Information and Knowledge Management.

[48]  Zi Huang,et al.  Aggregation-Based Graph Convolutional Hashing for Unsupervised Cross-Modal Retrieval , 2021, IEEE Transactions on Multimedia.

[49]  Yann Dauphin,et al.  Pay Less Attention with Lightweight and Dynamic Convolutions , 2019, ICLR.

[50]  Wen-Chih Peng,et al.  Sequence-Aware Factorization Machines for Temporal Predictive Analytics , 2019, 2020 IEEE 36th International Conference on Data Engineering (ICDE).

[51]  Kenji Doya,et al.  Sigmoid-Weighted Linear Units for Neural Network Function Approximation in Reinforcement Learning , 2017, Neural Networks.

[52]  Yang Wang,et al.  Learning Elastic Embeddings for Customizing On-Device Recommenders , 2021, KDD.

[53]  Julian J. McAuley,et al.  Self-Attentive Sequential Recommendation , 2018, 2018 IEEE International Conference on Data Mining (ICDM).

[54]  Joemon M. Jose,et al.  Self-Supervised Reinforcement Learning for Recommender Systems , 2020, SIGIR.

[55]  Alexandros Karatzoglou,et al.  Session-based Recommendations with Recurrent Neural Networks , 2015, ICLR.