Herding Effect Based Attention for Personalized Time-Sync Video Recommendation

Time-sync comment (TSC) is a new form of user-interaction review associated with real-time video contents, which contains a user's preferences for videos and therefore well suited as the data source for video recommendations. However, existing review-based recommendation methods ignore the context-dependent (generated by user-interaction), real-time, and time-sensitive properties of TSC data. To bridge the above gaps, in this paper, we use video images and users' TSCs to design an Image-Text Fusion model with a novel Herding Effect Attention mechanism (called ITF-HEA), which can predict users' favorite videos with model-based collaborative filtering. Specifically, in the HEA mechanism, we weight the context information based on the semantic similarities and time intervals between each TSC and its context, thereby considering influences of the herding effect in the model. Experiments show that ITF-HEA is on average 3.78% higher than the state-of-the-art method upon F1-score in baselines.

[1]  Kuldip K. Paliwal,et al.  Bidirectional recurrent neural networks , 1997, IEEE Trans. Signal Process..

[2]  Deng Cai,et al.  Translating Embeddings for Knowledge Graph Completion with Relation Attention Mechanism , 2018, IJCAI.

[3]  Xanadu Halkias,et al.  Tradeoff Between Distributed Social Learning and Herding Effect in Online Rating Systems , 2017 .

[4]  Weijia Jia,et al.  Crowdsourced time-sync video tagging using semantic association graph , 2017, 2017 IEEE International Conference on Multimedia and Expo (ICME).

[5]  Lukasz Kaiser,et al.  Attention is All you Need , 2017, NIPS.

[6]  Qing Ping Video recommendation using crowdsourced time-sync comments , 2018, RecSys.

[7]  Weijia Jia,et al.  Neural Relation Extraction via Inner-Sentence Noise Reduction and Transfer Learning , 2018, EMNLP.

[8]  Julian J. McAuley,et al.  VBPR: Visual Bayesian Personalized Ranking from Implicit Feedback , 2015, AAAI.

[9]  Jure Leskovec,et al.  Hidden factors and hidden topics: understanding rating dimensions with review text , 2013, RecSys.

[10]  Michael I. Jordan,et al.  Latent Dirichlet Allocation , 2001, J. Mach. Learn. Res..

[11]  Qiang Yang,et al.  Crowdsourced time-sync video tagging using temporal and personalized topic modeling , 2014, KDD.

[12]  Chenxi Zhang,et al.  TSCSet: A Crowdsourced Time-Sync Comment Dataset for Exploration of User Experience Improvement , 2018, IUI.

[13]  Le Wu,et al.  Predicting the Popularity of DanMu-enabled Videos: A Multi-factor View , 2016, DASFAA.

[14]  Yongfeng Zhang,et al.  Personalized Key Frame Recommendation , 2017, SIGIR.

[15]  Changsheng Xu,et al.  A Unified Personalized Video Recommendation via Dynamic Recurrent Neural Networks , 2017, ACM Multimedia.

[16]  Xiangnan He,et al.  Attentive Collaborative Filtering: Multimedia Recommendation with Item- and Component-Level Attention , 2017, SIGIR.

[17]  Yi Zheng,et al.  Reading the Videos: Temporal Labeling for Crowdsourced Time-Sync Videos Based on Semantic Embedding , 2016, AAAI.

[18]  Dietmar Jannach,et al.  When Recurrent Neural Networks meet the Neighborhood for Session-Based Recommendation , 2017, RecSys.

[19]  Tao Mei,et al.  Contextual Video Recommendation by Multimodal Relevance and User Feedback , 2011, TOIS.

[20]  Tiejun Zhao,et al.  Attention-Fused Deep Matching Network for Natural Language Inference , 2018, IJCAI.

[21]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[22]  Bing Liu,et al.  Aspect Based Recommendations: Recommending Items with the Most Valuable Aspects Based on User Reviews , 2017, KDD.

[23]  Ruslan Salakhutdinov,et al.  Multimodal Neural Language Models , 2014, ICML.

[24]  Sepp Hochreiter,et al.  Fast and Accurate Deep Network Learning by Exponential Linear Units (ELUs) , 2015, ICLR.

[25]  Alexander J. Smola,et al.  Jointly modeling aspects, ratings and sentiments for movie recommendation (JMARS) , 2014, KDD.

[26]  Dilruk Perera,et al.  Exploring the use of Time-Dependent Cross-Network Information for Personalized Recommendations , 2017, ACM Multimedia.

[27]  Mihai Surdeanu,et al.  The Stanford CoreNLP Natural Language Processing Toolkit , 2014, ACL.

[28]  Jing Huang,et al.  Interpretable Convolutional Neural Networks with Dual Local and Global Attention for Review Rating Prediction , 2017, RecSys.