Enhancing Music Recommendation with Social Media Content: an Attentive Multimodal Autoencoder Approach

Music recommendation methods predict users’ music preference primarily based on historical ratings. Meanwhile, manifold personal factors of users are also important for the problem, and research efforts have been made to improve the recommendation performance with auxiliary user information. As an important indicator of users’ personal traits and states, the numerous social media content (e.g., texts, images and short videos), however, is still hardly exploited. In this work, we systematically study the utilization of multimodal social media content for music recommendation. We define groups of both targeted handcrafted features and generic deep features for each modality, and further propose an Attentive Multimodal Autoencoder approach (AMAE) to learn cross-modal latent representations from the extracted features. Attention mechanism is also employed to integrate users’ global and contextual music preference with alterable weights. Experiments demonstrate remarkable improvement of recommendation performance (+2.40% in Hit Ratio and +3.30% in NDCG), manifesting the effectiveness of our AMAE approach, as well as the significance of incorporating social media content data in music recommendation.

[1]  Yong Liu,et al.  Your neighbors affect your ratings: on geographical neighborhood influence to rating prediction , 2014, SIGIR.

[2]  Qiong Wu,et al.  Information Cascades over Diffusion-Restricted Social Network: A Data-Driven Analysis , 2019, IEEE INFOCOM 2019 - IEEE Conference on Computer Communications Workshops (INFOCOM WKSHPS).

[3]  Nadia Bianchi-Berthouze,et al.  K-DIME: An Affective Image Filtering System , 2003, IEEE Multim..

[4]  Markus Schedl,et al.  The Million Musical Tweet Dataset - What We Can Learn From Microblogs , 2013, ISMIR.

[5]  Jie Zhang,et al.  TopicMF: Simultaneously Exploiting Ratings and Reviews for Recommendation , 2014, AAAI.

[6]  Allan Hanbury,et al.  Affective image classification using features inspired by psychology and art theory , 2010, ACM Multimedia.

[7]  Tat-Seng Chua,et al.  Attentional Factorization Machines: Learning the Weight of Feature Interactions via Attention Networks , 2017, IJCAI.

[8]  He Li,et al.  Developing Simplified Chinese Psychological Linguistic Analysis Dictionary for Microblog , 2013, Brain and Health Informatics.

[9]  Adrian C. North,et al.  Why do we listen to music? A uses and gratifications analysis. , 2011, British journal of psychology.

[10]  Ruslan Salakhutdinov,et al.  Probabilistic Matrix Factorization , 2007, NIPS.

[11]  Yiqun Liu,et al.  Your Tweets Reveal What You Like: Introducing Cross-media Content Information into Multi-domain Recommendation , 2018, IJCAI.

[12]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[13]  Wenwu Zhu,et al.  Structural Deep Network Embedding , 2016, KDD.

[14]  Huanbo Luan,et al.  Cross-Domain Depression Detection via Harvesting Social Media , 2018, IJCAI.

[15]  Guorui Zhou,et al.  Deep Interest Network for Click-Through Rate Prediction , 2017, KDD.

[16]  Shengming Jiang,et al.  Image Retrieval by Emotional Semantics: A Study of Emotional Space and Feature Extraction , 2006, SMC.

[17]  Tao Chen,et al.  TriRank: Review-aware Explainable Recommendation by Modeling Aspects , 2015, CIKM.

[18]  Bo Chen,et al.  PEIA: Personality and Emotion Integrated Attentive Model for Music Recommendation on Social Media Platforms , 2020, AAAI.

[19]  Xiangnan He,et al.  NAIS: Neural Attentive Item Similarity Model for Recommendation , 2018, IEEE Transactions on Knowledge and Data Engineering.

[20]  David S. Rosenblum,et al.  Context-aware mobile music recommendation for daily activities , 2012, ACM Multimedia.

[21]  Bruce Ferwerda,et al.  Personality & Emotional States: Understanding Users' Music Listening Needs , 2015, UMAP Workshops.

[22]  Jie Tang,et al.  Modeling Emotion Influence in Image Social Networks , 2015, IEEE Transactions on Affective Computing.

[23]  Adam D. I. Kramer The spread of emotion via facebook , 2012, CHI.

[24]  Chun Chen,et al.  Music recommendation by unified hypergraph: combining social media information and music content , 2010, ACM Multimedia.

[25]  John Riedl,et al.  Item-based collaborative filtering recommendation algorithms , 2001, WWW '01.

[26]  Guandong Xu,et al.  Exploring user emotion in microblogs for music recommendation , 2015, Expert Syst. Appl..

[27]  Yehuda Koren,et al.  Matrix Factorization Techniques for Recommender Systems , 2009, Computer.

[28]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[29]  Adrian C. North,et al.  The importance of music to adolescents. , 2000, The British journal of educational psychology.

[30]  Tat-Seng Chua,et al.  Fast Matrix Factorization for Online Recommendation with Implicit Feedback , 2016, SIGIR.

[31]  Jialie Shen,et al.  On Effective Location-Aware Music Recommendation , 2016, ACM Trans. Inf. Syst..

[32]  Heng Huang,et al.  Deep Attributed Network Embedding , 2018, IJCAI.

[33]  Jennifer Golbeck,et al.  Predicting personality with social media , 2011, CHI Extended Abstracts.

[34]  Sanjeev Arora,et al.  A Simple but Tough-to-Beat Baseline for Sentence Embeddings , 2017, ICLR.

[35]  Mohan S. Kankanhalli,et al.  Exploring User-Specific Information in Music Retrieval , 2017, SIGIR.

[36]  Shujian Huang,et al.  Deep Matrix Factorization Models for Recommender Systems , 2017, IJCAI.

[37]  Jeffrey Dean,et al.  Efficient Estimation of Word Representations in Vector Space , 2013, ICLR.

[38]  Tat-Seng Chua,et al.  Neural Collaborative Filtering , 2017, WWW.