Semantic audio content-based music recommendation and visualization based on user preference examples

Preference elicitation is a challenging fundamental problem when designing recommender systems. In the present work we propose a content-based technique to automatically generate a semantic representation of the user's musical preferences directly from audio. Starting from an explicit set of music tracks provided by the user as evidence of his/her preferences, we infer high-level semantic descriptors for each track obtaining a user model. To prove the benefits of our proposal, we present two applications of our technique. In the first one, we consider three approaches to music recommendation, two of them based on a semantic music similarity measure, and one based on a semantic probabilistic model. In the second application, we address the visualization of the user's musical preferences by creating a humanoid cartoon-like character - the Musical Avatar - automatically inferred from the semantic representation. We conducted a preliminary evaluation of the proposed technique in the context of these applications with 12 subjects. The results are promising: the recommendations were positively evaluated and close to those coming from state-of-the-art metadata-based systems, and the subjects judged the generated visualizations to capture their core preferences. Finally, we highlight the advantages of the proposed semantic user model for enhancing the user interfaces of information filtering systems.

[1]  Joan Serrà,et al.  From Low-Level to High-Level: Comparative Study of Music Similarity Measures , 2009, 2009 11th IEEE International Symposium on Multimedia.

[2]  A. D. Manning,et al.  Understanding Comics: The Invisible Art , 1993 .

[3]  Beth Logan,et al.  Music Recommendation from Song Sets , 2004, ISMIR.

[4]  Masataka Goto,et al.  Hybrid Collaborative and Content-based Music Recommendation Using Probabilistic Model with Latent User Preferences , 2006, ISMIR.

[5]  Paul Lamere,et al.  A Model-Based Approach to Constructing Music Similarity Functions , 2007, EURASIP J. Adv. Signal Process..

[6]  Nasser M. Nasrabadi,et al.  Pattern Recognition and Machine Learning , 2006, Technometrics.

[7]  Alexander J. Smola,et al.  Advances in Large Margin Classifiers , 2000 .

[8]  Gert R. G. Lanckriet,et al.  SEMANTIC SIMILARITY FOR MUSIC RETRIEVAL , 2007 .

[9]  Wolfgang Nejdl,et al.  The Benefit of Using Tag-Based Profiles , 2007 .

[10]  Donna Harman,et al.  Information Processing and Management , 2022 .

[11]  Casey Reas,et al.  Processing: a programming handbook for visual designers and artists , 2007 .

[12]  Ferdinand Fuhrmann,et al.  Content-based music recommendation based on user preference examples , 2010, RecSys 2010.

[13]  Jukka Holm,et al.  Associating Colours with Musical Genres , 2009 .

[14]  Gert R. G. Lanckriet,et al.  Smarter than Genius? Human Evaluation of Music Recommender Systems , 2009, ISMIR.

[15]  Russell Beale,et al.  Music organisation using colour synaesthesia , 2007, CHI Extended Abstracts.

[16]  Nuria Oliver,et al.  I Like It... I Like It Not: Evaluating User Ratings Noise in Recommender Systems , 2009, UMAP.

[17]  Keiichiro Hoashi,et al.  Personalization of user profiles for content-based music retrieval based on relevance feedback , 2003, ACM Multimedia.

[18]  Klaas Bosteels,et al.  Music Recommendation and the Long Tail , 2010 .

[19]  Sung-Hyon Myaeng,et al.  A Probabilistic Model for Music Recommendation Considering Audio Features , 2005, AIRS.

[20]  J. W. Minett,et al.  Language, Evolution, and the Brain , 2009 .

[21]  Kate Ehrlich,et al.  Pointing the way: active collaborative filtering , 1995, CHI '95.

[22]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[23]  Emilia Gómez,et al.  Comparative Analysis of Music Recordings from Western and Non-Western traditions by Automatic Tonal Feature Extraction , 2008 .

[24]  Daniel P. W. Ellis,et al.  Song-Level Features and Support Vector Machines for Music Classification , 2005, ISMIR.

[25]  Sebastian Streich,et al.  Music complexity: a multi-faceted description of audio content , 2007 .

[26]  Marc Leman,et al.  Content-Based Music Information Retrieval: Current Directions and Future Challenges , 2008, Proceedings of the IEEE.

[27]  John Riedl,et al.  Is seeing believing?: how recommender system interfaces affect users' opinions , 2003, CHI '03.

[28]  Judith Masthoff,et al.  Effective explanations of recommendations: user-centered design , 2007, RecSys '07.

[29]  George Tzanetakis,et al.  Musical genre classification of audio signals , 2002, IEEE Trans. Speech Audio Process..

[30]  Nicu Sebe,et al.  Special section from the ACM multimedia conference 2007 , 2008, TOMCCAP.

[31]  Elias Pampalk,et al.  Computational Models of Music Similarity and their Application in Music Information Retrieval , 2006 .

[32]  Peretz Shoval,et al.  Information Filtering: Overview of Issues, Research and Systems , 2001, User Modeling and User-Adapted Interaction.

[33]  XambóAnna,et al.  Semantic audio content-based music recommendation and visualization based on user preference examples , 2013 .

[34]  Mark A. Hall,et al.  Correlation-based Feature Selection for Discrete and Numeric Class Machine Learning , 1999, ICML.

[35]  Vincent S. Tseng,et al.  A novel music recommender by discovering preferable perceptual-patterns from music pieces , 2010, SAC '10.

[36]  Òscar Celma,et al.  A new approach to evaluating novel recommendations , 2008, RecSys '08.

[37]  G. Grimmett,et al.  Probability and random processes , 2002 .

[38]  Alberto Leon-Garcia,et al.  Probability and Random Processes For EE's (3rd Edition) , 2007 .

[39]  Eric Petajan,et al.  MPEG-4 Face and Body Animation Coding Applied to HCI , 2005 .

[40]  Emilia Gómez Gutiérrez,et al.  Tonal description of music audio signals , 2006 .

[41]  Yee-Hong Yang,et al.  Music-driven character animation , 2009, TOMCCAP.

[42]  Òscar Celma,et al.  Foafing the Music: Bridging the Semantic Gap in Music Recommendation , 2006, SEMWEB.

[43]  Hans-Peter Seidel,et al.  Automatic generation of personalized human avatars from multi-view video , 2005, VRST '05.

[44]  Gerard Salton,et al.  A vector space model for automatic indexing , 1975, CACM.

[45]  Xavier Serra,et al.  Unifying Low-Level and High-Level Music Similarity Measures , 2011, IEEE Transactions on Multimedia.

[46]  Paul M. Brossier,et al.  Automatic annotation of musical audio for interactive applications , 2006 .

[47]  John Platt,et al.  Probabilistic Outputs for Support vector Machines and Comparisons to Regularized Likelihood Methods , 1999 .

[48]  Emilia Gómez,et al.  The Musical Avatar: a visualization of musical preferences by means of audio content description , 2010, Audio Mostly Conference.

[49]  Peter Knees,et al.  On Rhythm and General Music Similarity , 2009, ISMIR.

[50]  Vincent S. Tseng,et al.  A novel method for personalized music recommendation , 2009, Expert Syst. Appl..

[51]  Linas Baltrunas,et al.  Towards Time-Dependant Recommendation based on Implicit Feedback , 2009 .

[52]  Mert Bay,et al.  The Music Information Retrieval Evaluation eXchange: Some Observations and Insights , 2010, Advances in Music Information Retrieval.

[53]  Joan Serrà,et al.  Music Mood Annotator Design and Integration , 2009, 2009 Seventh International Workshop on Content-Based Multimedia Indexing.

[54]  H. Yoshida Tokyo, Japan , 2019, The Statesman’s Yearbook Companion.

[55]  Sung-Hyon Myaeng,et al.  A probabilistic music recommender considering user opinions and audio features , 2007, Inf. Process. Manag..

[56]  Xavier Serra,et al.  Bridging the Music Semantic Gap , 2006 .

[57]  Martin Szomszor,et al.  Comparison of implicit and explicit feedback from an online music recommendation service , 2010, HetRec '10.

[58]  John Riedl,et al.  Item-based collaborative filtering recommendation algorithms , 2001, WWW '01.

[59]  Fabio Vignoli,et al.  A Music Retrieval System Based on User Driven Similarity and Its Evaluation , 2005, ISMIR.

[60]  Xavier Serra,et al.  ISMIR 2004 Audio Description Contest , 2006 .

[61]  Yannis Manolopoulos,et al.  Music search engines: Specifications and challenges , 2009, Inf. Process. Manag..

[62]  Katharina Morik,et al.  A Benchmark Dataset for Audio Classification and Clustering , 2005, ISMIR.

[63]  Pearl Pu,et al.  User Technology Adoption Issues in Recommender Systems , 2007 .

[64]  George A. Tsihrintzis,et al.  MUSIPER: a system for modeling music similarity perception based on objective feature subset selection , 2008, User Modeling and User-Adapted Interaction.

[65]  Òscar Celma Herrada Music recommendation and discovery in the long tail , 2009 .

[66]  Peter Knees,et al.  A music information system automatically generated via Web content mining techniques , 2011, Inf. Process. Manag..

[67]  Lora Aroyo,et al.  The effects of transparency on trust in and acceptance of a content-based art recommender , 2008, User Modeling and User-Adapted Interaction.

[68]  Padraig Cunningham,et al.  Experimenting with music taste prediction by user profiling , 2004, MIR '04.

[69]  Pattie Maes,et al.  Social information filtering: algorithms for automating “word of mouth” , 1995, CHI '95.

[70]  Mokhtar Abdullah,et al.  On a Robust Correlation Coefficient , 1990 .

[71]  M. Maffesoli The Time of the Tribes: The Decline of Individualism in Mass Society , 1995 .