Cochleogram-based approach for detecting perceived emotions in music

Abstract: Identifying the perceived emotional content of music is an important part of easy and efficient search, retrieval, and management of music media. One of the most promising use cases of music organization is the emotion-based playlist, where automatic music emotion recognition plays a significant role by providing emotion-related information that is otherwise generally unavailable. Given the importance of the auditory system in emotion recognition and processing, in this study we propose a new cochleogram-based system for detecting affective musical content. To simulate the response of the human auditory periphery, the music audio signal is processed by a detailed biophysical cochlear model, yielding an output that closely matches the characteristics of human hearing. Cochleogram images are constructed directly from the response of the basilar membrane, and a convolutional neural network (CNN) is used to extract the relevant music features from these images. To assess the practical suitability of the proposed approach for integration into digital music libraries, we conducted an extensive study of its predictive performance on different aspects of music emotion recognition. The approach was evaluated on the publicly available 1000 songs database, and the experimental results showed that it outperformed common musical features (such as tempo, mode, pitch, clarity, and perceptually motivated mel-frequency cepstral coefficients (MFCC)) as well as the official "MediaEval" challenge results on the same reference database. Our findings show that the proposed approach can lead to better music emotion recognition performance and can be used as part of a state-of-the-art music information retrieval system.
