Music Mood Classification via Deep Belief Network

In this paper we present a study of music mood classification using only lyric information. For Chinese songs in particular, Chinese word segmentation introduces intolerable errors and leads to inadequate use of the lyric information. We propose using bag-of-character features instead of bag-of-word features, which avoids the word-segmentation errors that degrade classification accuracy. Furthermore, a deep belief network (DBN), which learns useful features automatically, can make fuller use of the lyric information. In experiments on a collection of 1,200 songs, we demonstrate that the high-level features learned by the DBN, used jointly with bag-of-character features, perform much better than traditional features in music mood classification.
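As an illustration of the bag-of-character idea described above, the following minimal Python sketch counts individual characters instead of segmented words, so no word segmenter is needed. The function names (`bag_of_characters`, `to_vector`) and the toy lyric strings are hypothetical and chosen only for this example; the paper's actual feature pipeline and the DBN training stage are not shown.

```python
from collections import Counter


def bag_of_characters(lyrics: str) -> Counter:
    """Count each character in the lyrics, skipping whitespace and punctuation."""
    return Counter(ch for ch in lyrics if ch.isalnum())


def to_vector(counts: Counter, vocabulary: list) -> list:
    """Map character counts onto a fixed vocabulary order."""
    return [counts.get(ch, 0) for ch in vocabulary]


# Toy lyric lines (assumed data, not taken from the paper's corpus).
lyrics = ["我爱你 我想你", "你快乐，我快乐"]

bags = [bag_of_characters(text) for text in lyrics]

# The vocabulary is the sorted set of all characters seen in the corpus.
vocabulary = sorted(set().union(*bags))

# Each song becomes a fixed-length count vector that a classifier
# (or a feature learner such as a DBN) could take as input.
vectors = [to_vector(bag, vocabulary) for bag in bags]

for text, vec in zip(lyrics, vectors):
    print(text, vec)
```

Because every character is its own token, the representation does not depend on a segmenter's decisions, at the cost of losing explicit word boundaries.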
