Recognizing Musical Entities in User-generated Content

Recognizing Musical Entities is important for Music Information Retrieval (MIR) since it canimprove the performance of several tasks such as music recommendation, genre classification or artist similarity. However, most entity recognition systems in the music domain have concentrated on formal texts (e.g. artists’ biographies, encyclopedic articles, etc.), ignoring rich and noisy user-generated content. In this work, we present a novel method to recognize musicalentities in Twitter content generated by users following a classical music radio channel. Our approach takes advantage of both formal radio schedule and users’ tweets to improve entity recognition. We instantiate several machine learning algorithms to perform entity recognition combining task-specific and corpus-based features. We also show how to improve recognition results by jointly considering formal and user-generated content.

[1]  Maurice van Keulen,et al.  Information Extraction for Social Media , 2014, SWAIE@COLING.

[2]  Barbara Di Eugenio,et al.  Generating Fine-Grained Reviews of Songs from Album Reviews , 2010, ACL.

[3]  Oren Etzioni,et al.  Named Entity Recognition in Tweets: An Experimental Study , 2011, EMNLP.

[4]  Xavier Serra,et al.  Sound and Music Recommendation with Knowledge Graphs , 2016, ACM Trans. Intell. Syst. Technol..

[5]  Meinard Müller,et al.  Fundamentals of Music Processing , 2015, Springer International Publishing.

[6]  Eva Zangerle,et al.  Exploiting Twitter's Collective Knowledge for Music Recommendations , 2012, #MSM.

[7]  Markus Schedl,et al.  Mining microblogs to infer music artist similarity and cultural listening patterns , 2012, WWW.

[8]  Jeffrey Pennington,et al.  GloVe: Global Vectors for Word Representation , 2014, EMNLP.

[9]  Roberto Navigli,et al.  Entity Linking meets Word Sense Disambiguation: a Unified Approach , 2014, TACL.

[10]  Christian Bizer,et al.  DBpedia spotlight: shedding light on the web of documents , 2011, I-Semantics '11.

[11]  Xavier Serra,et al.  MEL: a music entity linking system , 2017 .

[12]  Zhen Liu,et al.  A Hybrid Approach for Chinese Named Entity Recognition in Music Domain , 2009, 2009 Eighth IEEE International Conference on Dependable, Autonomic and Secure Computing.

[13]  Xavier Serra,et al.  ELMD: An Automatically Generated Entity Linking Gold Standard Dataset in the Music Domain , 2016, LREC.

[14]  Meinard Mller,et al.  Fundamentals of Music Processing: Audio, Analysis, Algorithms, Applications , 2015 .

[15]  Guillaume Lample,et al.  Neural Architectures for Named Entity Recognition , 2016, NAACL.

[16]  Eva Zangerle,et al.  #nowplaying Music Dataset: Extracting Listening Behavior from Twitter , 2014, WISMM '14.

[17]  John C. Platt,et al.  Fast training of support vector machines using sequential minimal optimization, advances in kernel methods , 1999 .

[18]  Iryna Gurevych,et al.  Reporting Score Distributions Makes a Difference: Performance Study of LSTM-networks for Sequence Tagging , 2017, EMNLP.

[19]  Markus Schedl,et al.  The Million Musical Tweet Dataset - What We Can Learn From Microblogs , 2013, ISMIR.

[20]  Paolo Ferragina,et al.  Fast and Accurate Annotation of Short Texts with Wikipedia Pages , 2010, IEEE Software.

[21]  Raphaël Troncy,et al.  Analysis of named entity recognition and linking for tweets , 2014, Inf. Process. Manag..

[22]  Lorenzo Porcaro Information extraction from user-generated content in the classical music domain , 2018 .

[23]  Satoshi Sekine,et al.  A survey of named entity recognition and classification , 2007 .

[24]  Kenny Q. Zhu,et al.  Multi-channel BiLSTM-CRF Model for Emerging Named Entity Recognition in Social Media , 2017, NUT@EMNLP.

[25]  Mitchell P. Marcus,et al.  Text Chunking using Transformation-Based Learning , 1995, VLC@ACL.

[26]  Xavier Serra,et al.  Exploring Customer Reviews for Music Genre Classification and Evolutionary Studies , 2016, ISMIR.