Music Information Retrieval: Recent Developments and Applications

We provide a survey of the field of Music Information Retrieval (MIR), in particular paying attention to latest developments, such as semantic auto-tagging and user-centric retrieval and recommendation approaches. We first elaborate on well-established and proven methods for feature extraction and music indexing, from both the audio signal and contextual data sources about music items, such as web pages or collaborative tags. These in turn enable a wide variety of music retrieval tasks, such as semantic music search or music identification ("query by example"). Subsequently, we review current work on user analysis and modeling in the context of music recommendation and retrieval, addressing the recent trend towards user-centric and adaptive approaches and systems. A discussion follows about the important aspect of how various MIR approaches to different problems are evaluated and compared. Eventually, a discussion about the major open challenges concludes the survey.

[1]  H. Sebastian Seung,et al.  Learning the parts of objects by non-negative matrix factorization , 1999, Nature.

[2]  David Wessel,et al.  Timbre Space as a Musical Control Structure , 1979 .

[3]  Anssi Klapuri,et al.  Automatic Classification of Pitched Musical Instrument Sounds , 2006 .

[4]  Òscar Celma,et al.  QueryBag: Using Different Sources For Querying Large Music Collections , 2009 .

[5]  Peter Knees,et al.  Towards Semantic Music Information Extraction from the Web Using Rule Patterns and Supervised Learning , 2011 .

[6]  Peter Knees,et al.  DRAFT : A REFINED BLOCK-LEVEL FEATURE SET FOR CLASSIFICATION , SIMILARITY AND TAG PREDICTION , 2011 .

[7]  Mark Sanderson,et al.  Test Collection Based Evaluation of Information Retrieval Systems , 2010, Found. Trends Inf. Retr..

[8]  Matthias Jarke,et al.  Adaptive Multimodal Exploration of Music Collections , 2009, ISMIR.

[9]  Peter Knees,et al.  “Reinventing the Wheel”: A Novel Approach to Music Player Interfaces , 2007, IEEE Transactions on Multimedia.

[10]  Gert R. G. Lanckriet,et al.  A Game-Based Approach for Collecting Semantic Annotations of Music , 2007, ISMIR.

[11]  Òscar Celma,et al.  Music Recommendation and Discovery - The Long Tail, Long Fail, and Long Play in the Digital Music Space , 2010 .

[12]  Gerhard Widmer,et al.  On the Importance of "Real" Audio Data for MIR Algorithm Evaluation at the Note-Level - A Comparative Study , 2011, ISMIR.

[13]  L. Auger The Journal of the Acoustical Society of America , 1949 .

[14]  Yuval Shavitt,et al.  Song Ranking based on Piracy in Peer-to-Peer Networks , 2009, ISMIR.

[15]  Òscar Celma,et al.  Search Sounds: An audio crawler focused on weblogs , 2006, ISMIR.

[16]  David G. Lowe,et al.  Object recognition from local scale-invariant features , 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[17]  Jin Ha Lee,et al.  Crowdsourcing Music Similarity Judgments using Mechanical Turk , 2010, ISMIR.

[18]  Anssi Klapuri,et al.  Auditory-Model Based Methods for Multiple Fundamental Frequency Estimation , 2006 .

[19]  Xavier Rodet,et al.  Toward Automatic Music Audio Summary Generation from Signal Analysis , 2002, ISMIR.

[20]  Matthew E. P. Davies,et al.  Selective Sampling for Beat Tracking Evaluation , 2012, IEEE Transactions on Audio, Speech, and Language Processing.

[21]  Peretz Shoval,et al.  Information Filtering: Overview of Issues, Research and Systems , 2001, User Modeling and User-Adapted Interaction.

[22]  Oliver Hummel,et al.  Using cultural metadata for artist recommendations , 2003, Proceedings Third International Conference on WEB Delivering of Music.

[23]  Thierry Bertin-Mahieux,et al.  The million song dataset challenge , 2012, WWW.

[24]  Peter Knees,et al.  Artist Classification with Web-Based Data , 2004, ISMIR.

[25]  Daniel Wolff,et al.  Adapting Metrics for Music Similarity Using Comparative Ratings , 2011, ISMIR.

[26]  Masataka Goto,et al.  MusicRainbow: A New User Interface to Discover Artists Using Audio-based Similarity and Web-based Labeling , 2006, ISMIR.

[27]  Nicola Orio,et al.  A professionally annotated and enriched multimodal data set on popular music , 2013, MMSys.

[28]  B. Ong Structural analysis and segmentation of music signals , 2007 .

[29]  Paul Over,et al.  Evaluation campaigns and TRECVid , 2006, MIR '06.

[30]  J. Stephen Downie,et al.  Capturing the workflows of music information retrieval for repeatability and reuse , 2013, Journal of Intelligent Information Systems.

[31]  Daniele Quercia,et al.  Auralist: introducing serendipity into music recommendation , 2012, WSDM '12.

[32]  Peter Knees,et al.  One-touch access to music on mobile devices , 2007, MUM.

[33]  Gregory H. Wakefield,et al.  Mathematical representation of joint time-chroma distributions , 1999, Optics & Photonics.

[34]  Markus Koppenberger,et al.  MTG-DB: a repository for music audio processing , 2004, Proceedings of the Fourth International Conference onWeb Delivering of Music, 2004. EDELMUSIC 2004..

[35]  Edith Law,et al.  Input-agreement: a new mechanism for collecting data using human computation games , 2009, CHI.

[36]  Geoffroy Peeters,et al.  Large-Scale Study of Chord Estimation Algorithms Based on Chroma Representation and HMM , 2007, 2007 International Workshop on Content-Based Multimedia Indexing.

[37]  George Tzanetakis,et al.  Visualization in Audio-Based Music Information Retrieval , 2006, Computer Music Journal.

[38]  François Pachet,et al.  Hit Song Science Is Not Yet a Science , 2008, ISMIR.

[39]  Ricardo Baeza-Yates,et al.  Modern Information Retrieval - the concepts and technology behind search, Second edition , 2011 .

[40]  Jonathan Foote,et al.  Audio Retrieval by Rhythmic Similarity , 2002, ISMIR.

[41]  E. Rasmussen Evaluation in Information Retrieval , 2002 .

[42]  Jonathan Foote,et al.  Automatic Music Summarization via Similarity Analysis , 2002, ISMIR.

[43]  Yves Grenier,et al.  Template-based Chord Recognition : Influence of the Chord Types , 2009, ISMIR.

[44]  Ronald W. Schafer,et al.  Introduction to Digital Speech Processing , 2007, Found. Trends Signal Process..

[45]  Christian Wartena Comparing segmentation strategies for efficient video passage retrieval , 2012, 2012 10th International Workshop on Content-Based Multimedia Indexing (CBMI).

[46]  Daniel P. W. Ellis,et al.  Toward Evaluation Techniques for Music Similarity , 2003, SIGIR 2003.

[47]  Peter Knees,et al.  Building an Interactive Next-Generation Artist Recommender Based on Automatically Derived High-Level Concepts , 2007, 2007 International Workshop on Content-Based Multimedia Indexing.

[48]  Patrick Seemann,et al.  Matrix Factorization Techniques for Recommender Systems , 2014 .

[49]  Markus Schedl,et al.  Local and global scaling reduce hubs in space , 2012, J. Mach. Learn. Res..

[50]  John Riedl,et al.  Item-based collaborative filtering recommendation algorithms , 2001, WWW '01.

[51]  Tim Pohle,et al.  The ISMIR Cloud: A Decade of ISMIR Conferences at Your Fingertips , 2009, ISMIR.

[52]  Elias Pampalk,et al.  Audio-Based Music Similarity and Retrieval : Combining a Spectral Similarity Model with Information Extracted from Fluctuation Patterns , 2006 .

[53]  Fabio Vignoli,et al.  A Music Retrieval System Based on User Driven Similarity and Its Evaluation , 2005, ISMIR.

[54]  Tague-SutcliffeJean The pragmatics of information retrieval experimentation, revisited , 1992 .

[55]  Erik Duval,et al.  A Web-based Approach to Determine the Origin of an Artist , 2009, ISMIR.

[56]  John Ashley Burgoyne,et al.  On Comparative Statistics for Labelling Tasks: What can We Learn from MIREX ACE 2013? , 2014, ISMIR.

[57]  Steve Lawrence,et al.  Inferring Descriptions and Similarity for Music from Community Metadata , 2002, ICMC.

[58]  Alessandro L. Koerich,et al.  The Latin Music Database , 2008, ISMIR.

[59]  Mónica Marrero,et al.  Crowdsourcing Preference Judgments for Evaluation of Music Similarity Tasks , 2010 .

[60]  Eva Zangerle,et al.  Exploiting Twitter's Collective Knowledge for Music Recommendations , 2012, #MSM.

[61]  James Bennett,et al.  The Netflix Prize , 2007 .

[62]  Gerard Salton,et al.  The SMART Retrieval System—Experiments in Automatic Document Processing , 1971 .

[63]  Nuno Vasconcelos,et al.  Image indexing with mixture hierarchies , 2001, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001.

[64]  Meinard Müller,et al.  A Scape Plot Representation for Visualizing Repetitive Structures of Music Recordings , 2012, ISMIR.

[65]  Markus Koppenberger,et al.  The emergence of complex network patterns in music networks , 2004, ISMIR.

[66]  Alain de Cheveigné,et al.  Pitch perception models , 2005 .

[67]  Masataka Goto,et al.  RWC Music Database: Music genre database and musical instrument sound database , 2003, ISMIR.

[68]  Keiichiro Hoashi,et al.  Feature Analysis and Normalization Approach for Robust Content-Based Music Retrieval to Encoded Audio with Different Bit Rates , 2009, MMM.

[69]  W. Sethares Local consonance and the relationship between timbre and scale , 1993 .

[70]  Peter Knees,et al.  The Quest for Ground Truth in Musical Artist Tagging in the Social Web Era , 2007, ISMIR.

[71]  Markus Schedl,et al.  The Million Musical Tweet Dataset - What We Can Learn From Microblogs , 2013, ISMIR.

[72]  Peter Knees,et al.  What's Hot? Estimating Country-specific Artist Popularity , 2010, ISMIR.

[73]  J. Stephen Downie,et al.  How Significant is Statistically Significant? The case of Audio Music Similarity and Retrieval , 2012, ISMIR.

[74]  Yehuda Koren,et al.  The Yahoo! Music Dataset and KDD-Cup '11 , 2012, KDD Cup.

[75]  Òscar Celma,et al.  The Quest for Musical Genres: Do the Experts and the Wisdom of Crowds Agree? , 2008, ISMIR.

[76]  C. Krumhansl Cognitive Foundations of Musical Pitch , 1990 .

[77]  Bernd Ludwig,et al.  InCarMusic: Context-Aware Music Recommendations in a Car , 2011, EC-Web.

[78]  Anssi Klapuri,et al.  Multiple Fundamental Frequency Estimation by Summing Harmonic Amplitudes , 2006, ISMIR.

[79]  Masataka Goto,et al.  Grand Challenges in Music Information Research , 2012, Multimodal Music Processing.

[80]  Marc Leman Schema-based tone center recognition of musical signals , 1994 .

[81]  Julián Urbano,et al.  Current Challenges in the Evaluation of Predominant Melody Extraction Algorithms , 2012, ISMIR.

[82]  B. S. Manjunath,et al.  Introduction to MPEG-7: Multimedia Content Description Interface , 2002 .

[83]  David Temperley,et al.  What's Key for Key? The Krumhansl-Schmuckler Key-Finding Algorithm Reconsidered , 1999 .

[84]  Stefan Leitich,et al.  Globe of Music - Music Library Visualization Using Geosom , 2007, ISMIR.

[85]  Daniel P. W. Ellis,et al.  A Large-Scale Evaluation of Acoustic and Subjective Music-Similarity Measures , 2004, Computer Music Journal.

[86]  Simon Dixon,et al.  A Review of Automatic Rhythm Description Systems , 2005, Computer Music Journal.

[87]  Marc Leman,et al.  D-Jogger: Syncing Music with Walking , 2010 .

[88]  Carol Peters,et al.  Cross-Language Evaluation Forum: Objectives, Results, Achievements , 2004, Information Retrieval.

[89]  Riccardo Miotto,et al.  MusiCLEF: a Benchmark Activity in Multimodal Music Information Retrieval , 2011, ISMIR.

[90]  Gerhard Widmer,et al.  MATCH: A Music Alignment Tool Chest , 2005, ISMIR.

[91]  George Tzanetakis,et al.  Musical genre classification of audio signals , 2002, IEEE Trans. Speech Audio Process..

[92]  Juan Llorens Morillo,et al.  Improving the Generation of Ground Truths Based on Partially Ordered Lists , 2010, ISMIR.

[93]  Vladimir Vapnik,et al.  Statistical learning theory , 1998 .

[94]  Anssi Klapuri,et al.  Melody Description and Extraction in the Context of Music Content Processing , 2003 .

[95]  Peter Grosche,et al.  Audio Content-Based Music Retrieval , 2012, Multimodal Music Processing.

[96]  Anssi Klapuri,et al.  Signal Processing Methods for Music Transcription , 2006 .

[97]  Jean Tague-Sutcliffe,et al.  The Pragmatics of Information Retrieval Experimentation Revisited , 1997, Inf. Process. Manag..

[98]  Nicola Orio,et al.  MUSIC RETRIEVAL (Foundations and Trends(R) in Information Retrieval) , 2006 .

[99]  Emilia Gómez,et al.  Towards Computer-Assisted Flamenco Transcription: An Experimental Comparison of Automatic Transcription Algorithms as Applied to A Cappella Singing , 2013, Computer Music Journal.

[100]  Marc Leman,et al.  Content-Based Music Information Retrieval: Current Directions and Future Challenges , 2008, Proceedings of the IEEE.

[101]  Roger B. Dannenberg,et al.  TagATune: A Game for Music and Sound Annotation , 2007, ISMIR.

[102]  Markus Schedl,et al.  Three web-based heuristics to determine a person's or institution's country of origin , 2010, SIGIR '10.

[103]  Arthur Flexer,et al.  Statistical evaluation of music information retrieval experiments , 2006 .

[104]  W M Hartmann,et al.  Pitch, periodicity, and auditory organization. , 1996, The Journal of the Acoustical Society of America.

[105]  Laura A. Dabbish,et al.  Labeling images with a computer game , 2004, AAAI Spring Symposium: Knowledge Collection from Volunteer Contributors.

[106]  Gabriella Kazai INitiative for the Evaluation of XML Retrieval , 2009, Encyclopedia of Database Systems.

[107]  J. Stephen Downie,et al.  How People Describe Their Music Information Needs: A Grounded Theory Analysis Of Music Queries , 2003 .

[108]  Masataka Goto,et al.  Songrium: a music browsing assistance service based on visualization of massive open collaboration within music content creation community , 2013, OpenSym.

[109]  Ellen M. Voorhees,et al.  TREC: Experiment and Evaluation in Information Retrieval (Digital Libraries and Electronic Publishing) , 2005 .

[110]  Riccardo Miotto,et al.  Improving Auto-tagging by Modeling Semantic Co-occurrences , 2010, ISMIR.

[111]  J. Stephen Downie,et al.  The International Music Information Retrieval Systems Evaluation Laboratory: Governance, Access and Security , 2004, ISMIR.

[112]  Xavier Serra,et al.  Unifying Low-Level and High-Level Music Similarity Measures , 2011, IEEE Transactions on Multimedia.

[113]  David S. Rosenblum,et al.  Context-aware mobile music recommendation for daily activities , 2012, ACM Multimedia.

[114]  Mark B. Sandler,et al.  A Semantic Space for Music Derived from Social Tags , 2007, ISMIR.

[115]  Kaare Brandt Petersen,et al.  Mel Frequency Cepstral Coefficients: An Evaluation of Robustness of MP3 Encoded Music , 2006, ISMIR.

[116]  Sebastian Stober Adaptive methods for user-centered organization of music collections , 2011 .

[117]  Gabriella Kazai,et al.  Overview of the Initiative for the Evaluation of XML retrieval (INEX) 2002 , 2002, INEX Workshop.

[118]  Eleanor Selfridge-Field,et al.  Conceptual and representational issues in melodic comparison , 1998 .

[119]  Julián Urbano Merino,et al.  Evaluation in audio music similarity , 2013 .

[120]  Peter Knees,et al.  A music information system automatically generated via Web content mining techniques , 2011, Inf. Process. Manag..

[121]  Gerard Salton,et al.  A vector space model for automatic indexing , 1975, CACM.

[122]  Alain Bonardi IR for Contemporary Music: What the Musicologist Needs , 2000, ISMIR.

[123]  Elaine Chew,et al.  Visualizing Music: Tonal Progressions and Distributions , 2007, ISMIR.

[124]  Mark B. Sandler,et al.  A tutorial on onset detection in music signals , 2005, IEEE Transactions on Speech and Audio Processing.

[125]  Youngmoo E. Kim,et al.  MoodSwings: A Collaborative Game for Music Mood Label Collection , 2008, ISMIR.

[126]  Julián Urbano,et al.  Notes from the ISMIR 2012 late-breaking session on evaluation in music information retrieval , 2012, ISMIR 2012.

[127]  Bob L. Sturm Two systems for automatic music genre recognition: what are they really recognizing? , 2012, MIRUM '12.

[128]  Markus Schedl,et al.  #nowplaying Madonna: a large-scale evaluation on estimating similarities between music artists and between movies from microblogs , 2012, Information Retrieval.

[129]  Peter Knees,et al.  Searching for Music Using Natural Language Queries and Relevance Feedback , 2007, Adaptive Multimedia Retrieval.

[130]  Arthur Flexer,et al.  On Inter-rater Agreement in Audio Music Similarity , 2014, ISMIR.

[131]  Riccardo Miotto,et al.  A Probabilistic Model to Combine Tags and Acoustic Similarity for Music Retrieval , 2012, TOIS.

[132]  Katharina Morik,et al.  A Benchmark Dataset for Audio Classification and Clustering , 2005, ISMIR.

[133]  Yi-Hsuan Yang,et al.  The MediaEval 2013 Brave New Task: Emotion in Music , 2013, MediaEval.

[134]  Matthias Mauch,et al.  Automatic chord transcription from audio using computational models of musical context , 2010 .

[135]  Andreas F. Ehmann,et al.  Mining Music Reviews: Promising Preliminary Results , 2005, ISMIR.

[136]  J. Stephen Downie,et al.  Music information retrieval , 2005, Annu. Rev. Inf. Sci. Technol..

[137]  Peter Knees,et al.  Exploring the music similarity space on the web , 2011, TOIS.

[138]  E. Pampalk Islands of Music Analysis, Organization, and Visualization of Music Archives , 2002 .

[139]  Axel Röbel,et al.  Multiple Fundamental Frequency Estimation and Polyphony Inference of Polyphonic Music Signals , 2010, IEEE Transactions on Audio, Speech, and Language Processing.

[140]  Karin Dressler,et al.  SINUSOIDAL EXTRACTION USING AN EFFICIENT IMPLEMENTATION OF A MULTI-RESOLUTION FFT , 2006 .

[141]  Daniel P. W. Ellis,et al.  Please Scroll down for Article Journal of New Music Research a Web-based Game for Collecting Music Metadata a Web-based Game for Collecting Music Metadata , 2022 .

[142]  Alistair Moffat,et al.  EvaluatIR: an online tool for evaluating and comparing IR systems , 2009, SIGIR.

[143]  Youngmoo E. Kim,et al.  Singer Identification in Popular Music Recordings Using Voice Coding Features , 2002 .

[144]  Eleanor Selfridge-Field,et al.  Melodic Similarity : concepts, procedures, and applications , 1998 .

[145]  Paul Lamere,et al.  Using 3D Visualizations to Explore and Discover Music , 2007, ISMIR.

[146]  David Temperley A BAYESIAN KEY-FINDING MODEL , 2005 .

[147]  G. Peeters,et al.  Local key estimation based on harmonic and metric structures , 2009 .

[148]  Mark B. Sandler,et al.  Symbolic Representation of Musical Chords: A Proposed Syntax for Text Annotations , 2005, ISMIR.

[149]  Ching-Hua Chuan,et al.  Polyphonic Audio Key Finding Using the Spiral Array CEG Algorithm , 2005, 2005 IEEE International Conference on Multimedia and Expo.

[150]  J. Beauchamp,et al.  Fundamental frequency estimation of musical signals using a two‐way mismatch procedure , 1994 .

[151]  Noriko Kando,et al.  Overview of IR tasks , 1999, NTCIR.

[152]  Dirk Moelants,et al.  Extracting the perceptual tempo from music , 2004, ISMIR.

[153]  Paul Smolensky,et al.  Information processing in dynamical systems: foundations of harmony theory , 1986 .

[154]  Wei Chai,et al.  Semantic Segmentation and Summarization of Music , 2006 .

[155]  J. Stephen Downie,et al.  The Impact of MIREX on Scholarly Research (2005 - 2010) , 2012, ISMIR.

[156]  Markus Schedl Leveraging Microblogs for Spatiotemporal Music Information Retrieval , 2013, ECIR.

[157]  J. Stephen Downie,et al.  Survey Of Music Information Needs, Uses, And Seeking Behaviours: Preliminary Findings , 2004, ISMIR.

[158]  Peter Knees,et al.  A survey of music similarity and recommendation from music context data , 2013, ACM Trans. Multim. Comput. Commun. Appl..

[159]  Alistair Moffat,et al.  Principles for robust evaluation infrastructure , 2011, DESIRE '11.

[160]  Yi-Hsuan Yang,et al.  A large in-situ dataset for context-aware music recommendation on smartphones , 2013, 2013 IEEE International Conference on Multimedia and Expo Workshops (ICMEW).

[161]  Mert Bay,et al.  The Music Information Retrieval Evaluation eXchange: Some Observations and Insights , 2010, Advances in Music Information Retrieval.

[162]  Elias Pampalk,et al.  An Implementation of a Simple Playlist Generator Based on Audio Similarity Measures and User Feedback , 2006, ISMIR.

[163]  Jae Sik Lee,et al.  Context Awareness by Case-Based Reasoning in a Music Recommendation System , 2007, UCS.

[164]  Barry Vercoe,et al.  Automated analysis of musical structure , 2005 .

[165]  Tim Oates,et al.  A Human Activity Aware Learning Mobile Music Player , 2007, AITamI@IJCAI.

[166]  Nicu Sebe,et al.  Content-based multimedia information retrieval: State of the art and challenges , 2006, TOMCCAP.

[167]  Matei Ripeanu,et al.  Peer-to-peer architecture case study: Gnutella network , 2001, Proceedings First International Conference on Peer-to-Peer Computing.

[168]  Andreas F. Ehmann,et al.  The Music Information Retrieval Evaluation Exchange "Do-It-Yourself" Web Service , 2007, ISMIR.

[169]  Thierry Bertin-Mahieux,et al.  Large-Scale Cover Song Recognition Using the 2D Fourier Transform Magnitude , 2012, ISMIR.

[170]  Markus Schedl,et al.  Location-Aware Music Artist Recommendation , 2014, MMM.

[171]  Emilia Gómez,et al.  Two-Dimensional Visual Inspection of Pitch-Space, Many Time-Scales and Tonal Uncertainty over Time , 2011, MCM.

[172]  J. Stephen Downie,et al.  Ten Years of ISMIR: Reflections on Challenges and Opportunities , 2009, ISMIR.

[173]  Bingjun Zhang,et al.  CompositeMap: a novel framework for music similarity measure , 2009, SIGIR.

[174]  Mounia Lalmas,et al.  Overview of the INitiative for the evaluation of XML retrieval (INEX) 2003 , 2014 .

[175]  Enric Guaus,et al.  The Discipline formerly known as MIR , 2009 .

[176]  Paulo Villegas,et al.  Music recommendations with temporal context awareness , 2010, RecSys '10.

[177]  Greg Linden,et al.  Amazon . com Recommendations Item-to-Item Collaborative Filtering , 2001 .

[178]  Beth Logan,et al.  Mel Frequency Cepstral Coefficients for Music Modeling , 2000, ISMIR.

[179]  Emilia Gómez Gutiérrez,et al.  Tonal description of music audio signals , 2006 .

[180]  Emilia Gómez,et al.  Music and Geography: Content Description of Musical Audio from Different Parts of the World , 2009, ISMIR.

[181]  Graham E. Poliner,et al.  Melody Transcription From Music Audio: Approaches and Evaluation , 2007, IEEE Transactions on Audio, Speech, and Language Processing.

[182]  Johan Pauwels,et al.  Evaluating automatically estimated chord sequences , 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.

[183]  J. S. Downie The MIR/MDL Evaluation Project White Paper Collection , 2002 .

[184]  Avery Wang,et al.  An Industrial Strength Audio Search Algorithm , 2003, ISMIR.

[185]  Peter Knees,et al.  A WEB-BASED APPROACH TO ASSESSING ARTIST SIMILARITY USING CO-OCCURRENCES , 2005 .

[186]  Arthur Flexer,et al.  A MIREX Meta-analysis of Hubness in Audio Music Similarity , 2012, ISMIR.

[187]  Xiao Hu,et al.  Generating ground truth for music mood classification using mechanical turk , 2012, JCDL '12.

[188]  Masashi Yamamuro,et al.  A practical query-by-humming system for a large music database , 2000, ACM Multimedia.

[189]  Anssi Klapuri,et al.  Sound onset detection by applying psychoacoustic knowledge , 1999, 1999 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings. ICASSP99 (Cat. No.99CH36258).

[190]  Catherine Guastavino,et al.  User studies in the Music Information Retrieval Literature , 2011, ISMIR.

[191]  Jonathan Foote,et al.  Visualizing music and audio using self-similarity , 1999, MULTIMEDIA '99.

[192]  Nicola Orio,et al.  Brave New Task: MusiClef Multimodal Music Tagging , 2012, MediaEval.

[193]  Xavier Serra,et al.  Indexing music by mood: design and integration of an automatic content-based annotator , 2010, Multimedia Tools and Applications.

[194]  Francesco Ricci,et al.  Location-aware music recommendation using auto-tagging and hybrid matching , 2013, RecSys.

[195]  Emilia Gómez,et al.  Tonal Description of Polyphonic Audio for Music Content Processing , 2006, INFORMS J. Comput..

[196]  Jean-Pierre Martens,et al.  A comparison of human and automatic musical genre classification , 2004, 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[197]  Mark A. Schmuckler,et al.  Pitch and Pitch Structures , 2004 .

[198]  Markus Schedl Web-Based and Community-Based Music Information Extraction , 2011 .

[199]  Hideki Kawahara,et al.  YIN, a fundamental frequency estimator for speech and music. , 2002, The Journal of the Acoustical Society of America.

[200]  J. Stephen Downie,et al.  The Scientific Evaluation of Music Information Retrieval Systems: Foundations and Future , 2004, Computer Music Journal.

[201]  Eric J. Isaacson Music IR for Music Theory , 2002 .

[202]  Markus Schedl,et al.  Minimal test collections for low-cost evaluation of Audio Music Similarity and Retrieval systems , 2012, International Journal of Multimedia Information Retrieval.

[203]  Noriko Kando,et al.  User-centered Measures vs. System Effectiveness in Finding Similar Songs , 2012, ISMIR.

[204]  Andreas Nürnberger,et al.  MusicGalaxy: A Multi-focus Zoomable Interface for Multi-facet Exploration of Music Collections , 2010, CMMR.

[205]  A.P. Klapuri,et al.  A perceptually motivated multiple-F0 estimation method , 2005, IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2005..

[206]  Matthew E. P. Davies,et al.  On the automatic identification of difficult examples for beat tracking: Towards building new evaluation datasets , 2012, 2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[207]  Xavier Serra,et al.  Evaluation in Music Information Retrieval , 2013, Journal of Intelligent Information Systems.

[208]  Pedro Cano,et al.  Audio Fingerprinting: Concepts And Applications , 2005, Computational Intelligence for Modelling and Prediction.

[209]  E. Voorhees Whither Music IR Evaluation Infrastructure : Lessons to be Learned from TREC , 2002 .

[210]  Homer H. Chen,et al.  Music Emotion Recognition , 2011 .

[211]  Daniel P. W. Ellis,et al.  Melody Extraction from Polyphonic Music Signals: Approaches, applications, and challenges , 2014, IEEE Signal Processing Magazine.

[212]  Gary Marchionini,et al.  Synthesis Lectures on Information Concepts, Retrieval, and Services , 2009 .

[213]  Markus Schedl,et al.  Harvesting microblogs for contextual music similarity estimation: a co-occurrence-based framework , 2014, Multimedia Systems.

[214]  Daniel P. W. Ellis,et al.  Anchor space for classification and similarity measurement of music , 2003, 2003 International Conference on Multimedia and Expo. ICME '03. Proceedings (Cat. No.03TH8698).

[215]  François Pachet,et al.  Improving Timbre Similarity : How high’s the sky ? , 2004 .

[216]  Yi-Hsuan Yang,et al.  Machine Recognition of Music Emotion: A Review , 2012, TIST.

[217]  Paul Clough,et al.  ImageCLEF: Experimental Evaluation in Visual Information Retrieval , 2010 .

[218]  Markus Schedl,et al.  The neglected user in music information retrieval research , 2013, Journal of Intelligent Information Systems.

[219]  Markus Schedl,et al.  Automatically Detecting Members and Instrumentation of Music Bands Via Web Content Mining , 2007, Adaptive Multimedia Retrieval.

[220]  Remco C. Veltkamp,et al.  A Ground Truth For Half A Million Musical Incipits , 2005, J. Digit. Inf. Manag..

[221]  Emilia Gómez,et al.  Tonal-based retrieval of Arabic and middle-east music by automatic makam description , 2011, 2011 9th International Workshop on Content-Based Multimedia Indexing (CBMI).

[222]  Lior Rokach,et al.  Recommender Systems Handbook , 2010 .

[223]  William W. Cohen,et al.  Web-collaborative filtering: recommending music by crawling the Web , 2000, Comput. Networks.

[224]  Marc Leman,et al.  Tendencies, perspectives, and opportunities of musical audio-mining , 2002 .

[225]  Hinrich Schütze,et al.  Introduction to Information Retrieval: Evaluation in information retrieval , 2008 .

[226]  Klaus Seyerlehner FUSING BLOCK-LEVEL FEATURES FOR MUSIC SIMILARITY ESTIMATION , 2010 .

[227]  Sally Jo Cunningham,et al.  Toward an understanding of the history and impact of user studies in music information retrieval , 2013, Journal of Intelligent Information Systems.

[228]  Mohamed Sordo Semantic annotation of music collections: A computational approach , 2012 .

[229]  Emilia Gómez,et al.  Tonal representations for music retrieval: from version identification to query-by-humming , 2012, International Journal of Multimedia Information Retrieval.

[230]  Jordan B. L. Smith,et al.  A Meta-Analysis of the MIREX Structural Segmentation Task , 2013, ISMIR.

[231]  Andreas F. Ehmann,et al.  Human Similarity Judgments: Implications for the Design of Formal Evaluations , 2007, ISMIR.

[232]  Andreas Nürnberger,et al.  Weighted Self-Organizing Maps: Incorporating User Feedback , 2003, ICANN.

[233]  Craig Stuart Sapp Visual hierarchical key analysis , 2005, CIE.

[234]  Xavier Serra,et al.  Roadmap for Music Information ReSearch , 2013 .

[235]  Justin Donaldson,et al.  Uncovering Affinity of Artists to Multiple Genres from Social Behaviour Data , 2008, ISMIR.

[236]  Andreas Rauber,et al.  Facilitating Comprehensive Benchmarking Experiments on the Million Song Dataset , 2012, ISMIR.

[237]  Yuval Shavitt,et al.  Song Clustering Using Peer-to-Peer Co-occurrences , 2009, 2009 11th IEEE International Symposium on Multimedia.

[238]  Gaël Richard,et al.  ENST-Drums: an extensive audio-visual database for drum signals processing , 2006, ISMIR.

[239]  Christopher Harte,et al.  Towards automatic extraction of harmony information from music signals , 2010 .

[240]  Malcolm Slaney,et al.  Web-Scale Multimedia Analysis: Does Content Matter? , 2011, IEEE MultiMedia.

[241]  Daniel P. W. Ellis,et al.  The Quest for Ground Truth in Musical Artist Similarity , 2002, ISMIR.

[242]  O. Lartillot,et al.  A MATLAB TOOLBOX FOR MUSICAL FEATURE EXTRACTION FROM AUDIO , 2007 .

[243]  Petri Toiviainen,et al.  MIR in Matlab (II): A Toolbox for Musical Feature Extraction from Audio , 2007, ISMIR.

[244]  Francesco Ricci,et al.  Location-adapted music recommendation using tags , 2011, UMAP'11.

[245]  N. Scaringella,et al.  Automatic genre classification of music content: a survey , 2006, IEEE Signal Process. Mag..

[246]  Matthew E. P. Davies,et al.  Real-time beat-synchronous analysis of musical audio , 2009 .

[247]  E. Chew Towards a mathematical model of tonality , 2000 .

[248]  Xavier Serra,et al.  What is the Effect of Audio Quality on the Robustness of MFCCs and Chroma Features? , 2014, ISMIR.

[249]  Razvan Pascanu,et al.  Contextual tag inference , 2011, TOMCCAP.

[250]  Tim Pohle,et al.  Dynamic Playlist Generation Based on Skipping Behavior , 2005, ISMIR.

[251]  Emilia Gómez,et al.  Tonality Visualization of Polyphonic audio , 2005, ICMC.

[252]  José Luis Vicedo González,et al.  TREC: Experiment and evaluation in information retrieval , 2007, J. Assoc. Inf. Sci. Technol..

[253]  Karën Fort,et al.  Towards a (Better) Definition of the Description of Annotated MIR Corpora , 2012, ISMIR.

[254]  Patrick Susini,et al.  The Timbre Toolbox: extracting audio descriptors from musical signals. , 2011, The Journal of the Acoustical Society of America.

[255]  A. Noll Cepstrum pitch determination. , 1967, The Journal of the Acoustical Society of America.

[256]  Mónica Marrero,et al.  Audio Music Similarity and Retrieval: Evaluation Power and Stability , 2011, ISMIR.

[257]  Xavier Serra,et al.  A Multipitch Approach to Tonic Identification in Indian Classical Music , 2012, ISMIR.

[258]  Daniel P. W. Ellis,et al.  Quantitative Analysis of a Common Audio Similarity Measure , 2009, IEEE Transactions on Audio, Speech, and Language Processing.

[259]  Joan Serrà,et al.  From Low-Level to High-Level: Comparative Study of Music Similarity Measures , 2009, 2009 11th IEEE International Symposium on Multimedia.

[260]  Emmanuel Vincent,et al.  The 2005 Music Information retrieval Evaluation Exchange (MIREX 2005): Preliminary Overview , 2005, ISMIR.

[261]  Nicola Orio,et al.  Music Retrieval: A Tutorial and Review , 2006, Found. Trends Inf. Retr..

[262]  Emilia Gómez,et al.  Audio Cover Song Identification and Similarity: Background, Approaches, Evaluation, and Beyond , 2010, Advances in Music Information Retrieval.

[263]  Karin Dressler MULTIPLE FUNDAMENTAL FREQUENCY EXTRACTION FOR MIREX 2012 , 2011 .

[264]  Bill Tomlinson,et al.  PersonalSoundtrack: context-aware playlists that adapt to user pace , 2006, CHI Extended Abstracts.

[265]  T. Landauer,et al.  Indexing by Latent Semantic Analysis , 1990 .

[266]  Markus Schedl,et al.  Exploring Geospatial Music Listening Patterns in Microblog Data , 2012, Adaptive Multimedia Retrieval.

[267]  J. J. Rocchio,et al.  Relevance feedback in information retrieval , 1971 .

[268]  Douglas Turnbull,et al.  Using Artist Similarity to Propagate Semantic Information , 2009, ISMIR.

[269]  Masataka Goto,et al.  Musicream: New Music Playback Interface for Streaming, Sticking, Sorting, and Recalling Musical Pieces , 2005, ISMIR.

[270]  Antoni B. Chan,et al.  Time Series Models for Semantic Music Annotation , 2011, IEEE Transactions on Audio, Speech, and Language Processing.

[271]  François Pachet,et al.  Musical data mining for electronic music distribution , 2001, Proceedings First International Conference on WEB Delivering of Music. WEDELMUSIC 2001.

[272]  Dan Barry,et al.  Towards a Personal Automatic Music Playlist Generation Alogorithm: the need for Contextual Information , 2007 .

[273]  Remco C. Veltkamp,et al.  A Measure for Evaluating Retrieval Techniques based on Partially Ordered Ground Truth Lists , 2006, 2006 IEEE International Conference on Multimedia and Expo.

[274]  Emilia Gómez,et al.  Semantic audio content-based music recommendation and visualization based on user preference examples , 2013, Inf. Process. Manag..

[275]  Brian P. Bailey,et al.  DJogger: a mobile dynamic music device , 2006, CHI Extended Abstracts.

[276]  Janto Skowronek,et al.  Ground truth for automatic music mood classification , 2006, ISMIR.

[277]  Peter Knees,et al.  An innovative three-dimensional user interface for exploring music collections enriched , 2006, MM '06.

[278]  Gert R. G. Lanckriet,et al.  Combining audio content and social context for semantic music discovery , 2009, SIGIR.

[279]  Donna Harman,et al.  Information Retrieval Evaluation , 2011, Synthesis Lectures on Information Concepts, Retrieval, and Services.

[280]  Eric Brill,et al.  A Simple Rule-Based Part of Speech Tagger , 1992, HLT.

[281]  Meinard Müller,et al.  An Efficient Multiscale Approach to Audio Synchronization , 2006, ISMIR.

[282]  Ichiro Fujinaga,et al.  Web Services for Music Information Retrieval , 2004, ISMIR.

[283]  Andreas Rauber,et al.  Towards Time-resilient MIR Processes , 2012, ISMIR.

[284]  Mark B. Sandler,et al.  The amblr: A mobile spatial audio music browser , 2011, 2011 IEEE International Conference on Multimedia and Expo.

[285]  Masataka Goto,et al.  RWC Music Database: Popular, Classical and Jazz Music Databases , 2002, ISMIR.

[286]  Sally Jo Cunningham,et al.  Influences of ISMIR and MIREX Research on Technology Patents , 2013, ISMIR.

[287]  J. Stephen Downie,et al.  Challenges in Cross-Cultural/Multilingual Music Information Seeking , 2005, ISMIR.

[288]  Peter Knees,et al.  A music search engine built upon audio-based and web-based similarity measures , 2007, SIGIR.

[289]  François Pachet,et al.  Scaling up music playlist generation , 2002, Proceedings. IEEE International Conference on Multimedia and Expo.

[290]  Emilia Gómez,et al.  Melody Extraction From Polyphonic Music Signals Using Pitch Contour Characteristics , 2012, IEEE Transactions on Audio, Speech, and Language Processing.

[291]  Thierry Bertin-Mahieux,et al.  The Million Song Dataset , 2011, ISMIR.

[292]  Douglas Eck,et al.  Learning Tags that Vary Within a Song , 2010, ISMIR.

[293]  Ning Hu,et al.  A comparative evaluation of search techniques for query-by-humming using the MUSART testbed , 2007, J. Assoc. Inf. Sci. Technol..

[294]  Fabien Gouyon Computational Rhythm Description: A Review and Novel Approach , 2008 .

[295]  Paul Lamere,et al.  Social Tagging and Music Information Retrieval , 2008 .

[296]  Perfecto Herrera-Boyer,et al.  Automatic Classification of Musical Instrument Sounds , 2003 .

[297]  Nicola Orio,et al.  User-Aware Music Retrieval , 2012, Multimodal Music Processing.

[298]  Meinard Müller,et al.  Towards Timbre-Invariant Audio Features for Harmony-Based Music , 2010, IEEE Transactions on Audio, Speech, and Language Processing.

[299]  Markus Schedl,et al.  Hybrid retrieval approaches to geospatial music recommendation , 2013, SIGIR.

[300]  Peter Knees,et al.  Towards Automatic Retrieval of Album Covers , 2006, ECIR.

[301]  Perfecto Herrera,et al.  Rocking around the clock eight days a week: an exploration of temporal patterns of music listening , 2010, RecSys 2010.

[302]  Markus Schedl,et al.  A model for serendipitous music retrieval , 2012, CaRR '12.

[303]  Xavier Serra Data gathering for a culture specific approach in MIR , 2012, WWW.

[304]  Jin Ha Lee,et al.  Understanding User Requirements for Music Information Services , 2012, ISMIR.

[305]  Lassi A. Liikkanen,et al.  Out of the bubble: serendipitous even recommendations at an urban music festival , 2012, IUI '12.

[306]  J. Stephen Downie,et al.  Interim Report on Establishing MIR / MDL Evaluation Frameworks : Commentary on Consensus Building , 2002 .

[307]  Luke Windsor,et al.  Rhythm Perception and Production , 2000 .