New Developments in Music Information Retrieval

The digital revolution has brought about a massive increase in the availability and distribution of musicrelated documents of various modalities comprising textual, audio, as well as visual material. Therefore, the development of techniques and tools for organizing, structuring, retrieving, navigating, and presenting music-related data has become a major strand of research—the field is often referred to as Music Information Retrieval (MIR). Major challenges arise because of the richness and diversity of music in form and content leading to novel and exciting research problems. In this article, we give an overview of new developments in the MIR field with a focus on content-based music analysis tasks including audio retrieval, music synchronization, structure analysis, and performance analysis.

[1]  Matthew Cooper,et al.  Summarizing popular music via structural similarity analysis , 2003, 2003 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (IEEE Cat. No.03TH8684).

[2]  Meinard Müller,et al.  Information retrieval for music and motion , 2007 .

[3]  Masataka Goto,et al.  Music Structure Analysis from Acoustic Signals , 2008 .

[4]  Nicola Orio,et al.  Robust Polyphonic Midi Score Following with Hidden Markov Models , 2004, ICMC.

[5]  Avery Wang,et al.  An Industrial Strength Audio Search Algorithm , 2003, ISMIR.

[6]  Daniel P. W. Ellis,et al.  Identifying `Cover Songs' with Chroma Features and Dynamic Programming Beat Tracking , 2007, 2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP '07.

[7]  Meinard Müller,et al.  Audio Matching via Chroma-Based Statistical Features , 2005, ISMIR.

[8]  Jürgen Herre,et al.  AudioID: Towards Content-Based Identification of Audio Material , 2001 .

[9]  Meinard Müller,et al.  Towards Automated Extraction of Tempo Parameters from Expressive Music Recordings , 2009, ISMIR.

[10]  Meinard Müller,et al.  Towards Structural Analysis of Audio Recordings in the Presence of Musical Variations , 2007, EURASIP J. Adv. Signal Process..

[11]  Simon Dixon,et al.  Evaluation of the Audio Beat Tracking System BeatRoot , 2007 .

[12]  Pedro Cano,et al.  A review of algorithms for audio fingerprinting , 2002, 2002 IEEE Workshop on Multimedia Signal Processing..

[13]  Meinard Müller,et al.  Audio-based Music Structure Analysis , 2010 .

[14]  Xavier Rodet,et al.  Improving polyphonic and poly-instrumental music to score alignment , 2003, ISMIR.

[15]  Gerhard Widmer,et al.  In Search of the Horowitz Factor , 2003, AI Mag..

[16]  Meinard Müller,et al.  A Framework for Managing Multimodal Digitized Music Collections , 2008, ECDL.

[17]  Meinard Müller,et al.  An Efficient Multiscale Approach to Audio Synchronization , 2006, ISMIR.

[18]  Verena Konz,et al.  A Multimodal Way of Experiencing and Exploring Music , 2010 .

[19]  Chin-Hui Lee,et al.  A hidden Markov model based approach to music segmentation and identification , 2003, Fourth International Conference on Information, Communications and Signal Processing, 2003 and the Fourth Pacific Rim Conference on Multimedia. Proceedings of the 2003 Joint.

[20]  Emilia Gómez Gutiérrez,et al.  Tonal description of music audio signals , 2006 .

[21]  Michael A. Casey,et al.  The Importance of Sequences in Musical Similarity , 2006, 2006 IEEE International Conference on Acoustics Speech and Signal Processing Proceedings.

[22]  Meinard Müller,et al.  Towards an Efficient Algorithm for Automatic Score-to-Audio Synchronization , 2004, ISMIR.

[23]  Roger B. Dannenberg,et al.  An On-Line Algorithm for Real-Time Accompaniment , 1984, ICMC.

[24]  Meinard Müller,et al.  Efficient Index-Based Audio Matching , 2008, IEEE Transactions on Audio, Speech, and Language Processing.

[25]  George Tzanetakis,et al.  Musical genre classification of audio signals , 2002, IEEE Trans. Speech Audio Process..

[26]  Michael Clausen,et al.  Slave: A Score-Lyrics-Audio-Video-Explorer , 2009, ISMIR.

[27]  Malcolm Slaney,et al.  Analysis of Minimum Distances in High-Dimensional Musical Spaces , 2008, IEEE Transactions on Audio, Speech, and Language Processing.

[28]  Daniel P. W. Ellis,et al.  Ground-truth transcriptions of real music from force-aligned MIDI syntheses , 2003, ISMIR.

[29]  Gerhard Widmer,et al.  A Multi-pass Algorithm for Accurate Audio-to-Score Alignment , 2010, ISMIR.

[30]  Mark B. Sandler,et al.  A tutorial on onset detection in music signals , 2005, IEEE Transactions on Speech and Audio Processing.

[31]  Craig Stuart Sapp Comparative Analysis of Multiple Musical Performances , 2007, ISMIR.

[32]  Meinard Müller,et al.  Automated Synchronization of Scanned Sheet Music with Audio Recordings , 2007, ISMIR.

[33]  Christopher Raphael,et al.  A Probabilistic Expert System for Automatic Musical Accompaniment , 2001 .

[34]  Mark Sandler,et al.  Segmentation of Musical Signals Using Hidden Markov Models. , 2001 .

[35]  Anssi Klapuri,et al.  Music Structure Analysis Using a Probabilistic Fitness Measure and a Greedy Search Algorithm , 2009, IEEE Transactions on Audio, Speech, and Language Processing.

[36]  Xavier Serra,et al.  Chroma Binary Similarity and Local Alignment Applied to Cover Song Identification , 2008, IEEE Transactions on Audio, Speech, and Language Processing.

[37]  Jonathan Foote,et al.  Automatic audio segmentation using a measure of audio novelty , 2000, 2000 IEEE International Conference on Multimedia and Expo. ICME2000. Proceedings. Latest Advances in the Fast Changing World of Multimedia (Cat. No.00TH8532).

[38]  Frank Kurth,et al.  A Concept for Using Combined Multimodal Queries in Digital Music Libraries , 2009, ECDL.

[39]  Geoffroy Peeters Sequence Representation of Music Structure Using Higher-Order Similarity Matrix and Maximum-Likelihood Approach , 2007, ISMIR.

[40]  Biing-Hwang Juang,et al.  Fundamentals of speech recognition , 1993, Prentice Hall signal processing series.

[41]  Werner Goebl,et al.  Visualizing Expressive Performance in Tempo—Loudness Space , 2003, Computer Music Journal.

[42]  Christopher Raphael,et al.  A Hybrid Graphical Model for Aligning Polyphonic Audio with Musical Scores , 2004, ISMIR.

[43]  Gregory H. Wakefield,et al.  Audio thumbnailing of popular music using chroma-based representations , 2005, IEEE Transactions on Multimedia.

[44]  Gerhard Widmer,et al.  Machine Discoveries: A Few Simple, Robust Local Expression Principles , 2002 .

[45]  Mark B. Sandler,et al.  Structural Segmentation of Musical Audio by Constrained Clustering , 2008, IEEE Transactions on Audio, Speech, and Language Processing.

[46]  Masataka Goto,et al.  A chorus section detection method for musical audio signals and its application to a music listening station , 2006, IEEE Transactions on Audio, Speech, and Language Processing.

[47]  Ye Wang,et al.  LyricAlly: Automatic Synchronization of Textual Lyrics to Acoustic Music Signals , 2008, IEEE Transactions on Audio, Speech, and Language Processing.

[48]  B. Ong Structural analysis and segmentation of music signals , 2007 .

[49]  Christopher Raphael,et al.  Music score alignment and computer accompaniment , 2006, CACM.

[50]  Meinard Müller,et al.  Lyrics-Based Audio Retrieval and Multimodal Navigation in Music Collections , 2007, ECDL.

[51]  Gerhard Widmer,et al.  MATCH: A Music Alignment Tool Chest , 2005, ISMIR.

[52]  Arshia Cont,et al.  A Coupled Duration-Focused Architecture for Real-Time Music-to-Score Alignment , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[53]  Hiromasa Fujihara,et al.  Automatic Synchronization between Lyrics and Music CD Recordings Based on Viterbi Alignment of Segregated Vocal Signals , 2006, Eighth IEEE International Symposium on Multimedia (ISM'06).

[54]  Changsheng Xu,et al.  Automatic music classification and summarization , 2005, IEEE Transactions on Speech and Audio Processing.

[55]  Peter Grosche,et al.  High resolution audio synchronization using chroma onset features , 2009, 2009 IEEE International Conference on Acoustics, Speech and Signal Processing.

[56]  Frank Kurth,et al.  Identification of Highly Distorted Audio Material for Querying Large Scale Data Bases , 2002 .

[57]  Ning Hu,et al.  Polyphonic Audio Matching for Score Following and Intelligent Audio Editors , 2003, ICMC.

[58]  Meinard Müller,et al.  A Demonstration of the SyncPlayer System , 2007, ISMIR.

[59]  George Tzanetakis,et al.  Polyphonic audio matching and alignment for music retrieval , 2003, 2003 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (IEEE Cat. No.03TH8684).