A Regularity-Constrained Viterbi Algorithm and Its Application to The Structural Segmentation of Songs

This paper presents a general approach for the structural segmentation of songs. It is formalized as a cost optimization problem that combines properties of the musical content and prior regularity assumption on the segment length. A versatile implementation of this approach is proposed by means of a Viterbi algorithm, and the design of the costs are discussed. We then present two systems derived from this approach, based on acoustic and symbolic features respectively. The advantages of the regularity constraint are evaluated on a database of 100 popular songs by showing a significant improvement of the segmentation performance in terms of F-measure.

[1]  Emmanuel Vincent,et al.  A Roadmap Towards Versatile MIR , 2010, ISMIR.

[2]  Jonathan Foote,et al.  Media segmentation using self-similarity decomposition , 2003, IS&T/SPIE Electronic Imaging.

[3]  Masataka Goto,et al.  AIST Annotation for the RWC Music Database , 2006, ISMIR.

[4]  Masataka Goto,et al.  RWC Music Database: Popular, Classical and Jazz Music Databases , 2002, ISMIR.

[5]  Beth Logan,et al.  Music summarization using key phrases , 2000, 2000 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.00CH37100).

[6]  T. Jehan,et al.  Hierarchical multi-class self similarities , 2005, IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2005..

[7]  Xavier Rodet,et al.  Toward Automatic Music Audio Summary Generation from Signal Analysis , 2002, ISMIR.

[8]  Anssi Klapuri,et al.  Signal Processing Methods for Music Transcription , 2006 .

[9]  Shigeki Sagayama,et al.  HMM-based approach for automatic chord detection using refined acoustic features , 2010, 2010 IEEE International Conference on Acoustics, Speech and Signal Processing.

[10]  Emmanuel Vincent,et al.  A structural segmentation of songs using generalized likelihood ratio under regularity assumptions , 2010 .

[11]  C.-C. Jay Kuo,et al.  Similarity matrix processing for music structure analysis , 2006, AMCMM '06.

[12]  Meinard Müller,et al.  Audio-based Music Structure Analysis , 2010 .

[13]  Ron J. Weiss,et al.  Identifying Repeated Patterns in Music Using Sparse Convolutive Non-negative Matrix Factorization , 2010, ISMIR.

[14]  Kristoffer Jensen,et al.  Multiple Scale Music Segmentation Using Rhythm, Timbre, and Harmony , 2007, EURASIP J. Adv. Signal Process..

[15]  Masataka Goto,et al.  A chorus-section detecting method for musical audio signals , 2003, 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03)..

[16]  A. Klapuri,et al.  Music structure analysis by finding repeated parts , 2006, AMCMM '06.

[17]  Antoni B. Chan,et al.  Modeling Music as a Dynamic Texture , 2010, IEEE Transactions on Audio, Speech, and Language Processing.

[18]  Mark B. Sandler,et al.  New methods in structural segmentation of musical audio , 2006, 2006 14th European Signal Processing Conference.

[19]  Masataka Goto,et al.  A Supervised Approach for Detecting Boundaries in Music Using Difference Features and Boosting , 2007, ISMIR.

[20]  Emmanuel Vincent,et al.  Decomposition Into Autonomous and Comparable Blocks: A Structural Description of Music Pieces , 2010, ISMIR.