论文信息 - Extraction of High-Level Musical Structure From Audio Data and Its Application to Thumbnail Generation

Extraction of High-Level Musical Structure From Audio Data and Its Application to Thumbnail Generation

A method for segmenting musical audio with a hierarchical timbre model is introduced. New evidence is presented to show that music segmentation can be recast as clustering of timbre features, and a new clustering algorithm is described. A prototype thumbnail-generating application is described and evaluated. Experimental results are given, including comparison of machine and human segmentations

[1] Mohan S. Kankanhalli,et al. Content-based music structure analysis with applications to music semantics understanding , 2004, MULTIMEDIA '04.

[2] Masataka Goto,et al. A chorus-section detecting method for musical audio signals , 2003, 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03)..

[3] François Pachet,et al. "The way it Sounds": timbre models for analysis and retrieval of music signals , 2005, IEEE Transactions on Multimedia.

[4] Lawrence R. Rabiner,et al. A tutorial on hidden Markov models and selected applications in speech recognition , 1989, Proc. IEEE.

[5] Gerhard Widmer,et al. HIDDEN MARKOV MODELS FOR SPECTRAL SIMILARITY OF SONGS , 2005 .

[6] Jonathan Foote,et al. Visualizing music and audio using self-similarity , 1999, MULTIMEDIA '99.

[7] Lie Lu,et al. Repeating pattern discovery and structure analysis from acoustic music data , 2004, MIR '04.

[8] Xavier Rodet,et al. Toward Automatic Music Audio Summary Generation from Signal Analysis , 2002, ISMIR.

[9] Mark B. Sandler,et al. A Markov-Chain Monte-Carlo Approach to Musical Audio Segmentation , 2006, 2006 IEEE International Conference on Acoustics Speech and Signal Processing Proceedings.

[10] Henrique S. Malvar,et al. Using audio fingerprinting for duplicate detection and thumbnail generation , 2005, Proceedings. (ICASSP '05). IEEE International Conference on Acoustics, Speech, and Signal Processing, 2005..

[11] Mark B. Sandler,et al. Theory and Evaluation of a Bayesian Music Structure Extractor , 2005, ISMIR.

[12] Beth Logan,et al. Music summarization using key phrases , 2000, 2000 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.00CH37100).

[13] Matthew E. P. Davies,et al. BEAT TRACKING WITH A TWO STATE MODEL , 2005 .