Finding Repeating Patterns in Acoustic Musical Signals : Applications for Audio Thumbnailing

Finding structure and repetitions in a musical signal is crucial to enable interactive browsing into large databases of music files. Notably, it is useful to produce short summaries of musical pieces, or ”audio thumbnails”. In this paper, we propose an algorithm to find repeating patterns in an acoustic musical signal. We first segment the signal into a meaningful succession of timbres. This gives a reduced string representation of the music, the texture score, which doesn’t encode any pitch information. We then look for patterns in this representation, using two techniques from image processing: Kernel Convolution and Hough Transform. The resulting patterns are relevant to musical structure, which shows that pitch is not the only useful representation for the structural analysis of polyphonic music.

[1]  David Huron Perceptual and Cognitive Applications in Music Information Retrieval , 2000, ISMIR.

[2]  Costas S. Iliopoulos,et al.  Pattern Processing in Melodic Sequences: Challenges, Caveats and Prospects , 2001, Comput. Humanit..

[3]  George Tzanetakis,et al.  Audio Information Retrieval (AIR) Tools , 2000, ISMIR.

[4]  Nicola Orio,et al.  The Use of Melodic Segmentation for Content-based Retrieval of Musical Data , 1999, ICMC.

[5]  J. Nattiez Fondements d'une sémiologie de la musique , 1976 .

[6]  Maxime Crochemore,et al.  An Optimal Algorithm for Computing the Repetitions in a Word , 1981, Inf. Process. Lett..

[7]  Mark Sandler,et al.  Using Long-Term Structure to Retrieve Music: Representation and Matching , 2001 .

[8]  Lawrence R. Rabiner,et al.  A tutorial on hidden Markov models and selected applications in speech recognition , 1989, Proc. IEEE.

[9]  Arbee L. P. Chen,et al.  Efficient repeating pattern finding in music databases , 1998, CIKM '98.

[10]  Mark Sandler,et al.  Segmentation of Musical Signals Using Hidden Markov Models. , 2001 .

[11]  G. H. Wakefield,et al.  To catch a chorus: using chroma-based representations for audio thumbnailing , 2001, Proceedings of the 2001 IEEE Workshop on the Applications of Signal Processing to Audio and Acoustics (Cat. No.01TH8575).

[12]  Rajeev Raman,et al.  String-Matching techniques for musical similarity and melodic recognition , 1998 .

[13]  PhD V. F. Leavers BSc Shape Detection in Computer Vision Using the Hough Transform , 1992, Springer London.

[14]  Biing-Hwang Juang,et al.  Fundamentals of speech recognition , 1993, Prentice Hall signal processing series.

[15]  Beth Logan,et al.  Music summarization using key phrases , 2000, 2000 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.00CH37100).