Similarity matrix processing for music structure analysis

This work analyzes the structure of pop and rock songs from audio signals by processing similarity matrices. A similarity matrix gives the pairwise similarity between any two short, fixed-length intervals of a song. We use two similarity matrices to show their differing characteristics, which are explained in terms of musical chord successions. Several similarity matrix processing techniques are then developed for music structure analysis. First, an algorithm is proposed to identify the boundaries and periods of repetitive chord successions with short periods. Second, the Viterbi algorithm is applied to detect straight segments along sub-diagonal lines of the similarity matrix, with the periods of repeating chord successions used to refine the state space and improve detection performance. Finally, a post-processing technique maps the detected segments to sections of the song. Experimental results on test musical audio data demonstrate the performance of the proposed method.
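To make the central idea concrete, the following is a minimal sketch of how a similarity matrix can be built and how a repeating chord succession shows up along a sub-diagonal. It is illustrative only: the abstract does not specify the features or the similarity measure, so cosine similarity, the toy three-dimensional "chord" vectors, and the helper names `cosine`, `similarity_matrix`, and `subdiagonal_mean` are all assumptions, and the Viterbi-based segment detection is not reproduced here.

```python
import math


def cosine(u, v):
    """Cosine similarity of two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm > 0 else 0.0


def similarity_matrix(frames):
    """Pairwise similarity between every two fixed-length intervals (frames)."""
    n = len(frames)
    return [[cosine(frames[i], frames[j]) for j in range(n)] for i in range(n)]


def subdiagonal_mean(S, lag):
    """Mean similarity along the sub-diagonal `lag` frames below the main diagonal."""
    n = len(S)
    values = [S[i + lag][i] for i in range(n - lag)]
    return sum(values) / len(values)


# Hypothetical toy features: one orthogonal vector per chord (assumption).
C, F, G = [1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]
frames = [C, F, G, C, F, G, C, F, G]  # the succession C-F-G repeated three times

S = similarity_matrix(frames)

# The lag whose sub-diagonal is most similar on average estimates the period.
best_lag = max(range(1, 5), key=lambda lag: subdiagonal_mean(S, lag))
print("estimated period (frames):", best_lag)
```

In this toy case the sub-diagonal at a lag of three frames consists entirely of matching chords, so it stands out against the other lags. Real audio features would give noisier lines, which is where a sequence-decoding method such as the Viterbi algorithm becomes useful.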
