A Clustering Strategy for the Key Segmentation of Musical Audio

Key changes are common in Western classical music. The precise segmentation of a music piece at instances where key changes occur allows for further analysis, like self-similarity analysis, chord recognition, and several other techniques that mainly pertain to the characterization of music content. This article examines the automatic segmentation of audio data into parts composed in different keys, using clustering on chroma-related spaces. To this end, the k-means algorithm is used and a methodology is proposed so that useful information about key changes can be derived, regardless of the number of clusters or key changes. The proposed methodology is evaluated by experimenting on the segmentation of recordings of existing compositions from the Classic-Romantic repertoire. Additional analysis is performed using artificial data sets. Specifically, the construction of artificial pieces is proposed as a means to investigate the potential of the strategy under discussion in predefined key-change scenarios that encompass different musical characteristics. For the existing compositions, we compare the results of our proposed methodology with others from the music information retrieval literature. Finally, although the proposed methodology is only capable of locating key changes and not the key identities themselves, we discuss results regarding the labeling of a composition's key in the located segments.

[1]  Laurent Oudre,et al.  Concurrent Estimation of Chords and Keys from Audio , 2010, ISMIR.

[2]  C. Harte,et al.  Detecting harmonic change in musical audio , 2006, AMCMM '06.

[3]  Elaine Chew,et al.  The Spiral Array: An Algorithm for Determining Key Boundaries , 2002, ICMAI.

[4]  Robert O. Gjerdingen,et al.  The Cognition of Basic Musical Structures , 2004 .

[5]  Andrei Doncescu,et al.  Non Negative Matrix Factorization Clustering Capabilities; Application on Multivariate Image Segmentation , 2009, 2009 International Conference on Complex, Intelligent and Software Intensive Systems.

[6]  Özgür Izmirli Localized Key Finding from Audio Using Nonnegative Matrix Factorization for Segmentation , 2007, ISMIR.

[7]  Barry Vercoe,et al.  Detection of Key Change in Classical Piano Music , 2005, ISMIR.

[8]  Malcolm Slaney,et al.  A Unified System for Chord Transcription and Key Extraction Using Hidden Markov Models , 2007, ISMIR.

[9]  Ching-Hua Chuan,et al.  Audio Key Finding: Considerations in System Design and Case Studies on Chopin's 24 Preludes , 2007, EURASIP J. Adv. Signal Process..

[10]  Anil K. Jain,et al.  Data clustering: a review , 1999, CSUR.

[11]  Mark B. Sandler,et al.  Key Estimation Using a Hidden Markov Model , 2006, ISMIR.

[12]  Barry Vercoe,et al.  Automated analysis of musical structure , 2005 .

[13]  F. Wilcoxon Individual Comparisons by Ranking Methods , 1945 .

[14]  J. A. Hartigan,et al.  A k-means clustering algorithm , 1979 .

[15]  W. L. Windsor Music and Probability , 2009 .

[16]  Meinard Müller,et al.  Audio Matching via Chroma-Based Statistical Features , 2005, ISMIR.

[17]  Özgür Izmirli Audio Key Finding Using Low-Dimensional Spaces , 2006, ISMIR.

[18]  Chris H. Q. Ding,et al.  K-means clustering via principal component analysis , 2004, ICML.

[19]  G. Peeters,et al.  Local key estimation based on harmonic and metric structures , 2009 .

[20]  Meinard Müller,et al.  Chroma Toolbox: Matlab Implementations for Extracting Variants of Chroma-Based Audio Features , 2011, ISMIR.

[21]  Maurice K. Wong,et al.  Algorithm AS136: A k-means clustering algorithm. , 1979 .

[22]  Geoffroy Peeters,et al.  Local Key Estimation From an Audio Signal Relying on Harmonic and Metrical Structures , 2012, IEEE Transactions on Audio, Speech, and Language Processing.