Audio Properties of Perceived Boundaries in Music

Data mining tasks such as music indexing, information retrieval, and similarity search, require an understanding of how listeners process music internally. Many algorithms for automatically analyzing the structure of recorded music assume that a large change in one or another musical feature suggests a section boundary. However, this assumption has not been tested: while our understanding of how listeners segment melodies has advanced greatly in the past decades, little is known about how this process works with more complex, full-textured pieces of music, or how stable this process is across genres. Knowing how these factors affect how boundaries are perceived will help researchers to judge the viability of algorithmic approaches with different corpora of music. We present a statistical analysis of a large corpus of recordings whose formal structure was annotated by expert listeners. We find that the acoustic properties of boundaries in these recordings corroborate findings of previous perceptual experiments. Nearly all boundaries correspond to peaks in novelty functions, which measure the rate of change of a musical feature at a particular time scale. Moreover, most of these boundaries match peaks in novelty for several features at several time scales. We observe that the boundary-novelty relationship can vary with listener, time scale, genre, and musical feature. Finally, we show that a boundary profile derived from a collection of novelty functions correlates with the estimated salience of boundaries indicated by listeners.

[1]  Ching-Hua Chuan,et al.  Audio Key Finding: Considerations in System Design and Case Studies on Chopin's 24 Preludes , 2007, EURASIP J. Adv. Signal Process..

[2]  Peter Grosche,et al.  Structure-Based Audio Fingerprinting for Music Retrieval , 2012, ISMIR.

[3]  Meinard Müller,et al.  Towards Structural Analysis of Audio Recordings in the Presence of Musical Variations , 2007, EURASIP J. Adv. Signal Process..

[4]  Gerhard Widmer,et al.  Exploring Music Collections by Browsing Different Views , 2004, Computer Music Journal.

[5]  Elizabeth Hellmuth Margulis,et al.  Musical Repetition Detection Across Multiple Exposures , 2012 .

[6]  Masataka Goto,et al.  A chorus section detection method for musical audio signals and its application to a music listening station , 2006, IEEE Transactions on Audio, Speech, and Language Processing.

[7]  Carol L. Krumhansl,et al.  Perceiving Musical Time , 1990 .

[8]  Emilios Cambouropoulos,et al.  The Local Boundary Detection Model (LBDM) and its Application in the Study of Expressive Timing , 2001, ICMC.

[9]  Masataka Goto,et al.  A Supervised Approach for Detecting Boundaries in Music Using Difference Features and Boosting , 2007, ISMIR.

[10]  Irène Deliège Grouping Conditions in Listening to Music: An Approach to Lerdahl & Jackendoff's Grouping Preference Rules , 1987 .

[11]  Michael J. Bruderer Perceptual evaluation of models for music segmentation , 2008 .

[12]  Jonathan Foote,et al.  Automatic audio segmentation using a measure of audio novelty , 2000, 2000 IEEE International Conference on Multimedia and Expo. ICME2000. Proceedings. Latest Advances in the Fast Changing World of Multimedia (Cat. No.00TH8532).

[13]  Satoshi Tojo,et al.  Implementing “A Generative Theory of Tonal Music” , 2006 .

[14]  Geoffroy Peeters Deriving Musical Structures from Signal Analysis for Music Audio Summary Generation: "Sequence" and "State" Approach , 2003, CMMR.

[15]  John Z. Zhang,et al.  A Perceptual Study on Music Segmentation and Genre Classification , 2012 .

[16]  D. Temperley The Cognition of Basic Musical Structures , 2001 .

[17]  François Pachet,et al.  "The way it Sounds": timbre models for analysis and retrieval of music signals , 2005, IEEE Transactions on Multimedia.

[18]  Ag Armin Kohlrausch,et al.  The perception of structural boundaries in melody lines of Western popular music , 2009 .

[19]  A. Klapuri,et al.  ACOUSTIC FEATURES FOR MUSIC PIECE STRUCTURE ANALYSIS , 2011 .

[20]  Jordan B. L. Smith,et al.  Design and creation of a large-scale database of structural annotations , 2011, ISMIR.

[21]  Elias Pampalk,et al.  Content-based organization and visualization of music archives , 2002, MULTIMEDIA '02.

[22]  E. Chew Towards a mathematical model of tonality , 2000 .

[23]  Annabel J. Cohen,et al.  Parsing of Melody: Quantification and Testing of the Local Grouping Rules of Lerdahl and Jackendoff's A Generative Theory of Tonal Music , 2004 .

[24]  R. Jackendoff,et al.  A Generative Theory of Tonal Music , 1985 .

[25]  Meinard Müller,et al.  Audio-based Music Structure Analysis , 2010 .

[26]  Elaine Chew,et al.  Regards on two regards by Messiaen: Post-tonal music segmentation using pitch context distances in the spiral array , 2005 .