Perceptual Analysis of the F-Measure to Evaluate Section Boundaries in Music

In this paper, we aim to raise awareness of the limitations of the F-measure when evaluating the quality of the boundaries found in the automatic segmentation of music. We presentanddiscusstheresultsofvariousexperimentswhere subjects listened to different musical excerpts containing boundary indications and had to rate the quality of the boundaries. These boundaries were carefully generated from state-of-the-art segmentation algorithms as well as human-annotated data. The results show that humans tend to give more relevance to theprecisioncomponent of the Fmeasure rather than the recall component, therefore making the classical F-measure not as perceptually informative as currently assumed. Based on the results of the experiments, we discuss the potential of an alternative evaluation based on the F-measure that emphasizes precision over recall, making the section boundary evaluation more expressive and reliable.

[1]  Masataka Goto,et al.  A Supervised Approach for Detecting Boundaries in Music Using Difference Features and Boosting , 2007, ISMIR.

[2]  Ag Armin Kohlrausch,et al.  The perception of structural boundaries in melody lines of Western popular music , 2009 .

[3]  Guillaume Boutard,et al.  Browsing inside a Music Track, the Experimentation Case Study , 2006 .

[4]  Meinard Müller,et al.  Audio-based Music Structure Analysis , 2010 .

[5]  Perfecto Herrera,et al.  Semantic Segmentation of Music audio Contents , 2005, ICMC.

[6]  David W. Hosmer,et al.  Applied Logistic Regression , 1991 .

[7]  B Tillmann,et al.  Global context effect in normal and scrambled musical sequences. , 2001, Journal of experimental psychology. Human perception and performance.

[8]  Mark B. Sandler,et al.  Structural Segmentation of Musical Audio by Constrained Clustering , 2008, IEEE Transactions on Audio, Speech, and Language Processing.

[9]  Ron J. Weiss,et al.  Unsupervised Discovery of Temporal Structure in Music , 2011, IEEE Journal of Selected Topics in Signal Processing.

[10]  C. Y. Peng,et al.  An Introduction to Logistic Regression Analysis and Reporting , 2002 .

[11]  Daniel P. W. Ellis,et al.  Learning to segment songs with ordinal linear discriminant analysis , 2014, 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[12]  Mark B. Sandler,et al.  The Sonic Visualiser: A Visualisation Platform for Semantic Descriptors from Musical Signals , 2006, ISMIR.

[13]  Peter Grosche,et al.  Unsupervised Music Structure Annotation by Time Series Structure Features and Segment Similarity , 2014, IEEE Transactions on Multimedia.

[14]  Oriol Nieto,et al.  Convex non-negative matrix factorization for automatic music structure identification , 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.

[15]  Emmanuel Deruty,et al.  Is Music Structure Annotation Multi-dimensional ? A Proposal for Robust Local Music Annotation , 2009 .

[16]  Jordan B. L. Smith,et al.  Design and creation of a large-scale database of structural annotations , 2011, ISMIR.