论文信息 - Frame-Level Audio Segmentation for Abridged Musical Works

Frame-Level Audio Segmentation for Abridged Musical Works

Large-scale musical works such as operas may last several hours and typically involve a huge number of musicians. For such compositions, one often finds different arrangements and abridged versions (often lasting less than an hour), which can also be performed by smaller ensembles. Abridged versions still convey the flavor of the musical work containing the most important excerpts and melodies. In this paper, we consider the task of automatically segmenting an audio recording of a given version into semantically meaningful parts. Following previous work, the general strategy is to transfer a reference segmentation of the original complete work to the given version. Our main contribution is to show how this can be accomplished when dealing with strongly abridged versions. To this end, opposed to previously suggested segment-level matching procedures, we adapt a frame-level matching approach for transferring the reference segment information to the unknown version. Considering the opera “Der Freischutz” as an example scenario, we discuss how to balance out flexibility and robustness properties of our proposed framelevel segmentation procedure.

Meinard Müller | Thomas Prätzlich | Meinard Müller | Thomas Prätzlich

[1] Meinard Müller,et al. SM Toolbox: MATLAB Implementations for Computing and Enhancing Similarity Matrices , 2014, Semantic Audio.

[2] Gerhard Widmer,et al. Automatic Alignment of Music Performances with Structural Differences , 2013, ISMIR.

[3] Nicola Orio,et al. An Efficient Identification Methodology for Improved Access to Music Heritage Collections , 2012, J. Multim..

[4] Mark D. Plumbley,et al. Score-Informed Source Separation for Musical Audio Recordings: An overview , 2014, IEEE Signal Processing Magazine.

[5] Ning Hu,et al. Polyphonic Audio Matching for Score Following and Intelligent Audio Editors , 2003, ICMC.

[6] Gerhard Widmer,et al. MATCH: A Music Alignment Tool Chest , 2005, ISMIR.

[7] Meinard Müller,et al. Information retrieval for music and motion , 2007 .

[8] Meinard Müller,et al. Freischütz Digital: A Case Study for Reference-Based Audio Segmentation for Operas , 2013, ISMIR.

[9] Roger B. Dannenberg,et al. Remixing Stereo Music with Score-Informed Source Separation , 2006, ISMIR.

[10] Xavier Serra,et al. Chroma Binary Similarity and Local Alignment Applied to Cover Song Identification , 2008, IEEE Transactions on Audio, Speech, and Language Processing.

[11] Meinard Müller,et al. Towards Cross-Version Harmonic Analysis of Music , 2012, IEEE Transactions on Multimedia.

[12] Hanna M. Lukashevich. Towards Quantitative Measures of Evaluating Song Segmentation , 2008, ISMIR.