Separating Voices in Polyphonic Music: A Contig Mapping Approach

Voice separation is a critical component of music information retrieval, music analysis, and automated transcription systems. We present a contig mapping approach to voice separation based on perceptual principles. The algorithm runs in O(n²) time, uses only pitch height and event boundaries, and requires no user-defined parameters. The method segments a piece into contigs according to voice count, then reconnects fragments in adjacent contigs using a shortest-distance strategy. Fragments are connected in order of their distance from maximal voice contigs, where the voice ordering is known. This contig-mapping algorithm has been implemented in VoSA, a Java-based voice separation analyzer software. The algorithm performed well when applied to J. S. Bach's Two- and Three-Part Inventions and the forty-eight Fugues from the Well-Tempered Clavier. We report an overall average fragment consistency of 99.75%, a correct fragment connection rate of 94.50%, and an average voice consistency of 88.98%, metrics we propose for measuring voice separation performance.
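
To illustrate the shortest-distance connection principle described above, the following is a minimal sketch (not the VoSA implementation) of the step that joins fragments across a contig boundary. It assumes hypothetical inputs and names: each fragment is reduced to a single MIDI pitch (the last pitch of a fragment in the left contig, the first pitch of a fragment in the right contig), and the pairing that minimizes total pitch distance is found by brute force over permutations, which is cheap because voice counts are small.

import java.util.*;

/** Sketch of shortest-distance fragment connection across a contig boundary.
 *  Data structures and names are illustrative assumptions, not the paper's code. */
public class FragmentConnector {

    /** Choose the pairing of left and right fragments that minimizes
     *  the summed absolute pitch distance at the boundary. */
    static int[] connect(int[] lastPitchesLeft, int[] firstPitchesRight) {
        int n = lastPitchesLeft.length;
        int[] best = null;
        int bestCost = Integer.MAX_VALUE;
        for (int[] perm : permutations(n)) {
            int cost = 0;
            for (int v = 0; v < n; v++) {
                cost += Math.abs(lastPitchesLeft[v] - firstPitchesRight[perm[v]]);
            }
            if (cost < bestCost) {
                bestCost = cost;
                best = perm.clone();
            }
        }
        return best; // best[v] = index of the right-contig fragment joined to voice v
    }

    /** Enumerate all permutations of {0..n-1}. */
    static List<int[]> permutations(int n) {
        List<int[]> out = new ArrayList<>();
        permute(new int[n], new boolean[n], 0, out);
        return out;
    }

    private static void permute(int[] cur, boolean[] used, int depth, List<int[]> out) {
        if (depth == cur.length) { out.add(cur.clone()); return; }
        for (int i = 0; i < cur.length; i++) {
            if (!used[i]) {
                used[i] = true;
                cur[depth] = i;
                permute(cur, used, depth + 1, out);
                used[i] = false;
            }
        }
    }

    public static void main(String[] args) {
        // Hypothetical boundary: pitches ending the left contig's fragments
        // and starting the right contig's fragments (MIDI note numbers).
        int[] left = {67, 55};
        int[] right = {64, 57};
        System.out.println(Arrays.toString(connect(left, right))); // [0, 1]
    }
}

In the full algorithm, such connections would be made outward from maximal voice contigs, where all voices sound and the voice ordering is therefore unambiguous.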