论文信息 - Simultaneous Unsupervised Learning of Flamenco Metrical Structure, Hypermetrical Structure, and Multipart Structural Relations

Simultaneous Unsupervised Learning of Flamenco Metrical Structure, Hypermetrical Structure, and Multipart Structural Relations

We show how a new unsupervised approach to learning musical relationships can exploit Bayesian MAP induction of stochastic transduction grammars to overcome the challenges of learning complex relationships between multiple rhythmic parts that previously lay outside the scope of general computational approaches to music structure learning. A good illustrative genre is flamenco, which employs not only regular but also irregular hypermetrical structures that rapidly switch between 3/4 and 6/8 mediocompas blocks. Moreover, typical flamenco idioms employ heavy syncopation and sudden, misleading off-beat accents and patterns, while often elliding the downbeat accents that humans as well as existing meter-finding algorithms rely on, thus creating a high degree of listener “surprise” that makes not only the structural relations, but even the metrical structure itself, ellusive to learn. Flamenco musicians rely on both complex regular hypermetrical knowledge as well as irregular real-time clues to recognize when to switch meters and patterns. Our new approach envisions this as an integrated problem of learning a bilingual transduction, i.e., a structural relation between two languages—where there are different musical languages of, say, flamenco percussion versus zapateado footwork or palmas hand clapping. We apply minimum description length criteria to induce transduction grammars that simultaneously learn (1) the multiple metrical structures, (2) the hypermetrical structure that stochastically governs meter switching, and (3) the probabilistic transduction relationship between patterns of different rhythmic languages that enables musicians to predict when to switch meters and how to select patterns depending on what fellow musicians are generating.

Dekai Wu | Dekai Wu

[1] Mark Steedman. The Blues and the Abstract Truth: Music and Mental Models , 2009 .

[2] Richard Edwin Stearns,et al. Syntax-Directed Transduction , 1966, JACM.

[3] Peter Essens,et al. Perception of Temporal Patterns , 1985 .

[4] P. Desain,et al. Music, Mind, and Machine: Studies in Computer Music, Music Cognition, and Artificial Intelligence , 1992 .

[5] Elaine Chew,et al. Mimi4x: An interactive audio-visual installation for high-level structural improvisation , 2010, 2010 IEEE International Conference on Multimedia and Expo.

[6] Mark Steedman,et al. On Interpreting Bach , 1987 .

[7] Rens Bod. Stochastic models of melodic analysis: Challenging the gestalt principles , 2001 .

[8] Robert Dale,et al. Handbook of Natural Language Processing , 2001, Computational Linguistics.

[9] R. Jackendoff,et al. A Generative Theory of Tonal Music , 1985 .

[10] M. Steedman,et al. The Perception of Musical Rhythm and Metre , 1977, Perception.

[11] Shlomo Dubnov,et al. OMax brothers: a dynamic yopology of agents for improvization learning , 2006, AMCMM '06.