Simultaneous Unsupervised Learning of Flamenco Metrical Structure, Hypermetrical Structure, and Multipart Structural Relations

We show how a new unsupervised approach to learning musical relationships can exploit Bayesian MAP induction of stochastic transduction grammars to overcome the challenges of learning complex relationships between multiple rhythmic parts that previously lay outside the scope of general computational approaches to music structure learning. A good illustrative genre is flamenco, which employs not only regular but also irregular hypermetrical structures that rapidly switch between 3/4 and 6/8 mediocompas blocks. Moreover, typical flamenco idioms employ heavy syncopation and sudden, misleading off-beat accents and patterns, while often elliding the downbeat accents that humans as well as existing meter-finding algorithms rely on, thus creating a high degree of listener “surprise” that makes not only the structural relations, but even the metrical structure itself, ellusive to learn. Flamenco musicians rely on both complex regular hypermetrical knowledge as well as irregular real-time clues to recognize when to switch meters and patterns. Our new approach envisions this as an integrated problem of learning a bilingual transduction, i.e., a structural relation between two languages—where there are different musical languages of, say, flamenco percussion versus zapateado footwork or palmas hand clapping. We apply minimum description length criteria to induce transduction grammars that simultaneously learn (1) the multiple metrical structures, (2) the hypermetrical structure that stochastically governs meter switching, and (3) the probabilistic transduction relationship between patterns of different rhythmic languages that enables musicians to predict when to switch meters and how to select patterns depending on what fellow musicians are generating.

[1]  Mark Steedman The Blues and the Abstract Truth: Music and Mental Models , 2009 .

[2]  Richard Edwin Stearns,et al.  Syntax-Directed Transduction , 1966, JACM.

[3]  Peter Essens,et al.  Perception of Temporal Patterns , 1985 .

[4]  P. Desain,et al.  Music, Mind, and Machine: Studies in Computer Music, Music Cognition, and Artificial Intelligence , 1992 .

[5]  Elaine Chew,et al.  Mimi4x: An interactive audio-visual installation for high-level structural improvisation , 2010, 2010 IEEE International Conference on Multimedia and Expo.

[6]  Mark Steedman,et al.  On Interpreting Bach , 1987 .

[7]  Rens Bod Stochastic models of melodic analysis: Challenging the gestalt principles , 2001 .

[8]  Robert Dale,et al.  Handbook of Natural Language Processing , 2001, Computational Linguistics.

[9]  R. Jackendoff,et al.  A Generative Theory of Tonal Music , 1985 .

[10]  M. Steedman,et al.  The Perception of Musical Rhythm and Metre , 1977, Perception.

[11]  Shlomo Dubnov,et al.  OMax brothers: a dynamic yopology of agents for improvization learning , 2006, AMCMM '06.

[12]  Godfried T. Toussaint,et al.  On the relation between rhythm complexity measures and human rhythmic performance , 2008, C3S2E '08.

[13]  Godfried T. Toussaint,et al.  El Compás Flamenco: A Phylogenetic Analysis , 2004 .

[14]  Godfried T. Toussaint,et al.  Measuring Similarity between Flamenco Rhythmic Patterns , 2009 .

[15]  D. Rubin,et al.  Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .

[16]  D. Temperley Music and probability , 2006 .

[17]  Dekai Wu,et al.  Iterative Rule Segmentation under Minimum Description Length for Unsupervised Transduction Grammar Induction , 2013, SLSP.

[18]  François Pachet,et al.  The Continuator: Musical Interaction With Style , 2003, ICMC.

[19]  Rens Bod,et al.  Memory-Based Models of Melodic Analysis: Challenging the Gestalt Principles , 2002 .

[20]  Emilia Gómez,et al.  Automatic Melodic Transcription of Flamenco Singing , 2008 .

[21]  Alfred V. Aho,et al.  The Theory of Parsing, Translation, and Compiling , 1972 .

[22]  Steve Young,et al.  Applications of stochastic context-free grammars using the Inside-Outside algorithm , 1990 .

[23]  José Miguel Díaz-Báñez,et al.  Characterization and Similarity in A Cappella Flamenco Cantes , 2010, ISMIR.

[24]  Daniel Dominic Sleator,et al.  Modeling Meter and Harmony: A Preference-Rule Approach , 1999, Computer Music Journal.

[25]  Elaine Chew,et al.  A Musical Improvisation System That Provides Visual Feedback to the Performer , 2007 .

[26]  Dekai Wu,et al.  Stochastic Inversion Transduction Grammars and Bilingual Parsing of Parallel Corpora , 1997, CL.

[27]  Dekai Wu Trainable Coarse Bilingual Grammars for Parallel Text Bracketing , 1995, VLC@ACL.

[28]  Christopher Raphael,et al.  A hybrid graphical model for rhythmic parsing , 2002, Artif. Intell..

[29]  Robert M. Keller,et al.  Machine Learning of Jazz Grammars , 2010, Computer Music Journal.

[30]  Elaine Chew,et al.  Supporting Musical Creativity with Unsupervised Syntactic Parsing , 2008, AAAI Spring Symposium: Creative Intelligent Systems.

[31]  Mark Steedman The formal description of musical perception , 1972 .

[32]  Peter Desain,et al.  On tempo tracking: Tempogram Representation and Kalman filtering , 2000, ICMC.

[33]  Joakim Nivre,et al.  Learning Stochastic Bracketing Inversion Transduction Grammars with a Cubic Time Biparsing Algorithm , 2009, IWPT.