Evaluating multiple viewpoint models of tabla sequences

We describe a realtime tabla generation system based on a variable-length n-gram model trained on a large symbolic tabla database. A novel, parametric smoothing algorithm based on a family of exponential curves is introduced to control the relative weight of high- and low-order models. This technique is shown to lead to improvements over a back-off smoothing for our tabla database. We find that cross-entropy is lowest when the coefficient of the exponential curve is between 1 and 2 and increases for values outside of this optimal range. The basic n-gram model is extended to model dependencies between duration, stroke-type, and meter using cross-products in a Multiple Viewpoints (MV) framework, leading to improvements in most cases when compared with independent stroke and duration models.

[1]  James Kippen,et al.  Bol processor grammars , 1992 .

[2]  Ian H. Witten,et al.  Multiple viewpoint systems for music prediction , 1995 .

[3]  Parag Chordia,et al.  Tabla Gyan: An Artificial Tabla Improviser , 2010, ICCC.

[4]  Marcus T. Pearce,et al.  The construction and evaluation of statistical models of melodic structure in music perception and composition , 2005 .

[5]  Parag Chordia Representation and Automatic Transcription of Solo Tabla Music , 2006 .

[6]  W. Teahan,et al.  Experiments on the zero frequency problem , 1995, Proceedings DCC '95 Data Compression Conference.

[7]  Hinrich Schütze,et al.  Book Reviews: Foundations of Statistical Natural Language Processing , 1999, CL.

[8]  Gaël Richard,et al.  Supervised and Unsupervised Sequence Modelling for Drum Transcription , 2007, ISMIR.

[9]  María Herrojo Ruiz,et al.  Unsupervised statistical learning underpins computational, behavioural, and neural manifestations of musical expectation , 2010, NeuroImage.

[10]  Charles Ames,et al.  The Markov Process as a Compositional Model: A Survey and Tutorial , 2017 .

[11]  François Pachet,et al.  "The way it Sounds": timbre models for analysis and retrieval of music signals , 2005, IEEE Transactions on Multimedia.

[12]  Chris Chafe,et al.  Automatic transcription of tabla music , 2005 .

[13]  F ChenStanley,et al.  An Empirical Study of Smoothing Techniques for Language Modeling , 1996, ACL.

[14]  Ian H. Witten,et al.  The zero-frequency problem: Estimating the probabilities of novel events in adaptive text compression , 1991, IEEE Trans. Inf. Theory.

[15]  Geraint A. Wiggins,et al.  Methods for Combining Statistical Models of Music , 2004, CMMR.

[16]  Mira Balaban,et al.  Understanding music with AI: perspectives on music cognition , 1992 .

[17]  Aaron Albin,et al.  Multiple Viewpoints Modeling of Tabla Sequences , 2010, ISMIR.

[18]  M. Pearce,et al.  Sweet Anticipation : Music and the Psychology of Expectation , 2007 .