Feature Discovery for Sequential Prediction of Monophonic Music

Learning a model for sequential prediction of symbolic music remains an open challenge. An important special case is the prediction of pitch sequences based on a corpus of monophonic music. We contribute to this line of research in two respects: (1) Our models improve the stateof-the-art performance. (2) Our method affords learning interpretable models by discovering an explicit set of relevant features. We discover features using the PULSE learning framework, which repetitively suggests new candidate features using a generative operation and selects features while optimizing the underlying model. Defining a domain-specific generative operation allows to combine multiple music-theoretically motivated features in a unified model and to control their interaction on a fine-grained level. We evaluate our models on a set of benchmark corpora of monophonic chorales and folk songs, outperforming previous work. Finally, we discuss the characteristics of the discovered features from a musicological perspective, giving concrete examples.

[1]  Tao Li,et al.  N-Gram Chord Profiles for Composer Style Representation , 2008, ISMIR.

[2]  Ian H. Witten,et al.  Multiple viewpoint systems for music prediction , 1995 .

[3]  Bernard Manderick,et al.  Global Feature Versus Event Models for Folk Song Classification , 2009, ISMIR.

[4]  Emmanuel Vincent,et al.  Multiple Pitch Transcription using DBN-based Musicological Models , 2010, ISMIR.

[5]  Ricardo Scholz,et al.  Robust modeling of musical chord sequences using probabilistic N-grams , 2009, 2009 IEEE International Conference on Acoustics, Speech and Signal Processing.

[6]  Yoram Singer,et al.  Adaptive Subgradient Methods for Online Learning and Stochastic Optimization , 2011, J. Mach. Learn. Res..

[7]  S. Koelsch,et al.  The Influence of Different Structural Features on Felt Musical Tension in Two Piano Pieces by Mozart and Mendelssohn , 2013 .

[8]  Leonard B. Meyer Emotion and Meaning in Music , 1957 .

[9]  Geraint A. Wiggins,et al.  EXPECTATION IN MELODY: THE INFLUENCE OF CONTEXT AND LEARNING , 2006 .

[10]  Chris Mellish,et al.  Statistical Learning of Harmonic Movement , 1999 .

[11]  Jean-François Paiement,et al.  Predictive models for music , 2009, Connect. Sci..

[12]  Elizabeth K. Johnson,et al.  Statistical learning of tone sequences by human infants and adults , 1999, Cognition.

[13]  Ian Cross,et al.  Statistical Properties of Tonal Harmony in Bach ’ s Chorales , 2008 .

[14]  Geraint A. Wiggins,et al.  Methods for Combining Statistical Models of Music , 2004, CMMR.

[15]  S. Koelsch,et al.  Predictive information processing in music cognition. A critical review. , 2012, International journal of psychophysiology : official journal of the International Organization of Psychophysiology.

[16]  Marc Toussaint,et al.  Temporally extended features in model-based reinforcement learning with partial observability , 2016, Neurocomputing.

[17]  M. Pearce,et al.  Sweet Anticipation : Music and the Psychology of Expectation , 2007 .

[18]  Tillman Weyde,et al.  Hybrid Long- and Short-Term Models of Folk Melodies , 2015, ISMIR.

[19]  Darrell Conklin,et al.  Music Generation from Statistical Models of Harmony , 2016 .

[20]  Thore Graepel,et al.  Comparing Feature-Based Models of Harmony , 2012 .

[21]  David Temperley,et al.  What's Key for Key? The Krumhansl-Schmuckler Key-Finding Algorithm Reconsidered , 1999 .

[22]  Panayotis Mavromatis,et al.  Minimum description length modelling of musical structure , 2009 .

[23]  Geraint A. Wiggins,et al.  Multiple Viewpoint Systems: Time Complexity and the Construction of Domains for Complex Musical Viewpoints in the Harmonization Problem , 2013 .

[24]  Christopher Ariza,et al.  Music21: A Toolkit for Computer-Aided Musicology and Symbolic Music Data , 2010, ISMIR.

[25]  Panayotis Mavromatis,et al.  A Hidden Markov Model of Melody Production in greek Church Chant , 2006 .

[26]  Marcus T. Pearce,et al.  The construction and evaluation of statistical models of melodic structure in music perception and composition , 2005 .

[27]  Tillman Weyde,et al.  Discriminative learning and inference in the Recurrent Temporal RBM for melody modelling , 2015, 2015 International Joint Conference on Neural Networks (IJCNN).

[28]  Jean-François Paiement,et al.  Probabilistic models for melodic prediction , 2009, Artif. Intell..

[29]  M. Farbood A Parametric, Temporal Model of Musical Tension , 2012 .

[30]  Martin Rohrmeier,et al.  Implicit Learning and Acquisition of Music , 2012, Top. Cogn. Sci..

[31]  Sophia Ananiadou,et al.  Stochastic Gradient Descent Training for L1-regularized Log-linear Models with Cumulative Penalty , 2009, ACL.

[32]  Geraint A. Wiggins,et al.  Improved Methods for Statistical Modelling of Monophonic Music , 2004 .

[33]  D. Conklin Multiple Viewpoint Systems for Music Classification , 2013 .

[34]  Geraint A. Wiggins,et al.  Auditory Expectation: The Information Dynamics of Music Perception and Cognition , 2012, Top. Cogn. Sci..

[35]  Andrew McCallum,et al.  Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data , 2001, ICML.

[36]  Tillman Weyde,et al.  A Distributed Model For Multiple-Viewpoint Melodic Prediction , 2013, ISMIR.

[37]  C. Krumhansl Cognitive Foundations of Musical Pitch , 1990 .