A Distributed Model For Multiple-Viewpoint Melodic Prediction

The analysis of sequences is important for extracting information from music owing to its fundamentally temporal nature. In this paper, we present a distributed model based on the Restricted Boltzmann Machine (RBM) for melodic sequences. The model is similar to a previous successful neural network model for natural language [2]. It is first trained to predict the next pitch in a given pitch sequence, and then extended to also make use of information in sequences of note-durations in monophonic melodies on the same task. In doing so, we also propose an efficient way of representing this additional information that takes advantage of the RBM’s structure. In our evaluation, this RBM-based prediction model performs slightly better than previously evaluated n-gram models in most cases. Results on a corpus of chorale and folk melodies showed that it is able to make use of information present in longer contexts more effectively than n-gram models, while scaling linearly in the number of free parameters required.

[1]  Ian H. Witten,et al.  Multiple viewpoint systems for music prediction , 1995 .

[2]  Fu Jie Huang,et al.  A Tutorial on Energy-Based Learning , 2006 .

[3]  Geraint A. Wiggins,et al.  Harmonising Melodies: Why Do We Add the Bass Line First? , 2013, ICCC.

[4]  Charles Ames,et al.  The Markov Process as a Compositional Model: A Survey and Tutorial , 2017 .

[5]  John A. Biles,et al.  GenJam: A Genetic Algorithm for Generating Jazz Solos , 1994, ICMC.

[6]  Hinrich Schütze,et al.  Book Reviews: Foundations of Statistical Natural Language Processing , 1999, CL.

[7]  James L. McClelland,et al.  Parallel distributed processing: explorations in the microstructure of cognition, vol. 1: foundations , 1986 .

[8]  Yoshua Bengio,et al.  Classification using discriminative restricted Boltzmann machines , 2008, ICML '08.

[9]  Geraint A. Wiggins,et al.  Improved Methods for Statistical Modelling of Monophonic Music , 2004 .

[10]  Geoffrey E. Hinton Training Products of Experts by Minimizing Contrastive Divergence , 2002, Neural Computation.

[11]  Marcus T. Pearce,et al.  The construction and evaluation of statistical models of melodic structure in music perception and composition , 2005 .

[12]  Paul Smolensky,et al.  Information processing in dynamical systems: foundations of harmony theory , 1986 .

[13]  François Pachet,et al.  The Continuator: Musical Interaction With Style , 2003, ICMC.

[14]  D. Conklin Multiple Viewpoint Systems for Music Classification , 2013 .

[15]  Gautham J. Mysore,et al.  Evaluation of a Score-informed Source Separation System , 2010, ISMIR.

[16]  Jason Weston,et al.  Natural Language Processing (Almost) from Scratch , 2011, J. Mach. Learn. Res..

[17]  Peter M. Todd,et al.  Connectionist Music Composition Based on Melodic, Stylistic, and Psychophysical Constraints , 2003 .

[18]  Sang Joon Kim,et al.  A Mathematical Theory of Communication , 2006 .

[19]  Michael C Mozer,et al.  Connectionist Music Composition Based on Melodic, Stylistic, and Psychophysical Constraints ; CU-CS-495-90 , 1990 .

[20]  Yoshua Bengio,et al.  A Neural Probabilistic Language Model , 2003, J. Mach. Learn. Res..

[21]  Petri Toiviainen,et al.  MIR In Matlab: The MIDI Toolbox , 2004, ISMIR.

[22]  Geoffrey E. Hinton,et al.  A Scalable Hierarchical Distributed Language Model , 2008, NIPS.

[23]  Yee Whye Teh,et al.  A Fast Learning Algorithm for Deep Belief Nets , 2006, Neural Computation.

[24]  Robert M. Keller,et al.  A Grammatical Approach to Automatic Improvisation , 2007 .

[25]  Tijmen Tieleman,et al.  Training restricted Boltzmann machines using approximations to the likelihood gradient , 2008, ICML '08.

[26]  Peter Swire,et al.  Learning to Create Jazz Melodies Using Deep Belief Nets , 2010, ICCC.

[27]  Amos J. Storkey,et al.  Comparing Probabilistic Models for Melodic Sequences , 2011, ECML/PKDD.