Reinforcement Learning and the Creative, Automated Music Improviser

Automated creativity, giving a machine the ability to originate meaningful new concepts and ideas, is a significant challenge. Machine learning models make advances in this direction but are typically limited to reproducing already known material. Self-motivated reinforcement learning models present new possibilities in computational creativity, conceptually mimicking human learning to enable automated discovery of interesting or surprising patterns. This work describes a musical intrinsically motivated reinforcement learning model, built on adaptive resonance theory algorithms, towards the goal of producing humanly valuable creative music. The capabilities of the prototype system are examined through a series of short, promising compositions, revealing an extreme sensitivity to feature selection and parameter settings, and the need for further development of hierarchical models.

[1]  Ellen Campana,et al.  A Dynamic Bayesian Approach to Computational Laban Shape Quality Analysis , 2009, Adv. Hum. Comput. Interact..

[2]  Petr Sosnin Means of Question-Answer Interaction for Collaborative Development Activity , 2009, Adv. Hum. Comput. Interact..

[3]  Benjamin D. Smith,et al.  The Self-Supervising Machine , 2011, NIME.

[4]  M. Boden The creative mind : myths & mechanisms , 1991 .

[5]  Jürgen Schmidhuber,et al.  Driven by Compression Progress: A Simple Principle Explains Essential Aspects of Subjective Beauty, Novelty, Surprise, Interestingness, Attention, Curiosity, Creativity, Art, Science, Music, Jokes , 2008, ABiALS.

[6]  Stuart J. Russell,et al.  Dynamic bayesian networks: representation, inference and learning , 2002 .

[7]  L. Lunsky Contemporary Approaches to Creative Thinking. , 1963 .

[8]  Marcus T. Pearce,et al.  The construction and evaluation of statistical models of melodic structure in music perception and composition , 2005 .

[9]  Jon McCormack,et al.  Open Problems in Evolutionary Music and Art , 2005, EvoWorkshops.

[10]  Stephen Grossberg,et al.  Fuzzy ART: Fast stable learning and categorization of analog patterns by an adaptive resonance system , 1991, Neural Networks.

[11]  Robert O. Gjerdingen,et al.  Categorization of Musical Patterns by Self-Organizing Neuronlike Networks , 1990 .

[12]  Jeffrey S Bowers,et al.  Contrasting five different theories of letter position coding: evidence from orthographic similarity effects. , 2006, Journal of experimental psychology. Human perception and performance.

[13]  Martin V. Butz,et al.  Anticipatory Behavior in Adaptive Learning Systems , 2003, Lecture Notes in Computer Science.