Exposing Parameters of a Trained Dynamic Model for Interactive Music Creation

As machine learning (ML) systems emerge in end-user applications, learning algorithms and classifiers will need to be robust to an increasingly unpredictable operating environment. In many cases, the parameters governing a learning system cannot be optimized for every user scenario, nor can users typically manipulate parameters defined in the space and terminology of ML. Conventional approaches to user-oriented ML systems have typically hidden this complexity from users by automating parameter adjustment. We propose a new paradigm, in which model and algorithm parameters are exposed directly to end-users with intuitive labels, suitable for applications where parameters cannot be automatically optimized or where there is additional motivation - such as creative flexibility - to expose, rather than fix or automatically adapt, learning parameters. In our CHI 2008 paper, we introduced and evaluated MySong, a system that uses a Hidden Markov Model to generate chords to accompany a vocal melody. The present paper formally describes the learning underlying MySong and discusses the mechanisms by which MySong's learning parameters are exposed to users, as a case study in making ML systems user-configurable. We discuss the generalizability of this approach, and propose that intuitively exposing ML parameters is a key challenge for the ML and human-computer-interaction communities.

[1]  Marisa E. Campbell CSCW 2004 , 2004, INTR.

[2]  Colin McGregor Controlling spam with SpamAssassin , 2007 .

[3]  Ian H. Witten,et al.  Data mining: practical machine learning tools and techniques, 3rd Edition , 1999 .

[4]  Eric Horvitz,et al.  BusyBody: creating and fielding personalized models of the cost of interruption , 2004, CSCW.

[5]  Gert R. G. Lanckriet,et al.  Identifying Words that are Musically Meaningful , 2007, ISMIR.

[6]  Marisa E. Campbell CHI 2004 , 2004, INTR.

[7]  Greg Linden,et al.  Amazon . com Recommendations Item-to-Item Collaborative Filtering , 2001 .

[8]  Tessa A. Lau,et al.  Automated email activity management: an unsupervised learning approach , 2005, IUI.

[9]  Ian H. Witten,et al.  Interactive machine learning: letting users build classifiers , 2002, Int. J. Hum. Comput. Stud..

[10]  Mark Dredze,et al.  Automatically classifying emails into activities , 2006, IUI '06.

[11]  Desney S. Tan,et al.  CueFlik: interactive concept learning in image search , 2008, CHI.

[12]  Gert R. G. Lanckriet,et al.  Semantic Annotation and Retrieval of Music and Sound Effects , 2008, IEEE Transactions on Audio, Speech, and Language Processing.

[13]  Thomas G. Dietterich,et al.  Toward harnessing user feedback for machine learning , 2007, IUI '07.

[14]  James Fogarty,et al.  Examining the robustness of sensor-based statistical models of human interruptibility , 2004, CHI.

[15]  Eric Horvitz,et al.  Coordinates: Probabilistic Forecasting of Presence and Availability , 2002, UAI.

[16]  Marcel Worring,et al.  Interactive Search by Direct Manipulation of Dissimilarity Space , 2007, IEEE Transactions on Multimedia.

[17]  Masayuki Numao,et al.  Music compositional intelligence with an affective flavor , 2007, IUI '07.

[18]  Jerry Alan Fails,et al.  Interactive machine learning , 2003, IUI '03.

[19]  Scott R. Klemmer,et al.  Authoring sensor-based interactions by demonstration with direct manipulation and pattern recognition , 2007, CHI.

[20]  James Fogarty,et al.  Toolkit support for developing and deploying sensor-based statistical models of human situations , 2007, CHI.

[21]  Zoltán Tóth,et al.  A Graphical User Interface for Evolutionary Algorithms , 2003, Acta Cybern..

[22]  Dan Morris,et al.  MySong: automatic accompaniment generation for vocal melodies , 2008, CHI.

[23]  J. Baker,et al.  The DRAGON system--An overview , 1975 .

[24]  Joe Tullio,et al.  How it works: a field study of non-technical users interacting with an intelligent system , 2007, CHI.

[25]  Stan Matwin,et al.  Email classification with co-training , 2011, CASCON.

[26]  Thomas G. Dietterich,et al.  A hybrid learning system for recognizing user tasks from desktop activities and email messages , 2006, IUI '06.

[27]  Sargur N. Srihari,et al.  On-Line and Off-Line Handwriting Recognition: A Comprehensive Survey , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[28]  Scott R. Klemmer,et al.  groupTime: preference based group scheduling , 2006, CHI.

[29]  Ian Witten,et al.  Data Mining , 2000 .

[30]  P. Boersma ACCURATE SHORT-TERM ANALYSIS OF THE FUNDAMENTAL FREQUENCY AND THE HARMONICS-TO-NOISE RATIO OF A SAMPLED SOUND , 1993 .