Mapping between dynamic markings and performed loudness: a machine learning approach

Loudness variation is one of the foremost tools for expressivity in music performance. Loudness is frequently notated as dynamic markings such as (piano, meaning soft) or (forte, meaning loud). While dynamic markings in music scores are important indicators of how music pieces should be interpreted, their meaning is less straightforward than it may seem, and depends highly on the context in which they appear. In this article, we investigate the relationship between dynamic markings in the score and performed loudness by applying machine learning techniques – decision trees, support vector machines, artificial neural networks, and a k-nearest neighbor method – to the prediction of loudness levels corresponding to dynamic markings, and to the classification of dynamic markings given loudness values. The methods are applied to 44 recordings of performances of Chopin's Mazurkas, each by 8 pianists. The results show that loudness values and markings can be predicted relatively well when trained across recordings of the same piece, but fail dismally when trained across the pianist's recordings of other pieces, demonstrating that score features may trump individual style when modeling loudness choices. Evidence suggests that all the features chosen for the task are relevant, and analysis of the results reveals the forms (such as the return of the theme) and structures (such as dynamic-marking repetitions) that influence the predictability of loudness and markings. Modeling of loudness trends in expressive performance appears to be a delicate matter, and sometimes loudness expression can be a matter of the performer's idiosyncracy.

[1]  Florian Krebs,et al.  An Assessment of Learned Score Features for Modeling Expressive Dynamics in Music , 2014, IEEE Transactions on Multimedia.

[2]  Satoru Fukayama,et al.  Statistical Approach to Automatic Expressive Rendition of Polyphonic Piano Music , 2013, Guide to Computing for Expressive Music Performance.

[3]  Craig Stuart Sapp Hybrid Numeric/Rank Similarity Metrics for Musical Performance Analysis , 2008, ISMIR.

[4]  B. Repp A microcosm of musical expression. I. Quantitative analysis of pianists' timing in the initial measures of Chopin's Etude in E major. , 1998, The Journal of the Acoustical Society of America.

[5]  Giovanni De Poli Expressiveness in music performance , 2006 .

[6]  Maria Mifsud Computational modelling of expressive music performance , 2017 .

[7]  B H Repp,et al.  A microcosm of musical expression: II. Quantitative analysis of pianists' dynamics in the initial measures of Chopin's Etude in E major. , 1999, The Journal of the Acoustical Society of America.

[8]  Ian H. Witten,et al.  The WEKA data mining software: an update , 2009, SKDD.

[9]  Gerhard Widmer,et al.  YQX Plays Chopin , 2009, AI Mag..

[10]  Elaine Chew,et al.  Practical Implications of Dynamic Markings in the Score: Is Piano Always Piano? , 2014, Semantic Audio.

[11]  Ashutosh Kumar Singh,et al.  The Elements of Statistical Learning: Data Mining, Inference, and Prediction , 2010 .

[12]  Gerhard Widmer,et al.  Computational Models of Expressive Music Performance: The State of the Art , 2004 .

[13]  N. Cristianini,et al.  An Introduction to Support Vector Machines and Other Kernel-based Learning Methods: Kernel-Induced Feature Spaces , 2000 .

[14]  Emery Schubert,et al.  Expressiveness in music performance : empirical approaches across styles and cultures , 2014 .

[15]  Yves Chauvin,et al.  Backpropagation: theory, architectures, and applications , 1995 .

[16]  Miguel Molina-Solana,et al.  Transcribing Debussy's Syrinx dynamics through Linguistic Description: The MUDELD algorithm , 2016, Fuzzy Sets Syst..

[17]  Heekuck Oh,et al.  Neural Networks for Pattern Recognition , 1993, Adv. Comput..

[18]  Marko Robnik-Sikonja,et al.  An adaptation of Relief for attribute estimation in regression , 1997, ICML.

[19]  Peter Grosche,et al.  High resolution audio synchronization using chroma onset features , 2009, 2009 IEEE International Conference on Acoustics, Speech and Signal Processing.

[20]  J. Ross Quinlan,et al.  Induction of Decision Trees , 1986, Machine Learning.

[21]  Nello Cristianini,et al.  An Introduction to Support Vector Machines and Other Kernel-based Learning Methods , 2000 .

[22]  Yves Chauvin,et al.  Back-Propagation: Theory, Architecture, and Applications , 1995 .