Dissociating sensory and cognitive theories of harmony perception through computational modeling

Two approaches exist for explaining harmonic expectation. The sensory approach claims that harmonic expectation is a low-level process driven by sensory responses to acoustic properties of musical sounds. Conversely, the cognitive approach describes harmonic expectation as a high-level cognitive process driven by the recognition of syntactic structure learned through experience. Many previous studies have sought to distinguish these two hypotheses, largely yielding support for the cognitive hypothesis. However, subsequent re-analysis has shown that most of these results can parsimoniously be explained by a computational model from the sensory tradition, namely Leman’s (2000) model of auditory short- term memory (Bigand, Delbé, Poulin-Charronnat, Leman, & Tillmann, 2014). In this research we re-examine the explanatory power of auditory short-term memory models, and compare them to a new model in the Information Dynamics Of Music (IDyOM) tradition, which simulates a cognitive theory of harmony perception based on statistical learning and probabilistic prediction. We test the ability of these models to predict the surprisingness of chords within chord sequences (N = 300), as reported by a sample group of university undergraduates (N = 50). In contrast to previous studies, which typically use artificial stimuli composed in a classical idiom, we use naturalistic chord sequences sampled from a large dataset of popular music. Our results show that the auditory short-term memory models have remarkably low explanatory power in this context. In contrast, the new statistical learning model predicts surprisingness ratings relatively effectively. We conclude that auditory short-term memory is insufficient to explain harmonic expectation, and that cognitive processes of statistical learning and probabilistic prediction provide a viable alternative.

[1]  F. Lerdahl,et al.  Perception of musical tension in short chord sequences: The influence of harmonic function, sensory dissonance, horizontal motion, and musical training , 1996, Perception & psychophysics.

[2]  M. Pearce Statistical learning and probabilistic prediction in music cognition: mechanisms of stylistic enculturation , 2018, Annals of the New York Academy of Sciences.

[3]  R. Shepard 11 – Structural Representations of Musical Pitch , 1982 .

[4]  Marcus T. Pearce,et al.  An Energy-based Generative Sequence Model for Testing Sensory Theories of Western Harmony , 2018, ISMIR.

[5]  Maximos A. Kaliakatsos-Papakostas,et al.  An Idiom-independent Representation of Chords for Computational Music Analysis and Generation , 2014, ICMC.

[6]  Stephen McAdams,et al.  Simulating melodic and harmonic expectations for tonal cadences using probabilistic models , 2018 .

[7]  Daniel Shanahan,et al.  The Use of Large Corpora to Train a New Type of Key-Finding Algorithm: An Improved Treatment of the Minor Mode , 2013 .

[8]  M. Leman An auditory model of the role of short-term memory in probe-tone ratings. , 2000 .

[9]  Daniel Müllensiefen,et al.  The Musicality of Non-Musicians: An Index for Assessing Musical Sophistication in the General Population , 2014, PloS one.

[10]  Geraint A. Wiggins,et al.  The Prediction of Merged Attributes with Multiple Viewpoint Systems , 2016 .

[11]  P. Janata,et al.  A combined model of sensory and cognitive representations underlying tonal expectations in music: from audio signals to behavior. , 2014, Psychological review.

[12]  A. Milne,et al.  Empirically testing Tonnetz, voice-leading, and spectral models of perceived triadic distance , 2016 .

[13]  Robin C. Laney,et al.  A Spectral Pitch Class Model of the Probe Tone Data and Scalic Tonality , 2015 .

[14]  Marc Leman,et al.  The Cortical Topography of Tonal Structures Underlying Western Music , 2002, Science.

[15]  Darrell Conklin,et al.  Music Generation from Statistical Models of Harmony , 2016 .

[16]  Barbara Tillmann,et al.  Empirical evidence for musical syntax processing? Computer simulations reveal the contribution of auditory short-term memory , 2014, Front. Syst. Neurosci..

[17]  Robin C. Laney,et al.  Modelling the similarity of pitch collections with expectation tensors , 2011 .

[18]  Marcus T. Pearce,et al.  The construction and evaluation of statistical models of melodic structure in music perception and composition , 2005 .