A Probabilistic Model of Melody Perception

This study presents a probabilistic model of melody perception, which infers the key of a melody and also judges the probability of the melody itself. The model uses Bayesian reasoning: For any "surface" pattern and underlying "structure," we can infer the structure maximizing P(structure|surface) based on knowledge of P(surface, structure). The probability of the surface can then be calculated as ∑ P(surface, structure), summed over all structures. In this case, the surface is a pattern of notes; the structure is a key. A generative model is proposed, based on three principles: (a) melodies tend to remain within a narrow pitch range; (b) note-to-note intervals within a melody tend to be small; and (c) notes tend to conform to a distribution (or key profile) that depends on the key. The model is tested in three ways. First, it is tested on its ability to identify the keys of a set of folksong melodies. Second, it is tested on a melodic expectation task in which it must judge the probability of different notes occurring given a prior context; these judgments are compared with perception data from a melodic expectation experiment. Finally, the model is tested on its ability to detect incorrect notes in melodies by assigning them lower probabilities than the original versions.

[1]  Eugene Narmour,et al.  The Analysis and Cognition of Basic Melodic Structures: The Implication-Realization Model , 1990 .

[2]  C. Krumhansl Music Psychology and Music Theory: Problems and Prospects , 1995 .

[3]  Alistair Mutch Structure and information , 2008 .

[4]  Amos Storkey,et al.  Advances in Neural Information Processing Systems 20 , 2007 .

[5]  J. Sloboda,et al.  Perception And Cognition Of Music , 2004 .

[6]  Elizabeth K. Johnson,et al.  Statistical learning of tone sequences by human infants and adults , 1999, Cognition.

[7]  Edward E. Smith,et al.  An Invitation to cognitive science , 1997 .

[8]  Ian H. Witten,et al.  Multiple viewpoint systems for music prediction , 1995 .

[9]  Peter Desain,et al.  On tempo tracking: Tempogram Representation and Kalman filtering , 2000, ICMC.

[10]  Kunio Kashino,et al.  Application of the Bayesian probability network to music scene analysis , 1998 .

[11]  Alison Gopnik,et al.  Children's causal inferences from indirect evidence: Backwards blocking and Bayesian reasoning in preschoolers , 2004, Cogn. Sci..

[12]  D. Temperley Bayesian Models of Musical Structure and Cognition , 2004 .

[13]  Mark D. Plumbley,et al.  Polyphonic transcription by non-negative sparse coding of power spectra , 2004, ISMIR.

[14]  Cheryl Olman,et al.  Classification objects, ideal observers & generative models , 2004, Cogn. Sci..

[15]  Rens Bod,et al.  Memory-Based Models of Melodic Analysis: Challenging the Gestalt Principles , 2002 .

[16]  A. Unyk,et al.  The influence of expectancy on melodic perception , 1987 .

[17]  Diana Deutsch,et al.  THE PROCESSING OF PITCH COMBINATIONS , 1999 .

[18]  William H. Press,et al.  The Art of Scientific Computing Second Edition , 1998 .

[19]  M. Jones,et al.  Musical and Temporal Influences on Key Discovery , 1994 .

[20]  W. Dowling Emotion and Meaning in Music , 2008 .

[21]  Joshua B. Tenenbaum,et al.  Bayesian Modeling of Human Concept Learning , 1998, NIPS.

[22]  L. Cuddy,et al.  Expectancies generated by melodic intervals: Perceptual judgments of melodic continuity , 1995, Perception & psychophysics.

[23]  Paul T. von Hippel,et al.  Why Do Skips Precede Reversals? The Effect of Tessitura on Melodic Structure , 2000 .

[24]  W. Press,et al.  Numerical Recipes in Fortran: The Art of Scientific Computing.@@@Numerical Recipes in C: The Art of Scientific Computing. , 1994 .

[25]  Kaare Brandt Petersen,et al.  The Matrix Cookbook , 2006 .

[26]  Ali Taylan Cemgil,et al.  Monte Carlo Methods for Tempo Tracking and Rhythm Quantization , 2011, J. Artif. Intell. Res..

[27]  C. Krumhansl,et al.  Melodic Expectation in Finnish Spiritual Folk Hymns: Convergence of Statistical, Behavioral, and Computational Approaches , 1999 .

[28]  M K Tanenhaus,et al.  A constraint-based lexicalist account of the subject/object attachment preference , 1994, Journal of psycholinguistic research.

[29]  A. Iserles Numerical recipes in C—the art of scientific computing , by W. H. Press, B. P. Flannery, S. A. Teukolsky and W. T. Vetterling. Pp 735. £27·50. 1988. ISBN 0-521-35465-X (Cambridge University Press) , 1989, The Mathematical Gazette.

[30]  Paul T. von Hippel,et al.  Redefining Pitch Proximity: Tessitura and Mobility as Constraints on Melodic Intervals , 2000 .

[31]  Virginia Teller Review of Speech and language processing: an introduction to natural language processing, computational linguistics, and speech recognition by Daniel Jurafsky and James H. Martin. Prentice Hall 2000. , 2000 .

[32]  John A. Sloboda,et al.  The effect of item position on the likelihood of identification by inference in prose reading and music reading , 1976 .

[33]  J. Youngblood Style as Information , 1958 .

[34]  Maryellen C. MacDonald,et al.  The lexical nature of syntactic ambiguity resolution , 1994 .

[35]  David Temperley,et al.  A Bayesian Approach to Key-Finding , 2002, ICMAI.

[36]  G. Miller,et al.  Cognitive science. , 1981, Science.

[37]  William Eastman Lake,et al.  Melodic perception and cognition : the influence of tonality , 1987 .

[38]  Piet G. Vos,et al.  A parallel-processing key-finding model , 1996 .

[39]  E. Schellenberg,et al.  Simplifying the Implication-Realization Model of Melodic Expectancy , 1997 .

[40]  Ramon Fuller Structure and Information in Webern's Symphonie, Op. 21 , 1967 .

[41]  Stephen McAdams,et al.  Musical Forces and Melodic Expectations: Comparing Computer Models and Experimental Results , 2004 .

[42]  M. Schmuckler Expectation in music: Investigation of melodic and harmonic processes. , 1989 .

[43]  R. Jackendoff,et al.  A Generative Theory of Tonal Music , 1985 .

[44]  Keith D. Martin,et al.  A Blackboard System for Automatic Transcription of Simple Polyphonic Music , 1996 .

[45]  C. Krumhansl,et al.  Tracing the dynamic changes in perceived tonal organization in a spatial representation of musical keys. , 1982, Psychological review.

[46]  Mark Steedman,et al.  On Interpreting Bach , 1987 .

[47]  James C. Carlsen Some factors which influence melodic expectancy , 1981 .

[48]  F. A. Seiler,et al.  Numerical Recipes in C: The Art of Scientific Computing , 1989 .

[49]  Chris Mellish,et al.  Statistical Learning of Harmonic Movement , 1999 .

[50]  James H. Martin,et al.  Speech and language processing: an introduction to natural language processing, computational linguistics, and speech recognition, 2nd Edition , 2000, Prentice Hall series in artificial intelligence.

[51]  L. Cuddy,et al.  Expectancies generated by melodic intervals: Evaluation of principles of melodic implication in a melody-completion task , 1997, Perception & psychophysics.

[52]  G. A. Miller,et al.  The Trill Threshold , 1950 .

[53]  Christopher Raphael,et al.  Automatic Transcription of Piano Music , 2002, ISMIR.

[54]  Steve L Arson Musical Forces and Melodic Expectations: Comparing Computer Models and Experimental Results , 2004 .

[55]  Mayumi Adachi,et al.  Expectancy in melody: tests of children and adults. , 2002, Journal of experimental psychology. General.

[56]  Guy J. Brown,et al.  A blackboard architecture for computational auditory scene analysis , 1999, Speech Commun..

[57]  Anssi Klapuri,et al.  Automatic Music Transcription as We Know it Today , 2004 .

[58]  Hinrich Schütze,et al.  Book Reviews: Foundations of Statistical Natural Language Processing , 1999, CL.

[59]  Carl Schachter,et al.  Harmony And Voice Leading , 1978 .

[60]  M. Jones,et al.  Temporal Aspects of Stimulus-Driven Attending in Dynamic Arrays , 2002, Psychological science.

[61]  Schellenberg Eg Expectancy in melody: tests of the implication-realization model , 1996 .

[62]  Joel E. Cohen,et al.  Information theory and music , 2007 .

[63]  Christopher Raphael,et al.  Functional Harmonic Analysis Using Probabilistic Models , 2004, Computer Music Journal.

[64]  Robert O. Gjerdingen,et al.  The Cognition of Basic Musical Structures , 2004 .

[65]  Mark D. Plumbley,et al.  Polyphonic music transcription by non-negative sparse coding of power spectra , 2004 .

[66]  Jason Eisner,et al.  Discovering syntactic deep structure via Bayesian statistics , 2002, Cogn. Sci..

[67]  Mark B. Sandler,et al.  Techniques for Automatic Music Transcription , 2000, ISMIR.

[68]  Dirk-Jan Povel,et al.  Exploring the elementary harmonic forces in the tonal system , 1996 .

[69]  E. Schellenberg,et al.  Expectancy in melody: tests of the implication-realization model , 1996, Cognition.

[70]  Christopher Raphael,et al.  A hybrid graphical model for rhythmic parsing , 2002, Artif. Intell..