Modeling melody recognition using a sequence recognition neural network with meta-level processes

This research models human performance in the Dalla Bella, Peretz, and Aronoff melody recognition study. They compared performance between musicians and nonmusicians in the recognition (and perception) of melodies. They used a gating task to identify three events in the melody perception/recognition process. These were the familiarity emergence point (FEP), the isolation point (IP), and the recognition point (RP). We develop a simulation to model hypothesized cognitive processes underlying these events. The IP is modeled using a winner-take-all connectionist network adapted to operate with temporal input sequences. Meta-level processes examine the dynamic state of the recognition network to model the FEP and the RP.

[1]  F Grosjean,et al.  Spoken word recognition processes and the gating paradigm , 1980, Perception & psychophysics.

[2]  D Norris,et al.  Merging information in speech recognition: Feedback is never necessary , 2000, Behavioral and Brain Sciences.

[3]  R. Zatorre,et al.  Spectral and temporal processing in human auditory cortex. , 2001, Cerebral cortex.

[4]  Geoffrey R. Loftus,et al.  Accounts of the confidence-accuracy relation in recognition memory , 2000, Psychonomic bulletin & review.

[5]  Michael J. Fischer,et al.  The String-to-String Correction Problem , 1974, JACM.

[6]  Klaus Frieler,et al.  Measuring melodic similarity: Human vs. algorithmic Judgments , 2004 .

[7]  F. Grosjean,et al.  The gating paradigm: A comparison of successive and individual presentation formats , 1984 .

[8]  A. Koriat When confidence in a choice is independent of which choice is made , 2008, Psychonomic bulletin & review.

[9]  Edward T. Auer,et al.  Probabilistic Phonotactics in Spoken Word Recognition , 2008 .

[10]  Daniel L. Schacter,et al.  Understanding metamemory: Neural correlates of the cognitive process and subjective level of confidence in recognition memory , 2006, NeuroImage.

[11]  W. Marslen-Wilson Functional parallelism in spoken word-recognition , 1987, Cognition.

[12]  A Koriat,et al.  The combined contributions of the cue-familiarity and accessibility heuristics to feelings of knowing. , 2001, Journal of experimental psychology. Learning, memory, and cognition.

[13]  James L. McClelland,et al.  The TRACE model of speech perception , 1986, Cognitive Psychology.

[14]  D. Norris,et al.  Shortlist B: a Bayesian model of continuous speech recognition. , 2008, Psychological review.

[15]  I. Peretz,et al.  Time course of melody recognition: A gating paradigm study , 2003, Perception & psychophysics.

[16]  L. Gagnon,et al.  Corpus d'extraits musicaux: indices relatifs à la familiarité, à l'âge d'acquisition et aux évocations verbales. , 1995 .

[17]  D. Norris Shortlist: a connectionist model of continuous speech recognition , 1994, Cognition.

[18]  J J Hopfield,et al.  Neural computation by concentrating information in time. , 1987, Proceedings of the National Academy of Sciences of the United States of America.

[19]  Karin Humphreys,et al.  The psychology of language : from data to theory , 2001 .

[20]  Isabelle Peretz,et al.  Brain specialization for music. , 2002, The Neuroscientist : a review journal bringing neurobiology, neurology and psychiatry.

[21]  Trevor A. Harley,et al.  The psychology of language : from data to theory , 2001 .

[22]  D. Pisoni,et al.  Recognizing Spoken Words: The Neighborhood Activation Model , 1998, Ear and hearing.

[23]  Charles J Limb,et al.  Structural and functional neural correlates of music perception. , 2006, The anatomical record. Part A, Discoveries in molecular, cellular, and evolutionary biology.

[24]  A. Friederici,et al.  Musical syntax is processed in Broca's area: an MEG study , 2001, Nature Neuroscience.

[25]  I. Peretz Brain Specialization for Music , 2001, Annals of the New York Academy of Sciences.