Effects of memory size on melody recognition in a simulation of cohort theory

This research models human performance in Dalla Bella, Peretz, and Aronoff (2003) melody recognition study. They compared performance between musicians and nonmusicians in the perception and recognition of isolated melodies. Inspired by cohort theory, they used a gating task to identify three cognitive events in the melody perception/recognition process. These events were the familiarity emergence point (FEP), the isolation point (IP), and the recognition point (RP). Dalla Bella et al. explain their results using cohort theory. In the present research, we develop a simulation based on a sequence recognition neural network (SRN) that is compatible with cohort theory. We use it to model the hypothesized cognitive processes underlying the occurrence of these events. Two separate networks with different stored memory sizes are used to model musicians and nonmusicians. Our simulations capture the qualitative pattern of results in Dalla Bella et al. and show how stored memory size may affect the melody recognition process. The IP is modeled using the core SRN that contains a winner-take-all (WTA) mechanism. The FEP and RP are modeled by introducing meta-level readout units that monitor the dynamically evolving state of the WTA network. The unit that models the FEP plays a causal role in determining the RP. Therefore, the FEP is not an epiphenomenon in our simulation. It was not previously known whether the core SRN contained enough information to allow simple meta-level readout of dynamic cognitive states. Our simulations show that such readout is in fact possible.

[1]  Aniruddh D. Patel Music, Language, and the Brain , 2007 .

[2]  D Norris,et al.  Merging information in speech recognition: Feedback is never necessary , 2000, Behavioral and Brain Sciences.

[3]  F. Grosjean,et al.  The gating paradigm: A comparison of successive and individual presentation formats , 1984 .

[4]  Treebank Penn,et al.  Linguistic Data Consortium , 1999 .

[5]  Geoffrey R. Loftus,et al.  Accounts of the confidence-accuracy relation in recognition memory , 2000, Psychonomic bulletin & review.

[6]  James L. McClelland,et al.  The TRACE model of speech perception , 1986, Cognitive Psychology.

[7]  J. Saffran,et al.  Music and Language: A Developmental Comparison , 2004 .

[8]  W. Marslen-Wilson Functional parallelism in spoken word-recognition , 1987, Cognition.

[9]  A. Koriat When confidence in a choice is independent of which choice is made , 2008, Psychonomic bulletin & review.

[10]  Daniel L. Schacter,et al.  Understanding metamemory: Neural correlates of the cognitive process and subjective level of confidence in recognition memory , 2006, NeuroImage.

[11]  J J Hopfield,et al.  Neural computation by concentrating information in time. , 1987, Proceedings of the National Academy of Sciences of the United States of America.

[12]  Klaus Frieler,et al.  Measuring melodic similarity: Human vs. algorithmic Judgments , 2004 .

[13]  Charles J Limb,et al.  Structural and functional neural correlates of music perception. , 2006, The anatomical record. Part A, Discoveries in molecular, cellular, and evolutionary biology.

[14]  A Koriat,et al.  The combined contributions of the cue-familiarity and accessibility heuristics to feelings of knowing. , 2001, Journal of experimental psychology. Learning, memory, and cognition.

[15]  Anthony S. Maida,et al.  Modeling melody recognition using a sequence recognition neural network with meta-level processes , 2009, 2009 International Joint Conference on Neural Networks.

[16]  Michael J. Fischer,et al.  The String-to-String Correction Problem , 1974, JACM.

[17]  D. Norris,et al.  Shortlist B: a Bayesian model of continuous speech recognition. , 2008, Psychological review.

[18]  I. Peretz,et al.  Time course of melody recognition: A gating paradigm study , 2003, Perception & psychophysics.

[19]  D. Pisoni,et al.  Recognizing Spoken Words: The Neighborhood Activation Model , 1998, Ear and hearing.

[20]  John J. Hopfield,et al.  Neural Architecture and Biophysics for Sequence Recognition , 1989 .

[21]  F Grosjean,et al.  Spoken word recognition processes and the gating paradigm , 1980, Perception & psychophysics.

[22]  L. Gagnon,et al.  Corpus d'extraits musicaux: indices relatifs à la familiarité, à l'âge d'acquisition et aux évocations verbales. , 1995 .

[23]  D. Norris Shortlist: a connectionist model of continuous speech recognition , 1994, Cognition.