Parallel Networks that Learn to Pronounce English Text

This paper describes NETtalk, a class of massively-parallel network systems that learn to convert English text to speech. The memory representations for pronunciations are learned by practice and are shared among many processing units. The performance of NETtalk has some similarities with observed human performance. (i) The learning follows a power law. (ii) The more words the network learns, the better it is at generalizing and correctly pronouncing new words, (iii) The performance of the network degrades very slowly as connections in the network are damaged: no single link or processing unit is essential. (iv) Relearning after damage is much faster than learning during the original training. (v) Distributed or spaced practice is more effective for long-term retention than massed practice. Network models can be constructed that have the same performance and learning characteristics on a particular task, but differ completely at the levels of synaptic strengths and single-unit responses. However, hierarchical clustering techniques applied to NETtalk reveal that these different networks have similar internal representations of letter-to-sound correspondences within groups of processing units. This suggests that invariant internal representations may be found in assemblies of neurons intermediate in size between highly localized and completely distributed representations.

[1]  S. Kaplan The Physiology of Thought , 1950 .

[2]  A. Gamba,et al.  Further experiments with PAPA , 1961 .

[3]  D. Hubel,et al.  Receptive fields, binocular interaction and functional architecture in the cat's visual cortex , 1962, The Journal of physiology.

[4]  A. A. Mullin,et al.  Principles of neurodynamics , 1962 .

[5]  L. R. Peterson,et al.  Effect of spacing presentations on retention of a paired associate over short intervals. , 1963, Journal of experimental psychology.

[6]  R. Venezky The Structure of English Orthography , 1965 .

[7]  H. C. LONGUET-HIGGINS,et al.  Holographic Model of Temporal Recall , 1968, Nature.

[8]  R. Rescorla,et al.  A theory of Pavlovian conditioning : Variations in the effectiveness of reinforcement and nonreinforcement , 1972 .

[9]  R. Venezky,et al.  Development of Two Letter-Sound Patterns in Grades One Through Three. , 1973 .

[10]  Douglas L. Hintzman,et al.  Theoretical implications of the spacing effect. , 1974 .

[11]  R. Sperber Developmental changes in effects of spacing of trials in retardate discrimination learning and memory. , 1974 .

[12]  Brian Everitt,et al.  Cluster analysis , 1974 .

[13]  R. L. Solso Theories in cognitive psychology : the Loyola symposium , 1975 .

[14]  A. Glenberg Monotonic and nonmonotonic lag effects in paired-associate and recognition memory paradigms , 1976 .

[15]  D Marr,et al.  Cooperative computation of stereo disparity. , 1976, Science.

[16]  D. L. Hintzman Repetition and Memory1 , 1976 .

[17]  V. Mountcastle,et al.  An organizing principle for cerebral function : the unit module and the distributed system , 1978 .

[18]  L. Jacoby On interpreting the effects of repetition: Solving a problem versus remembering a solution , 1978 .

[19]  Dennis H. Klatt,et al.  Software for a cascade/parallel formant synthesizer , 1980 .

[20]  A G Barto,et al.  Toward a modern theory of adaptive networks: expectation and prediction. , 1981, Psychological review.

[21]  Jerome A. Feldman,et al.  Connectionist Models and Their Properties , 1982, Cogn. Sci..

[22]  D. Norman Learning and Memory , 1982 .

[23]  John R. Anderson Acquisition of cognitive skill. , 1982 .

[24]  Stephen Grossberg,et al.  Absolute stability of global pattern formation and parallel memory storage by competitive neural networks , 1983, IEEE Transactions on Systems, Man, and Cybernetics.

[25]  Geoffrey E. Hinton,et al.  OPTIMAL PERCEPTUAL INFERENCE , 1983 .

[26]  Geoffrey E. Hinton,et al.  Massively Parallel Architectures for AI: NETL, Thistle, and Boltzmann Machines , 1983, AAAI.

[27]  Geoffrey E. Hinton,et al.  Parallel visual computation , 1983, Nature.

[28]  Robert L. Mercer,et al.  An information theoretic approach to the automatic determination of phonemic baseforms , 1984, ICASSP.

[29]  A G Barto,et al.  Learning by statistical cooperation of self-interested neuron-like computing elements. , 1985, Human neurobiology.

[30]  Geoffrey E. Hinton,et al.  A Learning Algorithm for Boltzmann Machines , 1985, Cogn. Sci..

[31]  Kenneth Ward Church Stress assignment in letter‐to‐sound rules for speech synthesis , 1985 .

[32]  Terrence J. Sejnowski,et al.  Open questions about computation in cerebral cortex , 1986 .

[33]  Geoffrey E. Hinton,et al.  Learning internal representations by error propagation , 1986 .

[34]  James L. McClelland,et al.  James L. McClelland, David Rumelhart and the PDP Research Group, Parallel distributed processing: explorations in the microstructure of cognition . Vol. 1. Foundations . Vol. 2. Psychological and biological models . Cambridge MA: M.I.T. Press, 1987. , 1989, Journal of Child Language.

[35]  J. Hopfield,et al.  Computing with neural circuits: a model. , 1986, Science.

[36]  Paul Smolensky,et al.  Information processing in dynamical systems: foundations of harmony theory , 1986 .

[37]  James L. McClelland,et al.  On learning the past-tenses of English verbs: implicit rules or parallel distributed processing , 1986 .

[38]  Eric Saund Abstraction and Representation of Continuous Variables in Connectionist Networks , 1986, AAAI.

[39]  James L. McClelland,et al.  Psychological and biological models , 1986 .

[40]  S. Kelso,et al.  Hebbian synapses in hippocampus. , 1986, Proceedings of the National Academy of Sciences of the United States of America.

[41]  S Dehaene,et al.  Spin glass model of learning by selection. , 1986, Proceedings of the National Academy of Sciences of the United States of America.

[42]  Geoffrey E. Hinton,et al.  Learning and relearning in Boltzmann machines , 1986 .

[43]  Jerome A. Feldman,et al.  Neural Representation of Conceptual Knowledge. , 1986 .

[44]  T. D. Harrison,et al.  Boltzmann machines for speech recognition , 1986 .

[45]  Geoffrey E. Hinton,et al.  Learning symmetry groups with hidden units: beyond the perceptron , 1986 .

[46]  G. Lynch,et al.  Synapses, circuits, and the beginnings of memory , 1986 .

[47]  F. H. C. Crick,et al.  Certain aspects of the anatomy and physiology of the cerebral cortex , 1986 .

[48]  Charles R. Rosenberg THE SPACING EFFECT ON NETTALK, A MASSIVELY-PARALLEL NETWORK , 1986 .

[49]  James L. McClelland,et al.  Parallel distributed processing: explorations in the microstructure of cognition, vol. 1: foundations , 1986 .

[50]  R. F. Thompson,et al.  Modeling the neural substrates of associative learning and memory: a computational approach. , 1987, Psychological review.

[51]  David B. Pisoni,et al.  Text-to-speech: the mitalk system , 1987 .

[52]  Richard F. Thompson,et al.  Modeling the Neural Substrates of Associative Learning and Memory: A Computational Approach , 1987 .

[53]  David B. Parker,et al.  A comparison of algorithms for neuron-like cells , 1987 .

[54]  G. Toulouse,et al.  Spin glass model of learning by selection , 1987 .

[55]  Elie Bienenstock,et al.  A neural network for the retrieval of superimposed connection patterns , 1987 .

[56]  Lokendra Shastri,et al.  Learned phonetic discrimination using connectionist networks , 1990, ECST.

[57]  Teuvo Kohonen,et al.  Self-Organization and Associative Memory , 1988 .

[58]  Bernard Widrow,et al.  Adaptive switching circuits , 1988 .

[59]  Terrence J. Sejnowski,et al.  NETtalk: a parallel network that learns to read aloud , 1988 .

[60]  G. Bower,et al.  From conditioning to category learning: an adaptive network model. , 1988, Journal of experimental psychology. General.

[61]  M. Karjalainen Book Reviews: From Text to Speech: The MITalk System , 1988, CL.

[62]  Terrence J. Sejnowski,et al.  A Parallel Network that Learns to Play Backgammon , 1989, Artif. Intell..

[63]  Geoffrey E. Hinton,et al.  Learning distributed representations of concepts. , 1989 .

[64]  Geoffrey E. Hinton,et al.  Parallel Models of Associative Memory , 1989 .