Learning to pronounce written words : a study in inductive language learning

ion by compression, 23, 144 levels of, 10, 12, 16, 46, 47 activation, 29 propagation, 29 alphabet orthographic, 179 phonemic, 179 alphabetic writing system, 2 analogical modelling, 10 analogy, 2 analogy principle, 2 arbiter model, 151 module, 148, 151 arcing, 150 ART/S, 113–116 experiments with, 115 articulatory features, 48, 180 artificial data, 155 artificial neural networks, 28 associative relation, 7 back-propagation, 28–30, 185 bagging, 150 benchmark problem, 166 bias class, 56 boosting, 150 BP, 28, 140 C4.5, 35, 39–41, 142 case-based reasoning, 21 CELEX, 26, 50 Charivarius, 14 chunk-based word pronunciation, 167 classification, 8 cluster in instance space, 156 clustering, 13 co-articulation, 44 coda, 49 combiner model, 150 module, 147, 150 compounding, 44 compression, 23, 24, 37, 38 connection, 29 weight, 29 connectionism, 28 consonant, 49 corpus-based linguistics, 2 correspondence, 2 data characteristics, 153–165 DC, 56 decision tree learning, 39 decision trees, 37 induction of, 37

[1]  Ivan Bratko,et al.  Machine learning in artificial intelligence , 1993, Artif. Intell. Eng..

[2]  James A. Anderson,et al.  Neurocomputing: Foundations of Research , 1988 .

[3]  Donald E. Knuth,et al.  The art of computer programming, volume 3: (2nd ed.) sorting and searching , 1998 .

[4]  Josef Kittler,et al.  Pattern recognition : a statistical approach , 1982 .

[5]  J. Rissanen A UNIVERSAL PRIOR FOR INTEGERS AND ESTIMATION BY MINIMUM DESCRIPTION LENGTH , 1983 .

[6]  L. Katz,et al.  The reading process is different for different orthographies : the orthographic depth hypothesis , 1992 .

[7]  Daniel Jones,et al.  English Pronouncing Dictionary , 1917 .

[8]  Walter Daelemans,et al.  Measuring the Complexity of Writing Systems , 1994, J. Quant. Linguistics.

[9]  S. Harnad Metaphor and Mental Duality , 2019, Language, Mind, and Brain.

[10]  M. S. Hunnicutt,et al.  Phonological Rules For A Text To Speech Sytem , 1979, ACL Microfiche Series 1-83, Including Computational Linguistics.

[11]  Christian Lebiere,et al.  The Cascade-Correlation Learning Architecture , 1989, NIPS.

[12]  David B. Pisoni,et al.  Text-to-speech: the mitalk system , 1987 .

[13]  Gavin Burnage Celex-a guide for users , 1990 .

[14]  Akira Nakanishi,et al.  Writing Systems of the World , 1980 .

[15]  G. Miller,et al.  Linguistic theory and psychological reality , 1982 .

[16]  Yves Lepage,et al.  Saussurian analogy: a theoretical account and its application , 1996, COLING.

[17]  Robert C. Holte,et al.  Concept Learning and the Problem of Small Disjuncts , 1989, IJCAI.

[18]  Maria Wolters,et al.  A Dual Route Neural Net Approach to Grapheme-to-Phoneme Conversion , 1996, ICANN.

[19]  Walter Daelemans,et al.  A Neural Network for Hyphenation , 1992 .

[20]  E. Rosch,et al.  Family resemblances: Studies in the internal structure of categories , 1975, Cognitive Psychology.

[21]  Walter Daelemans,et al.  Abstraction Considered Harmful : Lazy Learning of Language Processing , 1996 .

[22]  Kai Ming Ting,et al.  Model Combination in the Multiple-Data-Batches Scenario , 1997, ECML.

[23]  Yoav Freund,et al.  Experiments with a New Boosting Algorithm , 1996, ICML.

[24]  F ROSENBLATT,et al.  The perceptron: a probabilistic model for information storage and organization in the brain. , 1958, Psychological review.

[25]  H. Gardner,et al.  Language and Learning: The Debate between Jean Piaget and Noam Chomsky , 1983 .

[26]  Sally Andrews,et al.  Frequency and neighborhood effects on lexical access: Lexical similarity or orthographic redundancy? , 1992 .

[27]  David S. Touretzky,et al.  A Connectionist Learning Approach to Analyzing Linguistic Stress , 1991, NIPS.

[28]  David W. Aha,et al.  Generalizing from Case studies: A Case Study , 1992, ML.

[29]  Peter E. Hart,et al.  Nearest neighbor pattern classification , 1967, IEEE Trans. Inf. Theory.

[30]  R. Glushko The Organization and Activation of Orthographic Knowledge in Reading Aloud. , 1979 .

[31]  Laurie Bauer,et al.  English Word-Formation: Frontmatter , 1983 .

[32]  Thomas G. Dietterich,et al.  Readings in Machine Learning , 1991 .

[33]  Thomas G. Dietterich,et al.  A study of distance-based machine learning algorithms , 1994 .

[34]  D. Wolpert On Overfitting Avoidance as Bias , 1993 .

[35]  Thomas G. Dietterich,et al.  Solving Multiclass Learning Problems via Error-Correcting Output Codes , 1994, J. Artif. Intell. Res..

[36]  Walter Daelemans,et al.  SHOE: The extraction of hierarchical structure for machine learning of natural language , 1991 .

[37]  M. Coltheart Lexical access in simple reading tasks , 1978 .

[38]  Leo Breiman,et al.  Bagging Predictors , 1996, Machine Learning.

[39]  Walter Daelemans,et al.  The Acquisition of Stress: A Data-Oriented Approach , 1994, Comput. Linguistics.

[40]  Royal Skousen,et al.  Analogical Modeling Of Language , 1989 .

[41]  R. Venezky The Structure of English Orthography , 1965 .

[42]  J. Zwart The Minimalist Program , 1998, Journal of Linguistics.

[43]  Mark S. Seidenberg,et al.  The basis of consistency effects in word naming , 1990 .

[44]  B. Ripley,et al.  Pattern Recognition , 1968, Nature.

[45]  William W. Cohen Fast Effective Rule Induction , 1995, ICML.

[46]  Saso Dzeroski,et al.  ILPNET Repositories on WWW: Inductive Logic Programming Systems, Datasets and Bibliography , 1996, AI Commun..

[47]  Terrence J. Sejnowski,et al.  Parallel Networks that Learn to Pronounce English Text , 1987, Complex Syst..

[48]  Dennis L. Wilson,et al.  Asymptotic Properties of Nearest Neighbor Rules Using Edited Data , 1972, IEEE Trans. Syst. Man Cybern..

[49]  M. Pazzani,et al.  Learning probabilistic relational concept descriptions , 1996 .

[50]  Catherine Blake,et al.  UCI Repository of machine learning databases , 1998 .

[51]  Cullen Schaffer,et al.  A Conservation Law for Generalization Performance , 1994, ICML.

[52]  Antal van den Bosch,et al.  A Connectionist Model for Bootstrap Learning of Syllabic Structure. , 1998 .

[53]  Anne Cutler,et al.  The role of strong syllables in segmentation for lexical access , 1988 .

[54]  Amanda J. C. Sharkey,et al.  Modularity, Combining and Artificial Neural Nets , 1997, Connect. Sci..

[55]  Walter Daelemans,et al.  Language-Independent Data-Oriented Grapheme-to-Phoneme Conversion , 1996 .

[56]  Walter Daelemans,et al.  Experience-driven language acquisition and processing , 1996 .

[57]  Walter Daelemans,et al.  Unsupervised Discovery of Phonological Categories through Supervised Learning of Morphological Rules , 1996, COLING.

[58]  François Yvon Prononcer par analogie : motivation, formalisation et evaluation , 1996 .

[59]  W. R. Garner Concept Learning: An Information- Processing Problem , 1964 .

[60]  Lutz Prechelt,et al.  A Set of Neural Network Benchmark Problems and Benchmarking Rules , 1994 .

[61]  Michael Don Palmer,et al.  Reflections on language , 1977 .

[62]  Kenneth Ward Church,et al.  Complexity, Two-Level Morphology and Finnish , 1988, COLING.

[63]  Walter Daelemans,et al.  Generalization performance of backpropagation learning on a syllabification task , 1992 .

[64]  Yunheng Ji MORPHOLOGY , 1937, A Grammar of Italian Sign Language (LIS).

[65]  Pat Langley,et al.  Elements of Machine Learning , 1995 .

[66]  David W. Aha,et al.  Incremental Constructive Induction: An Instance-Based Approach , 1991, ML.

[67]  Charles X. Ling,et al.  Learning the Past Tense of English Verbs: The Symbolic Pattern Associator vs. Connectionist Models , 1993, J. Artif. Intell. Res..

[68]  Ron Kohavi,et al.  Irrelevant Features and the Subset Selection Problem , 1994, ICML.

[69]  Walter Daelemans,et al.  Skousen's analogical modeling algorithm: a comparison with lazy learning , 1997 .

[70]  Ron Kohavi,et al.  Lazy Decision Trees , 1996, AAAI/IAAI, Vol. 1.

[71]  Steven Bird,et al.  Computational phonology: A constraint-based approach , 1995, CL.

[72]  Andrew R. Golding Pronouncing names by a combination of rule-based and case-based reasoning , 1992 .

[73]  Royal Skousen,et al.  Real-Time Morphology: Symbolic Rules or Analogical Networks? , 1989 .

[74]  Kenneth Ward Church,et al.  Morphology and rhyming: two powerful alternatives to letter-to-sound rules for speech synthesis , 1990, SSW.

[75]  John R. Anderson,et al.  MACHINE LEARNING An Artificial Intelligence Approach , 2009 .

[76]  C. Lee Giles,et al.  What Size Neural Network Gives Optimal Generalization? Convergence Properties of Backpropagation , 1998 .

[77]  Russell Greiner,et al.  Computational learning theory and natural learning systems , 1997 .

[78]  Geoffrey Sampson,et al.  Writing Systems: A Linguistic Introduction , 1986 .

[79]  Walter Daelemans,et al.  A feature-relevance heuristic for indexing and compressing large case bases , 1997 .

[80]  Geoffrey E. Hinton,et al.  Adaptive Mixtures of Local Experts , 1991, Neural Computation.

[81]  David W. Aha,et al.  Learning Representative Exemplars of Concepts: An Initial Case Study , 1987 .

[82]  Kurt Hornik,et al.  Multilayer feedforward networks are universal approximators , 1989, Neural Networks.

[83]  Walter Daelemans,et al.  Linguistic pattern matching capabilities of connectionist networks , 1992 .

[84]  Halbert White,et al.  Connectionist nonparametric regression: Multilayer feedforward networks can learn arbitrary mappings , 1990, Neural Networks.

[85]  Steven L. Salzberg On Comparing Classifiers: A Critique of Current Research and Methods , 1999 .

[86]  Ton Van der Wouden,et al.  CELEX: Building a Multifunctional, Polytheoretical Lexical Database , 1988 .

[87]  Walter Daelemansz,et al.  Learnability and Markedness: Dutch Stress Assignment , 1993 .

[88]  Michael Gasser,et al.  Networks that Learn about Phonological Feature Persistence , 1990 .

[89]  Richard Sproat,et al.  Morphology and computation , 1992 .

[90]  G. Lugosi,et al.  Strong Universal Consistency of Neural Network Classifiers , 1993, Proceedings. IEEE International Symposium on Information Theory.

[91]  Salvatore J. Stolfo,et al.  A Comparative Evaluation of Voting and Meta-learning on Partitioned Data , 1995, ICML.

[92]  Jaime G. Carbonell,et al.  Introduction: Paradigms for Machine Learning , 1989, Artif. Intell..

[93]  Robert I. Damper,et al.  A recurrent network that learns to pronounce English text , 1996, Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96.

[94]  Sholom M. Weiss,et al.  Computer Systems That Learn , 1990 .

[95]  Jianping Zhang,et al.  Selecting Typical Instances in Instance-Based Learning , 1992, ML.

[96]  Foster J. Provost,et al.  Small Disjuncts in Action: Learning to Diagnose Errors in the Local Loop of the Telephone Network , 1993, ICML.

[97]  Jean Voisin,et al.  An application of the multiedit-condensing technique to the reference selection problem in a print recognition system , 1987, Pattern Recognit..

[98]  Wendy G. Lehnert,et al.  Case-based Problem Solving with a Large Knowledge Base of Learned Cases , 1987, AAAI.

[99]  P. Werbos,et al.  Beyond Regression : "New Tools for Prediction and Analysis in the Behavioral Sciences , 1974 .

[100]  Steven Bird Introduction to computational phonology , 1994 .

[101]  A. Prince,et al.  On stress and linguistic rhythm , 1977 .

[102]  James Kelly,et al.  AutoClass: A Bayesian Classification System , 1993, ML.

[103]  G. Kane Parallel Distributed Processing: Explorations in the Microstructure of Cognition, vol 1: Foundations, vol 2: Psychological and Biological Models , 1994 .

[104]  Walter Daelemans,et al.  Memory-based lexical acquisition and processing , 1993, EAMT.

[105]  姚小平,et al.  语言学简史 : [英文版] = A Short History of Linguistics , 1969 .

[106]  John E. Moody,et al.  The Effective Number of Parameters: An Analysis of Generalization and Regularization in Nonlinear Learning Systems , 1991, NIPS.

[107]  K. P. Mohanan,et al.  The Theory of Lexical Phonology , 1982 .

[108]  Leo Breiman,et al.  Bias, Variance , And Arcing Classifiers , 1996 .

[109]  G. Zipf The Psycho-Biology Of Language: AN INTRODUCTION TO DYNAMIC PHILOLOGY , 1999 .

[110]  Aj.M.M. Weijters A SIMPLE LOOK-UP PROCEDURE SUPERIOR TO NETTALK? , 1991 .

[111]  Paul W. B. Atkins,et al.  Models of reading aloud: Dual-route and parallel-distributed-processing approaches. , 1993 .

[112]  Sheri Hunnicutt Grapheme-to-phoneme rules: A review , 1980 .

[113]  René Kager,et al.  The metrical theory of word stress , 1995 .

[114]  Michael I. Jordan Attractor dynamics and parallelism in a connectionist sequential machine , 1990 .

[115]  Thomas G. Dietterich,et al.  Error-Correcting Output Codes: A General Method for Improving Multiclass Inductive Learning Programs , 1991, AAAI.

[116]  黃崇冀,et al.  Machine learning : an artificial intelligence approach , 1988 .

[117]  Geoffrey E. Hinton,et al.  Learning internal representations by error propagation , 1986 .

[118]  Noam Chomsky,et al.  वाक्यविन्यास का सैद्धान्तिक पक्ष = Aspects of the theory of syntax , 1965 .

[119]  Kimmo Koskenniemi,et al.  A General Computational Model for Word-Form Recognition and Production , 1984, ACL.

[120]  Laurie Bauer,et al.  Introducing Linguistic Morphology , 1988 .

[121]  David E. Rumelhart,et al.  Weight elimination and effective network size , 1994, COLT 1994.

[122]  Noam Chomsky,et al.  Aspects of the Theory of Syntax. , 1966 .

[123]  Miroslav Kubat,et al.  Initialization of neural networks by means of decision trees , 1995, Knowl. Based Syst..

[124]  David H. Wolpert,et al.  Constructing a generalizer superior to NETtalk via a mathematical theory of generalization , 1990, Neural Networks.

[125]  R. Treiman,et al.  Toward an understanding of English syllabification , 1990 .

[126]  Thomas G. Dietterich What is machine learning? , 2020, Archives of Disease in Childhood.

[127]  Scott E. Fahlman,et al.  An empirical study of learning speed in back-propagation networks , 1988 .

[128]  W. Levelt,et al.  Speaking: From Intention to Articulation , 1990 .

[129]  Walter Daelemans,et al.  Morphological Analysis as Classification: an Inductive-Learning Approach , 1996, ArXiv.

[130]  Vincent J. van Heuven,et al.  Analysis and synthesis of speech: strategic research towards high-quality text-to-speech generation , 1993 .

[131]  David A. Landgrebe,et al.  A survey of decision tree classifier methodology , 1991, IEEE Trans. Syst. Man Cybern..

[132]  P. Flach Conjectures: an inquiry concerning the logic of induction , 1995 .

[133]  Maria Wolters,et al.  A Diphone{based Text-to-speech System for Scottish Gaelic , 1997 .

[134]  Philip J. Stone,et al.  Experiments in induction , 1966 .

[135]  B. Dresher,et al.  A computational learning model for metrical phonology , 1990, Cognition.

[136]  F. D. Saussure Cours de linguistique générale , 1924 .

[137]  Herbert A. Simon,et al.  Search and Reasoning in Problem Solving , 1983, Artif. Intell..

[138]  David L. Waltz,et al.  Toward memory-based reasoning , 1986, CACM.

[139]  Noam Chomsky,et al.  The Sound Pattern of English , 1968 .

[140]  Michael Kenstowicz,et al.  Phonology In Generative Grammar , 1994 .

[141]  Benny Lautrup,et al.  Neural Networks: Computers With Intuition , 1990 .

[142]  James L. McClelland,et al.  Understanding normal and impaired word reading: computational principles in quasi-regular domains. , 1996, Psychological review.

[143]  Anders Krogh,et al.  Introduction to the theory of neural computation , 1994, The advanced book program.

[144]  J. Blevins The Syllable in Phonological Theory , 1995 .

[145]  H. Gardner The mind's new science: a history of the cognitive revolution , 1985 .

[146]  J. Ross Quinlan,et al.  C4.5: Programs for Machine Learning , 1992 .

[147]  Antal van den Bosch,et al.  Automatic phonetic transcription of words based on sparse data , 1997 .

[148]  Walter Daelemans,et al.  A computational model of P&P: Dresher and Kaye (1990) revisited , 1995 .

[149]  M. Halle,et al.  English stress : its form, its growth, and its role in verse , 1977 .

[150]  Walter Daelemans,et al.  Data-Oriented Methods for Grapheme-to-Phoneme Conversion , 1993, EACL.