A Mathematical Model of Historical Semantics and the Grouping of Word Meanings into Concepts

A statistical analysis of polysemy in sixteen English and French dictionaries has revealed that, in each dictionary, the number of senses per word has a near-exponential distribution. A probabilistic model of historical semantics is presented which explains this distribution. This mathematical model also provides a means of estimating the average number of distinct concepts per word, which was found to be considerably less than the average number of senses listed per word. The grouping of word senses into concepts is based on whether they could inspire the same new senses (by metaphor, metonymy, etc.), that is, their potential future rather than their history.

[1]  F. G. Healey Review. Harrap's Slang Dictionary. English-French/French-English. 50,000 modern words and expressions from the colloquial to the very vulgar , 1985 .

[2]  Bruno Gaume,et al.  Forms of meaning, meaning of forms , 2002, J. Exp. Theor. Artif. Intell..

[3]  Yuen Ren Chao,et al.  Human Behavior and the Principle of Least Effort: An Introduction to Human Ecology , 1950 .

[4]  Catherine Fuchs,et al.  Les ambiguïtés du français , 1996 .

[5]  Jonathan Crowther Oxford advanced learner's encyclopedic dictionary , 1992 .

[6]  Martin H. Levinson The Meaning of Everything: The Story of the Oxford English Dictionary , 2004 .

[7]  Dirk Geeraerts,et al.  Diachronic Prototype Semantics: A Contribution to Historical Lexicology , 1997 .

[8]  C. Onions,et al.  The Oxford Dictionary of English Etymology , 1967 .

[9]  R. Burchfield Oxford English dictionary , 1982 .

[10]  Jason Eisner,et al.  Lexical Semantics , 2020, The Handbook of English Linguistics.

[11]  Rada Mihalcea,et al.  EZ.WordNet: Principles for Automatic Generation of a Coarse Grained WordNet , 2001, FLAIRS Conference.

[12]  Piek Vossen The Language of Word Meaning: Condensed Meaning in EuroWordNet , 2001 .

[13]  George Kingsley Zipf,et al.  Human Behaviour and the Principle of Least Effort: an Introduction to Human Ecology , 2012 .

[14]  Donka Minkova,et al.  English Words: History and Structure , 2001 .

[15]  M. Swadesh Lexico-Statistical Dating of Prehistoric Ethnic Contacts , 1952 .

[16]  Yorick Wilks,et al.  Combining Weak Knowledge Sources for Sense Disambiguation , 1999, IJCAI.

[17]  Takenobu Tokunaga,et al.  Combining general hand-made and automatically constructed thesauri for information retrieval , 1999, IJCAI 1999.

[18]  D. A. Cruse,et al.  Computational lexical semantics: Polysemy and related phenomena from a cognitive linguistic viewpoint , 1995 .

[19]  B. Joseph,et al.  Historical Linguistics , 1999 .

[20]  B. Hayes THE WEB OF WORDS , 1999 .

[21]  Takenobu Tokunaga,et al.  Combining General Hand-Made and Automatically Constructed Thesauri for Query Expansion in Information Retrieval , 1999, IJCAI.

[22]  Raimo Anttila,et al.  Historical and comparative linguistics , 1989 .

[23]  Robert V. Hogg,et al.  Introduction to Mathematical Statistics. , 1966 .

[24]  C. Habel,et al.  Language , 1931, NeuroImage.

[25]  Terttu Nevalainen,et al.  EARLY MODERN ENGLISH LEXIS AND SEMANTICS , 2000 .

[26]  John Sinclair Collins Cobuild Learner's Dictionary , 1996 .

[27]  Claude Baudoin Iic,et al.  Vocabulary , 2007, Selections from Horace Odes III.

[28]  G. Reinsel,et al.  Introduction to Mathematical Statistics (4th ed.). , 1980 .

[29]  ResnikPhilip,et al.  Distinguishing systems and distinguishing senses: new evaluation methods for Word Sense Disambiguation , 1999 .

[30]  Claude Margueron,et al.  Larousse dictionnaire général : Français-Italien, Italien-Français = Dizionario : Francese-Italiano, Italiano-Francese , 1994 .

[31]  Christiane Fellbaum,et al.  Book Reviews: WordNet: An Electronic Lexical Database , 1999, CL.

[32]  DAVID G. KENDALL,et al.  Introduction to Mathematical Statistics , 1947, Nature.

[33]  Derek de Solla Price,et al.  A general theory of bibliometric and other cumulative advantage processes , 1976, J. Am. Soc. Inf. Sci..

[34]  Paul Procter,et al.  Longman Dictionary of Contemporary English , 1978 .

[35]  Bob Duckett,et al.  The Cambridge History of the English Language , 1999 .

[36]  M. Pagel The Evolutionary Emergence of Language: The History, Rate and Pattern of World Linguistic Evolution , 2000 .

[37]  R. Bailly,et al.  Dictionnaire des synonymes de la langue française , 1948 .

[38]  Stephan Gramley,et al.  The Vocabulary of World English , 2001 .

[39]  Mark E. J. Newman,et al.  The Structure and Function of Complex Networks , 2003, SIAM Rev..