Analogies between linguistics and information theory

An analogy is established between the syntagm and paradigm of Saussurean linguistics and the message and the messages for selection in the information theory initiated by Claude Shannon. The analogy is pursued both as an end in itself and for its analytic value in understanding patterns of retrieval from full-text systems. In searches of ordinary written discourse, the multivalency of individual words isolated from their syntagm is contrasted with the relative stability of meaning of multiword sequences. The syntagm is understood as the linear sequence of oral and written language. Saussure's understanding of the word, as a unit that compels recognition by the mind, is endorsed, although not regarded as final. The lesser multivalency of multiword sequences is understood as the greater determination of signification by the extended syntagm. The paradigm is primarily understood as the network of associations a word acquires when considered apart from the syntagm. The restriction of information theory to expression, or signals, and its focus on the combinatorial aspects of the message are retained. The message in the information-theoretic model of communication can include sequences of written language. Shannon's understanding of the written word, as a cohesive group of letters with strong internal statistical influences, is added to the Saussurean conception. Sequences of more than one word are regarded as weakly correlated concatenations of cohesive units.
