Polysemy and synonymy in syntactic dependency networks

The relationship between two important semantic properties (polysemy and synonymy) of language and one of the most fundamental syntactic network properties (a degree of the node) is observed. Based on the synergetic theory of language, it is hypothesized that a word which occurs in more syntactic contexts, i.e. it has a higher degree, should be more polysemous and have more synonyms than a word which occurs in less syntactic contexts, i.e. it has a lesser degree. Six languages are used for hypotheses testing and, tentatively, the hypotheses are corroborated. The analysis of syntactic dependency networks presented in this study brings a new interpretation of the well-known relationship between frequency and polysemy (or synonymy).

[1]  Haitao Liu,et al.  Quantitative Properties of English Verb Valency , 2011, Journal of Quantitative Linguistics.

[2]  Petr Pajas,et al.  Full Valency. Verb Valency without Distinguishing Complements and Adjuncts , 2010, J. Quant. Linguistics.

[3]  Bethany S. Dohleman Exploratory social network analysis with Pajek , 2006 .

[4]  Lu Wang Synergetic Studies on Some Properties of Lexical Structures in Chinese , 2014, J. Quant. Linguistics.

[5]  Robert Malouf,et al.  Algorithms for Linguistic Processing, NWO PIONIER, Progress Report , 2002 .

[6]  Igor Mel’čuk,et al.  Dependency Syntax: Theory and Practice , 1987 .

[7]  Herbert J. Carlin,et al.  Network theory , 1964 .

[8]  Ján Macutek,et al.  On the quantitative analysis of verb valency in Czech , 2010, Text and Language.

[9]  Haitao Liu,et al.  How do Local Syntactic Structures Influence Global Properties in Language Networks? , 2010, Glottometrics.

[10]  Gerhard Heyer,et al.  Begriffsdynamik und Lexikonstruktur , 1993 .

[11]  G. Zipf The Psycho-Biology Of Language: AN INTRODUCTION TO DYNAMIC PHILOLOGY , 1999 .

[12]  Patrick Colm Hogan,et al.  The Cambridge encyclopedia of the language sciences , 2011 .

[13]  Zdeněk Žabokrtský,et al.  The role of syntax in complex networks: Local and global importance of verbs in a syntactic dependen , 2011 .

[14]  Mariona Taulé,et al.  AnCora: Multilevel Annotated Corpora for Catalan and Spanish , 2008, LREC.

[15]  Alexander Mehler,et al.  Automatic Language Classification by means of Syntactic Dependency Networks , 2011, J. Quant. Linguistics.

[16]  Haitao Liu,et al.  Language clusters based on linguistic complex networks , 2010 .

[17]  Jinyun Ke,et al.  Analysing Language Development from a Network Approach* , 2006, J. Quant. Linguistics.

[18]  D. W. Scott On optimal and data based histograms , 1979 .

[19]  Ján Macutek,et al.  Word form and lemma syntactic dependency networks in Czech: a comparative study , 2009, Glottometrics.

[20]  D. Wolfe,et al.  Nonparametric Statistical Methods. , 1974 .

[21]  G. Zipf,et al.  The Psycho-Biology of Language , 1936 .

[22]  Germán Colomá Towards a Synergetic Statistical Model of Language Phonology* , 2014, J. Quant. Linguistics.

[23]  Richard Hudson,et al.  Language Networks: The New Word Grammar , 2007 .

[25]  Reinhard Kühler,et al.  Quantitative Analysis of Syntactic Structures in the Framework of Synergetic Linguistics , 2007 .

[26]  R. Ferrer i Cancho,et al.  Zipf's law from a communicative phase transition , 2005 .

[27]  Stephen P. Borgatti,et al.  Network Theory , 2013 .

[28]  Anat Ninio Syntactic Development: Its input and output , 2011 .

[29]  Gosse Bouma,et al.  Algorithms for Linguistic Processing , 1999 .

[30]  Mark Newman,et al.  Networks: An Introduction , 2010 .

[31]  Richard Johansson,et al.  The CoNLL 2008 Shared Task on Joint Parsing of Syntactic and Semantic Dependencies , 2008, CoNLL.

[32]  R. Harald Baayen,et al.  Semantic Density and Past-Tense Formation in Three Germanic Languages , 2005 .

[33]  Reinhard Köhler,et al.  Zur linguistischen Synergetik : Struktur und Dynamik der Lexik , 1986 .

[34]  Haitao Liu,et al.  Can syntactic networks indicate morphological complexity of a language , 2011 .

[35]  Anat Ninio,et al.  Language and the Learning Curve: A New Theory of Syntactic Development , 2006 .

[36]  Reinhard Kohler Quantitative Syntax Analysis , 2012 .

[37]  M. Hoey Lexical Priming: A New Theory of Words and Language , 2005 .

[38]  Marie Mikulová,et al.  Prague Dependency Treebank , 2017 .

[39]  R. Ferrer i Cancho Why do syntactic links not cross , 2006 .

[40]  Albert,et al.  Emergence of scaling in random networks , 1999, Science.

[41]  Haitao Liu,et al.  Approaching human language with complex networks. , 2014, Physics of life reviews.

[42]  Ramon Ferrer-i-Cancho,et al.  Beyond description. Comment on "Approaching human language with complex networks" by Cong & Liu , 2014, Physics of life reviews.

[43]  Reinhard Köhler Properties of lexical units and systems (Eigenschaften lexikalischer Einheiten und Systeme) , 2005, Quantitative Linguistik / Quantitative Linguistics.

[44]  Haitao Liu,et al.  What role does syntax play in a language network , 2008 .

[45]  William R. Penuel,et al.  The ‘New’ Science of Networks and the Challenge of School Change , 2007 .

[46]  Aleš Horák,et al.  The Global WordNet Grid Software Design , 2008 .

[47]  Sabine Brants,et al.  The TIGER Treebank , 2001 .

[48]  George Kingsley Zipf,et al.  Human behavior and the principle of least effort , 1949 .

[49]  Roberto Basili,et al.  Building the Italian Syntactic-Semantic Treebank , 2003 .

[50]  Juhan Tuldava,et al.  Probleme und Methoden der quantitativ-systemischen Lexikologie , 1998 .

[51]  Ricard Solé,et al.  Language: Syntax for free? , 2005, Nature.

[52]  Haitao Liu,et al.  Synergetic Properties of Chinese Verb Valency , 2014, J. Quant. Linguistics.

[53]  G. Zipf The meaning-frequency relationship of words. , 1945, The Journal of general psychology.

[54]  Béla Bollobás,et al.  The consequences of Zipf's law for syntax and symbolic reference , 2005, Proceedings of the Royal Society B: Biological Sciences.

[55]  Haitao Liu The complexity of Chinese syntactic dependency networks , 2008 .

[56]  Ramon Ferrer i Cancho,et al.  When language breaks into pieces. A conflict between communication through isolated signals and language. , 2006, Bio Systems.

[57]  Emmerich Kelih,et al.  Modelling polysemy in different languages: A continuous approach , 2008, Glottometrics.

[58]  Ramon Ferrer-i-Cancho,et al.  Some Word Order Biases from Limited Brain Resources: a Mathematical Approach , 2008, Adv. Complex Syst..

[59]  Ricard V. Solé,et al.  Emergence of Scale-Free Syntax Networks , 2007, Evolution of Communication and Language in Embodied Agents.

[60]  Reinhard Köhler,et al.  Patterns in syntactic dependency networks. , 2004, Physical review. E, Statistical, nonlinear, and soft matter physics.

[61]  Ricard V. Solé,et al.  Least effort and the origins of scaling in human language , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[62]  Ján Macutek,et al.  Evaluating goodness-of-fit of discrete distribution models in quantitative linguistics , 2013, J. Quant. Linguistics.

[63]  Reinhard Köhler,et al.  Syntactic Structures: Properties and Interrelations , 1999 .

[64]  B. Karaoglan,et al.  Investigation of Zipf’s ‘law-of-meaning’ on Turkish corpora , 2007, 2007 22nd international symposium on computer and information sciences.

[65]  Daniel Zeman,et al.  HamleDT: To Parse or Not to Parse? , 2012, LREC.

[66]  Reinhard Köhler Quantitative Analysis of Syntactic Structures in the Framework of Synergetic Linguistics , 2007, Aspects of Automatic Text Analysis.