Quantitative typological analysis of Romance languages

Abstract Based on real-text corpora with syntactic annotation, this study quantitatively addressed the following two questions: whether quantitative methods and indexes can point to the diachronic syntactic drifts characterizing the evolution from Latin to Romance languages and whether these methods and indexes can provide evidence to evince the shared syntactic features among Romance languages and define them as a distinctive language subgroup. Our study shows that the distributions of dependency directions are suggestive of positive answers to the above two questions. In addition, the dependency syntactic networks extracted from the dependency treebanks reflect the degree of inflectional variation of a language, and the clustering analysis shows that these parameters, in spite of some imperfections, can also help differentiate Romance languages from Latin diachronically and from other languages synchronically.

[1]  Matthew S. Dryer,et al.  Word Order , 2022 .

[2]  Peter Koch,et al.  Connexiones romanicae : Dependenz und Valenz in romanischen Sprachen , 1991 .

[3]  Adam Ledgeway SYNTACTIC AND MORPHOSYNTACTIC TYPOLOGY AND CHANGE , 2010 .

[4]  J. B. Solodow Latin Alive: The Survival of Latin in English and the Romance Languages , 2010 .

[5]  Martin Harris,et al.  The evolution of French syntax : a comparative approach , 1983 .

[6]  L. R. Palmer,et al.  The Latin Language , 1954 .

[7]  Albert-László Barabási,et al.  Statistical mechanics of complex networks , 2001, ArXiv.

[8]  R. Wright Du latin aux langues romanes: études de linguistique historique , 1990 .

[9]  E. Gibson Linguistic complexity: locality of syntactic dependencies , 1998, Cognition.

[10]  Michael Cysouw,et al.  New approaches to cluster analysis of typological indices , 2007, Exact Methods in the Study of Language and Text.

[11]  Haitao Liu,et al.  Using a Chinese treebank to measure dependency distance , 2009 .

[12]  Fidel Ramírez,et al.  Computing topological parameters of biological networks , 2008, Bioinform..

[13]  Emmerich Kelih,et al.  The type-token relationship in Slavic parallel texts , 2010, Glottometrics.

[14]  Édouard Bourciez,et al.  Éléments de linguistique romane , 1948 .

[15]  W. Bruce Croft Typology and Universals , 1990 .

[16]  Haitao Liu The complexity of Chinese syntactic dependency networks , 2008 .

[17]  Haitao Liu,et al.  Dependency direction as a means of word-order typology: A method based on dependency treebanks , 2010 .

[18]  David Temperley,et al.  Dependency-length minimization in natural and artificial languages* , 2008, J. Quant. Linguistics.

[19]  Haitao Liu,et al.  Can syntactic networks indicate morphological complexity of a language , 2011 .

[20]  Richard Hudson,et al.  An Introduction to Word Grammar , 2010 .

[21]  Roberto Basili,et al.  Building the Italian Syntactic-Semantic Treebank , 2003 .

[22]  Sebastian Riedel,et al.  The CoNLL 2007 Shared Task on Dependency Parsing , 2007, EMNLP.

[23]  Díaz de Ilarraza Construction of a Basque Dependency Treebank , 2003 .

[24]  Marián Sloboda Typology and Universals (review) , 2005 .

[25]  L. da F. Costa,et al.  Characterization of complex networks: A survey of measurements , 2005, cond-mat/0505185.

[26]  Armin Schwegler,et al.  Analyticity and Syntheticity: A Diachronic Perspective with Special Reference to Romance Languages , 1990 .

[27]  Kemal Oflazer,et al.  The Annotation Process in the Turkish Treebank , 2003, LINC@EACL.

[28]  Max Bane,et al.  Quantifying and Measuring Morphological Complexity , 2007 .

[29]  J. Greenberg A Quantitative Approach to the Morphological Typology of Language , 1960, International Journal of American Linguistics.

[30]  Haitao Liu,et al.  How do Local Syntactic Structures Influence Global Properties in Language Networks? , 2010, Glottometrics.

[31]  David Bamman,et al.  The Design and Use of a Latin Dependency Treebank , 2006 .

[32]  Zdeněk Žabokrtský,et al.  The role of syntax in complex networks: Local and global importance of verbs in a syntactic dependen , 2011 .

[33]  Dilek Z. Hakkani-Tür,et al.  Building a Turkish Treebank , 2003 .

[34]  Haitao Liu,et al.  Language clusters based on linguistic complex networks , 2010 .

[35]  Haitao Liu,et al.  Dependency Distance as a Metric of Language Comprehension Difficulty , 2008 .

[36]  Gabriel Altmann,et al.  Allgemeine Sprachtypologie : Prinzipien und Messverfahren , 1973 .

[37]  Stelios Piperidis,et al.  Theoretical and Practical Issues in the Construction of a Greek Dependency Treebank , 2005 .

[38]  W. Schmidt,et al.  Die Sprachfamilien und Sprachenkreise der Erde , 1927, Nature.

[39]  Nigel Vincent,et al.  The Romance Languages , 1988 .

[40]  V. Moulton,et al.  Neighbor-net: an agglomerative method for the construction of phylogenetic networks. , 2002, Molecular biology and evolution.

[41]  Montserrat Civit Torruella,et al.  Design Principles for a Spanish Treebank , 2002 .

[42]  Eckhard Bick,et al.  Floresta Sintá(c)tica: A treebank for Portuguese , 2002, LREC.

[43]  David Gil,et al.  The World Atlas of Language Structures , 2005 .

[44]  Pascal Denis,et al.  Statistical French Dependency Parsing: Treebank Conversion and First Results , 2010, LREC.

[45]  Alexandra Kinyon,et al.  Building a Treebank for French , 2000, LREC.

[46]  P. Shannon,et al.  Cytoscape: a software environment for integrated models of biomolecular interaction networks. , 2003, Genome research.

[47]  Alexander Mehler,et al.  Automatic Language Classification by means of Syntactic Dependency Networks , 2011, J. Quant. Linguistics.

[48]  E. Magni The Evolution of Latin Word (Dis)order , 2009 .

[49]  Joseph H. Greenberg,et al.  Some Universals of Grammar with Particular Reference to the Order of Meaningful Elements , 1990, On Language.

[50]  Gabriel Altmann,et al.  Hapax Legomena and Language Typology , 2008, J. Quant. Linguistics.

[51]  Richard Hudson,et al.  Language Networks: The New Word Grammar , 2007 .

[52]  Haitao Liu Quantitative analysis of Zamenhof’s Esenco kaj estonteco , 2011 .

[53]  Gregory Crane,et al.  An Ownership Model of Annotation: The Ancient Greek Dependency Treebank , 2009 .

[54]  Jae Jung Song,et al.  Linguistic Typology: Morphology and Syntax , 2000 .

[55]  Wenwen Li,et al.  Chinese Syntactic and Typological Properties Based on Dependency Syntactic Treebanks , 2009 .

[56]  Heinz Happ,et al.  Grundfragen einer Dependenz-Grammatik des Lateinischen , 1976 .

[57]  Brigitte L. M. Bauer,et al.  The Emergence and Development of SVO Patterning in Latin and French: Diachronic and Psycholinguistic Perspectives , 1995 .

[58]  J. Adams A typological approach to Latin word order , 1976, Indogermanische Forschungen.