Lexicon and Syntax: Complexity across Genres and Language Varieties

English. This paper presents first results of an ongoing work to investigate the interplay between lexical complexity and syntactic complexity with respect to nominal lexicon and how it is affected by textual genre and level of linguistic complexity within genre. A cross-genre analysis is carried out for the Italian language using multi–leveled linguistic features automatically extracted from dependency parsed corpora. Italiano. Questo articolo presenta i primi risultati di un lavoro in corso volto a indagare la relazione tra complessità lessicale e complessità sintattica rispetto al lessico nominale e in che modo sia influenzata dal genere testuale e dal livello di complessità linguistica interno al genere. Un’analisi comparativa su più generi è condotta per la lingua italiana usando caratteristiche linguistiche multilivello estratte automaticamente da corpora annotati fino alla sintassi a dipen-

[1]  D. Bernhard,et al.  Recent Advances in Automatic Readability Assessment and Text Simplification , 2014 .

[2]  Daniel Gildea,et al.  Do Grammars Minimize Dependency Length? , 2010, Cogn. Sci..

[3]  Felice Dell'Orletta,et al.  Linguistic Profiling based on General-purpose Features and Native Language Identification , 2013, BEA@NAACL-HLT.

[4]  R. W. Morris,et al.  The Wilcoxon rank sum test , 1976 .

[5]  Silvia Bernardini,et al.  The WaCky wide web: a collection of very large linguistically processed web-crawled corpora , 2009, Lang. Resour. Evaluation.

[6]  E. Gibson Linguistic complexity: locality of syntactic dependencies , 1998, Cognition.

[7]  Felice Dell'Orletta,et al.  Accurate Dependency Parsing with a Stacked Multilayer Perceptron , 2009 .

[8]  Richard Futrell,et al.  Quantifying Word Order Freedom in Dependency Corpora , 2015, DepLing.

[9]  Holger Diessel,et al.  Competing motivations for the ordering of main and adverbial clauses , 2005 .

[10]  John A. Hawkins,et al.  A Performance Theory of Order and Constituency , 1995 .

[11]  Felice Dell'Orletta,et al.  Assessing the Readability of Sentences: Which Corpora and Features? , 2014, BEA@ACL.

[12]  Randall J. Ryder,et al.  The Relationship Between Word Frequency and Word Knowledge , 1988 .

[13]  Benedikt Szmrecsanyi,et al.  Linguistic Complexity: Second Language Acquisition, Indigenization, Contact , 2012 .

[14]  E. Gibson The dependency locality theory: A distance-based theory of linguistic complexity. , 2000 .

[15]  Felice Dell'Orletta,et al.  On the order of Words in Italian: a Study on Genre vs Complexity , 2017, DepLing.

[16]  Haitao Liu,et al.  The effects of genre on dependency distance and dependency direction , 2017 .