A grammar-book treebank of Turkish

This paper introduces a new Turkish dependency treebank following the Universal Dependencies annotation scheme. The treebank is built on example sentences from a grammar book, which cover a wide range of the linguistic constructions. Thus, the resulting treebank is a valuable resource for theoretical (linguistic) research as well as testing computational tools for the coverage of the constructions found in the language. This version differs from the published version slightly. Besides potential differences in typesetting, a dependency label in Fig. 3b is corrected.

[1]  Çağrı Çöltekin,et al.  A Freely Available Morphological Analyzer for Turkish , 2010, LREC.

[2]  Çağri Çoltekin A set of open source tools for Turkish natural language processing , 2014, LREC 2014.

[3]  A. Göksel,et al.  Turkish: A Comprehensive Grammar , 2004 .

[4]  Ruket Cakici,et al.  Wide-coverage parsing for Turkish , 2009 .

[5]  Christo Kirov,et al.  A Universal Feature Schema for Rich Morphological Annotation and Fine-Grained Cross-Lingual Part-of-Speech Tagging , 2015, SFCM.

[6]  Beatrice Santorini,et al.  Building a Large Annotated Corpus of English: The Penn Treebank , 1993, CL.

[7]  Sabine Buchholz,et al.  CoNLL-X Shared Task on Multilingual Dependency Parsing , 2006, CoNLL.

[8]  Gülsen Eryigit,et al.  Representation of Morphosyntactic Units and Coordination Structures in the Turkish Dependency Treebank , 2013, SPMRL@EMNLP.

[9]  Timothy Baldwin,et al.  From Database to Treebank: On Enhancing Hypertext Grammars with Grammar Engineering and Treebank Search , 2012 .

[10]  Olcay Taner Yildiz,et al.  Constructing a Turkish-English Parallel TreeBank , 2014, ACL.

[11]  Joakim Nivre,et al.  Swedish-Turkish Parallel Treebank , 2008, LREC.

[12]  Samuel R. Bowman,et al.  A Gold Standard Dependency Corpus for English , 2014, LREC.

[13]  Kemal Oflazer,et al.  The Annotation Process in the Turkish Treebank , 2003, LINC@EACL.

[14]  Ondrej Dusek,et al.  HamleDT: Harmonized multi-language dependency treebank , 2014, Lang. Resour. Evaluation.

[15]  Gökhan Tür,et al.  Statistical Morphological Disambiguation for Agglutinative Languages , 2000, COLING.

[16]  Francis M. Tyers,et al.  Universal Dependencies , 2017, EACL.

[17]  Dilek Z. Hakkani-Tür,et al.  Building a Turkish Treebank , 2003 .

[18]  Wolfgang Seeker,et al.  A Graph-based Lattice Dependency Parser for Joint Morphological Segmentation and Syntactic Analysis , 2015, Transactions of the Association for Computational Linguistics.

[19]  Koenraad De Smedt,et al.  An Open Infrastructure for Advanced Treebanking , 2013 .

[20]  Sampo Pyysalo,et al.  brat: a Web-based Tool for NLP-Assisted Text Annotation , 2012, EACL.

[21]  Francis M. Tyers,et al.  Towards a free/open-source universal-dependency treebank for Kazakh , 2015 .

[22]  Çagri Çöltekin Turkish NLP web services in the WebLicht environment , 2015 .

[23]  Erhard W. Hinrichs,et al.  The Tüba-D/Z Treebank: Annotating German with a Context-Free Backbone , 2004, LREC.