Glossing a multi-purpose sign language corpus

This paper describes the strategies developed for creating consistent gloss annotations in the latest update to the Corpus NGT. Although the project aims to embrace the plea for ID-glosses in Johnston (2008), there is no reference lexicon that could be used to create the annotations. An idiosyncratic strategy was therefore developed that involved the creation of a temporary ‘glossing lexicon’, which includes conventions for distinguishing regional and other variants, for handling true and apparent homonymy, and for addressing other difficulties specifically related to the glossing of two-handed simultaneous constructions on different tiers.
