BosphorusSign: A Turkish Sign Language Recognition Corpus in Health and Finance Domains

There are as many sign languages as there are deaf communities in the world. Linguists have been collecting corpora of different sign languages and annotating them extensively in order to study and understand their properties. On the other hand, the field of computer vision has approached the sign language recognition problem as a grand challenge and research efforts have intensified in the last 20 years. However, corpora collected for studying linguistic properties are often not suitable for sign language recognition as the statistical methods used in the field require large amounts of data. Recently, with the availability of inexpensive depth cameras, groups from the computer vision community have started collecting corpora with large number of repetitions for sign language recognition research. In this paper, we present the BosphorusSign Turkish Sign Language corpus, which consists of 855 sign and phrase samples from the health, finance and everyday life domains. The corpus is collected using the state-of-the-art Microsoft Kinect v2 depth sensor, and will be the first in this sign language research field. Furthermore, there will be annotations rendered by linguists so that the corpus will appeal both to the linguistic and sign language recognition research communities.

[1]  Thad Starner,et al.  A novel approach to American Sign Language (ASL) phrase verification using reversed signing , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition - Workshops.

[2]  Hermann Ney,et al.  RWTH-PHOENIX-Weather: A Large Vocabulary Sign Language Recognition and Translation Corpus , 2012, LREC.

[3]  Jonas Beskow,et al.  A Kinect Corpus of Swedish Sign Language Signs , 2013 .

[4]  Sergio Escalera,et al.  ChaLearn Looking at People Challenge 2014: Dataset and Results , 2014, ECCV Workshops.

[5]  Hermann Ney,et al.  Benchmark Databases for Video-Based Automatic Sign Language Recognition , 2008, LREC.

[6]  Onno Crasborn,et al.  The Corpus NGT: An online corpus for professionals and laymen , 2008 .

[7]  Wen Gao,et al.  A SRN/HMM system for signer-independent continuous sign language recognition , 2002, Proceedings of Fifth IEEE International Conference on Automatic Face Gesture Recognition.

[8]  Thomas Hanke,et al.  iLex - A tool for Sign Language Lexicography and Corpus Analysis , 2002, LREC.

[9]  Thomas Hanke,et al.  DGS corpus project - Development of a corpus based electronic dictionary German Sign Language / German , 2009 .

[10]  Peter Wittenburg,et al.  Annotation by Category: ELAN and ISO DCR , 2008, LREC.

[11]  Thomas Hanke HamNoSys – Representing Sign Language Data in Language Resources and Language Processing Contexts , 2004 .

[12]  Alex Pentland,et al.  Real-Time American Sign Language Recognition Using Desk and Wearable Computer Based Video , 1998, IEEE Trans. Pattern Anal. Mach. Intell..

[13]  Richard Bowden,et al.  Sign Language Recognition , 2011, Visual Analysis of Humans.

[14]  Zhengyou Zhang,et al.  Microsoft Kinect Sensor and Its Effect , 2012, IEEE Multim..

[15]  K. Allan,et al.  Classifiers , 2015 .

[16]  Scott K. Liddell Grammar, Gesture, and Meaning in American Sign Language , 2003 .

[17]  Andy Way,et al.  The ATIS Sign Language Corpus , 2008, LREC.

[18]  Lale Akarun,et al.  HOSPISIGN: AN INTERACTIVE SIGN LANGUAGE PLATFORM FOR HEARING IMPAIRED , 2015 .

[19]  Jordan Fenlon,et al.  Building the British Sign Language Corpus , 2013 .

[20]  W. Stokoe Sign Language Structure , 1980 .

[21]  Manuel Carreiras,et al.  LSE-Sign: A lexical database for Spanish Sign Language , 2015, Behavior Research Methods.

[22]  John Glauert,et al.  Dicta-Sign – Building a Multilingual Sign Language Corpus , 2012 .

[23]  Trevor Johnston,et al.  From archive to corpus: transcription and annotation in the creation of signed language corpora , 2008, PACLIC.

[24]  Stan Sclaroff,et al.  The American Sign Language Lexicon Video Dataset , 2008, 2008 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops.