Paper information - Collection and Preprocessing of Czech Sign Language Corpus for Sign Language Recognition

Collection and Preprocessing of Czech Sign Language Corpus for Sign Language Recognition

This paper discusses the design, recording, and preprocessing of a Czech sign language corpus. The corpus is intended for training and testing sign language recognition (SLR) systems. The UWB-07-SLR-P corpus contains video data of 4 signers recorded from 3 different perspectives. Two of the perspectives capture the whole body and provide 3D motion data; the third is focused on the signer's face and provides data for facial expression and lip feature extraction. Each signer performed 378 signs with 5 repetitions. The corpus consists of several types of signs: numbers (35 signs), one- and two-handed finger alphabet (64), town names (35), and other signs (244). Each sign is stored in a separate AVI file. In total, the corpus consists of 21853 video files with a total length of 11.1 hours. Additionally, each sign is preprocessed, and basic features such as 3D hand and head trajectories are available. The corpus is mainly focused on feature extraction and isolated SLR rather than continuous SLR experiments.
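The figures quoted in the abstract can be cross-checked with a little arithmetic. The sketch below is only illustrative: the function and constant names are hypothetical, and it assumes one file per signer, sign, repetition, and perspective, which the abstract does not state explicitly. Under that assumption the nominal file count is higher than the reported 21853; the abstract gives no explanation for the difference.

```python
# Figures taken from the abstract; all names here are illustrative only.
SIGNERS = 4
PERSPECTIVES = 3
REPETITIONS = 5
SIGN_CATEGORIES = {
    "numbers": 35,
    "finger alphabet (one- and two-handed)": 64,
    "town names": 35,
    "other signs": 244,
}
REPORTED_FILES = 21853
REPORTED_HOURS = 11.1


def corpus_summary():
    """Derive simple consistency figures from the numbers in the abstract."""
    signs_per_signer = sum(SIGN_CATEGORIES.values())
    # Assumption: one file per (signer, sign, repetition, perspective).
    nominal_files = SIGNERS * signs_per_signer * REPETITIONS * PERSPECTIVES
    # Mean clip length implied by the reported totals, in seconds.
    mean_clip_seconds = REPORTED_HOURS * 3600 / REPORTED_FILES
    return {
        "signs_per_signer": signs_per_signer,
        "nominal_files": nominal_files,
        "nominal_minus_reported": nominal_files - REPORTED_FILES,
        "mean_clip_seconds": mean_clip_seconds,
    }


if __name__ == "__main__":
    for key, value in corpus_summary().items():
        print(f"{key}: {value}")
```

The category counts sum to the 378 signs per signer stated in the abstract, and the reported totals imply clips of a little under two seconds on average, which fits a corpus of isolated signs.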
