User, who art thou? User Profiling for Oral Corpus Platforms

This contribution presents the background, design and results of a study of users of three oral corpus platforms in Germany. Roughly 5.000 registered users of the Database for Spoken German (DGD), the GeWiss corpus and the corpora of the Hamburg Centre for Language Corpora (HZSK) were asked to participate in a user survey. This quantitative approach was complemented by qualitative interviews with selected users. We briefly introduce the corpus resources involved in the study in section 2. Section 3 describes the methods employed in the user studies. Section 4 summarizes results of the studies focusing on selected key topics. Section 5 attempts a generalization of these results to larger contexts.

[1]  Heike Zinsmeister,et al.  Requirements of a user-friendly, general-purpose corpus query interface , 2008 .

[2]  Steve Krug,et al.  Don't Make Me Think!: A Common Sense Approach to Web Usability , 2000 .

[3]  Thomas C. Schmidt,et al.  Multilingual Corpora at the Hamburg Centre for Language Corpora , 2011 .

[4]  Thomas Schmidt More) common ground for processing spoken language corpora , 2014 .

[5]  Thomas Schmidt,et al.  The Database for Spoken German ― DGD2 , 2014, LREC.

[6]  Daniel Stein,et al.  The HZSK Repository: Implementation, Features, and Use Cases of a Repository for Spoken Language Corpora , 2014, D Lib Mag..

[7]  Thomas Schmidt,et al.  The Research and Teaching Corpus of Spoken German ― FOLK , 2014, LREC.

[8]  Laurence Anthony,et al.  A critical look at software tools in corpus linguistics , 2013 .

[9]  Ana Frankenberg-Garcia,et al.  The corpus, its users and their needs: a user-oriented evaluation of COMPARA , 2007 .

[10]  Simon Krek,et al.  Compilation, transcription and usage of a reference speech corpus: the case of the Slovene corpus GOS , 2013, Language Resources and Evaluation.

[11]  Richard Wright,et al.  Accessing the spoken word , 2005, International Journal on Digital Libraries.

[12]  Carole Etienne,et al.  Grands corpus et linguistique outillée pour l'étude du français en interaction (plateforme CLAPI et corpus CIEL) , 2010 .

[13]  Olivier Baude,et al.  (Re)faire le corpus d’Orléans quarante ans après :quoi de neuf, linguiste ? , 2011 .

[14]  Michal Křen Recent Developments in the Czech National Corpus , 2015 .

[15]  Herman Stehouwer,et al.  Federated Search: Towards a Common Search Infrastructure , 2012, LREC.