论文信息 - Building an Audio-Visual Corpus of Australian English: Large Corpus Collection with an Economical Portable and Replicable Black Box

Building an Audio-Visual Corpus of Australian English: Large Corpus Collection with an Economical Portable and Replicable Black Box

The Big Australian Speech Corpus project incorporates the strategic goals of 30 Chief Investigators from various speech science areas. Speech from 1000 geographically and socially diverse speakers is being recorded using a uniform and automated protocol plus standardized hardware and software to produce a widely applicable and extensible database – AusTalk. Here we describe the project’s major components and organization; share the lessons learnt from difficulties and challenges; and present the results achieved so far. Index Terms: speech corpus, AV data, Australian English.

[1] Cynthia G. Clopper,et al. Prosodic Effects on Word Reduction , 2002, Language and speech.

[2] Steve Cassidy,et al. Ingesting the Auslan Corpus into the DADA Annotation Store , 2009, Linguistic Annotation Workshop.

[3] Takaaki Kuratate,et al. A blueprint for a comprehensive Australian English auditory-visual speech corpus , 2009 .

[4] Michael Clyne,et al. Ethnic Varieties of Australian English , 2001 .

[5] P. Lang. International affective picture system (IAPS) : affective ratings of pictures and instruction manual , 2005 .

[6] Dominique Estival,et al. The Big Australian Speech Corpus (The Big ASC) , 2010 .

[7] M. MacMahon. The woman behind ‘Arthur’ , 1991 .

[8] Maja Pantic,et al. Cost-Effective Solution to Synchronized Audio-Visual Capture Using Multiple Sensors , 2009, 2009 Sixth IEEE International Conference on Advanced Video and Signal Based Surveillance.

[9] Felicity Cox,et al. Timing Differences in the VC Rhyme of Standard Australian English and Lebanese Australian English , 2011, ICPhS.

[10] P. Lang,et al. International Affective Picture System (IAPS): Instruction Manual and Affective Ratings (Tech. Rep. No. A-4) , 1999 .

[11] Julie Vonwiller,et al. Speaker and Material Selection for the Australian National Database of Spoken Language , 1995, J. Quant. Linguistics.