The Big Australian Speech Corpus (The Big ASC)

Under an ARC Linkage Infrastructure, Equipment and Facilities (LIEF) grant, speech science and technology experts from across Australia have joined forces to organise the recording of audio-visual (AV) speech data from representative speakers of Australian English in all capital cities and some regional centres. The Big Australian Speech Corpus (the Big ASC) will provide a standard recording setup and a collaboratively-designed elicitation protocol to create a corpus of AV speech data incorporating annotations and metadata, accessible via a centralised storage facility. The Big ASC infrastructure will provide a significant boost to research in speech science and human communication in Australia.

[1]  Takaaki Kuratate,et al.  A blueprint for a comprehensive Australian English auditory-visual speech corpus , 2009 .

[2]  Felicity Cox,et al.  The Border Effect: Vowel Differences across the NSW - Victorian Border ∗ , 2003 .

[3]  Anne H. Anderson,et al.  The Hcrc Map Task Corpus , 1991 .

[4]  Ruth Campbell,et al.  Evaluating Theories of Language: Evidence from Disordered Communication , 1996 .

[5]  Philip Rose Forensic Speaker Identification , 2002 .

[6]  Jonathan Harrington,et al.  A national database of spoken language: concept, design, and implementation , 1990, ICSLP.

[7]  Felicity Cox,et al.  The changing face of Australian English vowels , 2001 .

[8]  David B. Grayden,et al.  Testing Auditory Processing Skills and their Associations with Language in 4—5-year-olds , 2010, Language and speech.

[9]  Robert Dale,et al.  Algorithms for Generating Referring Expressions: Do They Do What People Do? , 2006, INLG.

[10]  Andrew Butcher,et al.  Linguistic aspects of Australian Aboriginal English , 2008, Clinical linguistics & phonetics.

[11]  Kirsty McDougall,et al.  Individual Variation in the Frication of Voiceless Plosives in Australian English: A Study of Twins' Speech , 2010 .

[12]  David M. W. Powers,et al.  Suffix Tree Based Approach for Chinese Information Retrieval , 2008, 2008 Eighth International Conference on Intelligent Systems Design and Applications.

[13]  Felicity Cox,et al.  Regional variation in the vowels of female adolescents from sydney , 1998, ICSLP.

[14]  Vidhyasaharan Sethu,et al.  Speaker dependency of spectral features and speech production cues for automatic emotion classification , 2009, 2009 IEEE International Conference on Acoustics, Speech and Signal Processing.

[15]  Roland Göcke,et al.  Statistical analysis of the relationship between audio and video speech parameters for Australian English , 2003, AVSP.

[16]  Mohammed Bennamoun,et al.  Sparse Representation for Video-Based Face Recognition , 2009, ICB.

[17]  John Ingram,et al.  Connected Speech Processes in Australian English. , 1989 .

[18]  Yan Li,et al.  Speech Separation Based on higher Order Statistics Using Recurrent Neural Networks , 2001, HIS.

[19]  Steven J. Simske,et al.  Recognition of emotions in interactive voice response systems , 2003, INTERSPEECH.

[20]  David M. W. Powers,et al.  User keyword preference: the Nwords and Rwords experiments , 2008, Int. J. Internet Protoc. Technol..

[21]  David M. W. Powers,et al.  Rough Diamonds in Natural Language Learning , 2009, RSKT.

[22]  Janet Fletcher,et al.  Intonational Variation in Four Dialects of English: The High Rising Tune , 2010 .

[23]  Anne Cutler,et al.  The predominance of strong initial syllables in the English vocabulary , 1987 .

[24]  Anne Cutler,et al.  Unfolding of phonetic information over time: a database of Dutch diphone perception. , 2003, The Journal of the Acoustical Society of America.

[25]  Trent W. Lewis,et al.  Language teaching in a mixed reality games environment , 2008, PETRA '08.

[26]  Roland Göcke,et al.  Towards Affective Sensing , 2007, HCI.

[27]  Trent W. Lewis,et al.  Distinctive feature fusion for improved audio-visual phoneme recognition , 2005, Proceedings of the Eighth International Symposium on Signal Processing and Its Applications, 2005..

[28]  Dat Tran,et al.  A Fuzzy Approach to Speaker Verification , 2002, Int. J. Pattern Recognit. Artif. Intell..

[29]  David M. W. Powers,et al.  Characterization and evaluation of similarity measures for pairs of clusterings , 2009, Knowledge and Information Systems.

[30]  Takaaki Kuratate Text-to-AV synthesis system for Thinking Head Project , 2008, AVSP.

[31]  D. Grayden,et al.  Effect of Age and Cognition on Childhood Speech in Noise Perception Abilities , 2006, Audiology and Neurotology.

[32]  G. Clark,et al.  Electrode Discrimination and Speech Perception in Young Children Using Cochlear Implants , 2000, Ear and hearing.

[33]  Sharynne McLeod,et al.  Production of /st/ clusters in trochaic and iambic contexts by typically developing children , 2008 .

[34]  Felicity Cox,et al.  Reversal of short front vowel raising in Australian English , 2008, INTERSPEECH.

[35]  Nenagh Kemp,et al.  The Spelling of Vowels Is Influenced by Australian and British English Dialect Differences , 2009 .

[36]  David Grayden,et al.  Speech perception for adults who use hearing aids in conjunction with cochlear implants in opposite ears. , 2006, Journal of speech, language, and hearing research : JSLHR.

[37]  Robert Dale,et al.  Referring Expression Generation through Attribute-Based Heuristics , 2009, ENLG.

[38]  Anne Cutler,et al.  The lexical statistics of word recognition problems caused by L2 phonetic confusion , 2005, INTERSPEECH.

[39]  Mitsuo Gen,et al.  Fuzzy Methods for Voice-Based Person Authentication , 2004 .

[40]  Trent W. Lewis,et al.  Distinctive feature fusion for recognition of australian English consonants , 2008, INTERSPEECH.