Recent years have seen increased interest within the speaker recognition community in high-level features including, for example, lexical choice, idiomatic expressions or syntactic structures. The promise of speaker recognition in forensic applications drives development toward systems robust to channel differences by selecting features inherently robust to channel difference. Within the language recognition community, there is growing interest in differentiating not only languages but also mutually intelligible dialects of a single language. Decades of research in dialectology suggest that high-level features can enable systems to cluster speakers according to the dialects they speak. The Phanotics (Phonetic Annotation of Typicality in Conversational Speech) project seeks to identify high-level features characteristic of American dialects, annotate a corpus for these features, use the data to dialect recognition systems and also use the categorization to create better models for speaker recognition. The data, once published, should be useful to other developers of speaker and dialect recognition systems and to dialectologists and sociolinguists. We expect the methods will generalize well beyond the speakers, dialects, and languages discussed here and should, if successful, provide a model for how linguists and technology developers can collaborate in the future for the benefit of both groups and toward a deeper understanding of how languages vary and change.
[1]
Stephanie Strassel,et al.
Annotation Tools for Large-Scale Corpus Development: Using AGTK at the Linguistic Data Consortium
,
2004,
LREC.
[2]
Dominique Estival,et al.
Construction of a phonotactic dialect corpus using semiautomatic annotation
,
2007,
INTERSPEECH.
[3]
David Miller,et al.
The Fisher Corpus: a Resource for the Next Generations of Speech-to-Text
,
2004,
LREC.
[4]
Mark Liberman,et al.
The Mixer and Transcript Reading Corpora: Resources for Multilingual, Crosschannel Speaker Recognition Research
,
2006,
LREC.