PhoneBook: a phonetically-rich isolated-word telephone-speech database

Describes the collection of a phonetically-rich isolated-word telephone-speech database, "PhoneBook", which was undertaken because of (1) the lack of available large-vocabulary isolated-word data, (2) anticipated continued importance of isolated-word and keyword-spotting technology to speech-recognition-based applications over the telephone, and (3) findings that continuous-speech training data is inferior to isolated-word training for isolated-word recognition. PhoneBook has nearly 8000 distinct words, selected for complete coverage of phoneme contexts enumerated using both triphones and a novel method which takes into account syllable position, lexical stress, and non-adjacent-phoneme coarticulatory effects. PhoneBook consists of more than 92000 utterances, averaging over 11 talkers for each word. A demographically-representative set of over 1300 native speakers of American English each made a single telephone call and read 75 words. The paper describes the word list design, talker enrolment procedure, recording procedure and equipment, utterance verification method, and summary statistics for PhoneBook, which will be made available through the Linguistic Data Consortium.