A telephone speech database of spelled and spoken names

This report describes a telephone speech corpus collected at the Oregon Graduate Institute's Center for Spoken Language Understanding. Over four thousand people called in response to public requests. They were prompted by a recorded voice to say and spell their first and last names, with and without pauses, to say what city they grew up in and what city they were calling from, and to answer two yes/no questions. In order to collect sufficient instances of each letter, about 1000 callers also recited the alphabet. Each call is checked and transcribed by two people. In addition, a subset of the calls is being phonetically labeled.