Telephone data collection using the World Wide Web

Over the past year our group has begun development of telephone-based speech understanding capability for our GALAXY conversational system. An important part of this process has been the collection of telephone speech which was used for training and evaluation. In the first phase of data collection our goal was to collect read speech from a wide variety of talkers, telephone handsets, and noise/channel conditions. In the second phase of data collection our additional goal was to collect spontaneous telephone speech from subjects actually using the system. In order to maximize variation in telephone conditions, as well as ease of use for subjects, the data collection software was designed to telephone subjects at their specified phone numbers around North America. Subjects initiate the data collection session by submitting an electronic form accessible by a WWW browser. For read speech collection, a set of prompts is automatically generated for the subject. This paper describes the design of the data collection system we are using for these purposes. To date we have collected over 9,000 utterances from over 270 subjects.

[1]  John J. Godfrey,et al.  SWITCHBOARD: telephone speech corpus for research and development , 1992, [Proceedings] ICASSP-92: 1992 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[2]  Victor Zue,et al.  GALAXY: a human-language interface to on-line travel information , 1994, ICSLP.

[3]  Victor Zue,et al.  The Collection and Preliminary Analysis of a Spontaneous Speech Database , 1989, HLT.

[4]  Joseph Picone,et al.  The voice across Japan database-the Japanese language contribution to Polyphone , 1994, Proceedings of ICASSP '94. IEEE International Conference on Acoustics, Speech and Signal Processing.

[5]  Joseph Picone,et al.  Voice across Hispanic America: a telephone speech corpus of American Spanish , 1995, 1995 International Conference on Acoustics, Speech, and Signal Processing.

[6]  John J. Godfrey,et al.  Macrophone: an American English telephone speech corpus for the Polyphone project , 1994, Proceedings of ICASSP '94. IEEE International Conference on Acoustics, Speech and Signal Processing.

[7]  Victor Zue,et al.  Collection of Spontaneous Speech for the ATIS Domain and Comparative Analyses of Data Collected at MIT and TI , 1991, HLT.

[8]  Victor Zue,et al.  Collection and Analyses of WSJ-CSR Data at MIT , 1992, HLT.

[9]  Victor Zue Navigating the Information Superhighway Using Spoken Language Interfaces , 1995, IEEE Expert.