Deploying speech applications over the web

At Digital Equipment Corporation's Cambridge Research Lab (CRL), the Speech Interaction Group has been focusing on building speech applications for deployment over the World-Wide Web. Web-based speech applications require the browser to capture and transmit speech to remote servers for back-end processing, maintain application state, and present multi-media responses. This paper describes the group's strategy for delivering speech applications built around a mechanism, the digital Voice Plugin, for capturing and transmitting audio from a browser. It describes a conversational application implemented within this framework and discusses the problems of delivering these systems on the Web. In addition, we brie y touch upon some other Web-based speech applications that have been developed at CRL.

[1]  Victor Zue,et al.  GALAXY: a human-language interface to on-line travel information , 1994, ICSLP.

[2]  Samuel Bayer Embedding speech in Web interfaces , 1996, Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96.

[3]  James R. Glass,et al.  Telephone data collection using the World Wide Web , 1996, Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96.

[4]  Keith Waters,et al.  Driving synthetic mouth gestures: phonetic recognition for faceme! , 1997, EUROSPEECH.