Automated directory assistance system - from theory to practice

Abstract The automated directory assistance system (ADAS) is traditionally formulated as an automatic speech recognition (ASR) problem. Recently, it has been formulated as a voice search problem, where a spoken utterance is firstly converted into text, which in turn is used to search for the listing. In this paper, we focus on the design and development of the utterance-to-listing component of ADAS. We show that many theoretical and practical issues need to be resolved when applying the basic idea of voice search to the development of ADAS. We share our experiences in addressing these issues, especially in pre-processing the listing database, generating a high performance LM, and developing efficient, accurate, and robust search algorithms. Field tests of our prototype system indicate that an 81% task completion rate can be achieved. Index Terms : speech recognition, directory assistance, voice search, TFIDF, spoken dialog system, vector space model 1. Introduction An automated directory assistance system (ADAS) [1] [2] [3] [5] [6] is a spoken dialog system that provides the caller with the phone number and/or address of the business or residential listing he/she requests. It is a very complicated system that involves automatic speech recognition (ASR), listing lookup, disambiguation, and dialog design. The core element of the ADAS is the utterance-to-listing (U2L) component that maps an utterance K

[1]  Michael McGill,et al.  Introduction to Modern Information Retrieval , 1983 .

[2]  Adam L. Berger,et al.  A Maximum Entropy Approach to Natural Language Processing , 1996, CL.

[3]  Andreas Kellner,et al.  Towards an automated directory information system , 1997, EUROSPEECH.

[4]  F. Canavesio,et al.  Automation of Telecom Italia directory assistance service: field trial results , 1998, Proceedings 1998 IEEE 4th Workshop Interactive Voice Technology for Telecommunications Applications. IVTTA '98 (Cat. No.98TH8376).

[5]  Shankar Kumar,et al.  Normalization of non-standard words , 2001, Comput. Speech Lang..

[6]  Lou Boves,et al.  Business listings in automatic directory assistance , 2001, INTERSPEECH.

[7]  Richard M. Schwartz,et al.  A scalable architecture for Directory Assistance automation , 2002, 2002 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[8]  Geoffrey Zweig,et al.  Automatic construction of unique signatures and confusable sets for natural language directory assistance applications , 2003, INTERSPEECH.

[9]  Esther Levin,et al.  Voice user interface design for automated directory assistance , 2005, INTERSPEECH.

[10]  Dong Yu,et al.  N-Gram Based Filler Model for Robust Grammar Authoring , 2006, 2006 IEEE International Conference on Acoustics Speech and Signal Processing Proceedings.

[11]  Tim Paek,et al.  People watcher: a game for eliciting human-transcribed data for automated directory assistance , 2007, INTERSPEECH.