A Critical Review of State-Of-The-Art Technologies for Cross-Language Speech Retrieval

Recent developments in monolingual speech retrieval, ahtomatic speech recognition and crosslanguage text retrieval suggest that cro.s.slanguage speech retrieval deserves some enquiry. To direct work in this new area, it is important to appreciate the possibilities and limitations of the component technologies. This paper gives a critical review of the state-of-the-art of technologies for speech retrieval in cross-la~lguage environments and outlines some possible experimental paradigms.

[1]  Karen Spärck Jones,et al.  Retrieving spoken documents by combining multiple index sources , 1996, SIGIR '96.

[2]  Michael J. Witbrock,et al.  News-on-Demand: An Application of Informedia® Technology , 1995, D Lib Mag..

[3]  Alon Lavie,et al.  JANUS-II-translation of spontaneous conversational speech , 1996, 1996 IEEE International Conference on Acoustics, Speech, and Signal Processing Conference Proceedings.

[4]  Douglas W. Oard,et al.  A survey of multilingual text retrieval , 1996 .

[5]  Xavier L. Aubert,et al.  The Philips large-vocabulary recognition system for american English, French, and German , 1995, EUROSPEECH.

[6]  Gregory Grefenstette,et al.  Querying across languages: a dictionary-based approach to multilingual information retrieval , 1996, SIGIR '96.

[7]  Peter Schäuble,et al.  Cross-language speech retrieval: establishing a baseline performance , 1997, SIGIR '97.

[8]  Lori Lamel,et al.  Transcribing broadcast news shows , 1997, 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[9]  Larry Gillick,et al.  Multilingual speech recognition at Dragon Systems , 1996, Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96.

[10]  Chao-Huang Chang,et al.  CCLMDS'96: towards a speaker-independent large-vocabulary Mandarin dictation system , 1997, 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[11]  Peter Schäuble,et al.  A system for retrieving speech documents , 1992, SIGIR '92.

[12]  Sean Connolly,et al.  Improvements in switchboard recognition and topic identification , 1996, 1996 IEEE International Conference on Acoustics, Speech, and Signal Processing Conference Proceedings.

[13]  Thomas Bub,et al.  VERBMOBIL: the evolution of a complex large speech-to-speech translation system , 1996, Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96.

[14]  Roger C. F. Tucker,et al.  Automatic language identification using sub-word models , 1994, Proceedings of ICASSP '94. IEEE International Conference on Acoustics, Speech and Signal Processing.

[15]  Marc A. Zissman,et al.  Automatic dialect identification of extemporaneous conversational, Latin American Spanish speech , 1996, 1996 IEEE International Conference on Acoustics, Speech, and Signal Processing Conference Proceedings.

[16]  Jean-Luc Gauvain,et al.  Developments in continuous speech dictation using the 1995 ARPA NAB news task , 1996, 1996 IEEE International Conference on Acoustics, Speech, and Signal Processing Conference Proceedings.

[17]  Vijay Balasubramanian,et al.  Speech-Based Retrieval Using Semantic Co-Occurrence Filtering , 1994, HLT.

[18]  Steven DeGennaro,et al.  1.0 TANGORA - a large vocabulary speech recognition system for five languages , 1991, EUROSPEECH.

[19]  Lori Lamel,et al.  Issues in Large Vocabulary, Multilingual Speech Recognition , 1995, EUROSPEECH.

[20]  Steve R. Waterhouse,et al.  Transcription of broadcast television and radio news: the 1996 ABBOT system , 1997, 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[21]  Alon Lavie,et al.  Translation of conversational speech with JANUS-II , 1996, Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96.

[22]  Lori Lamel,et al.  Developments in large vocabulary, continuous speech recognition of German , 1996, 1996 IEEE International Conference on Acoustics, Speech, and Signal Processing Conference Proceedings.

[23]  Herbert Gish,et al.  Approaches to topic identification on the switchboard corpus , 1994, Proceedings of ICASSP '94. IEEE International Conference on Acoustics, Speech and Signal Processing.

[24]  Jean Paul Ballerini,et al.  Experiments in multilingual information retrieval using the SPIDER system , 1996, SIGIR '96.

[25]  Stéphane H. Maes,et al.  Transcription of broadcast news-system robustness issues and adaptation techniques , 1997, 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[26]  Manny Rayner,et al.  Adapting the Core Language Engine to French and Spanish , 1996, ArXiv.

[27]  Manny Rayner,et al.  The Speech-Language Interface in the Spoken Language Translator , 1994, ArXiv.

[28]  Mark J. F. Gales,et al.  Improving environmental robustness in large vocabulary speech recognition , 1996, 1996 IEEE International Conference on Acoustics, Speech, and Signal Processing Conference Proceedings.

[29]  Jean-Luc Gauvain,et al.  Spoken language processing in a multilingual context , 1996, Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96.

[30]  Karen Spärck Jones,et al.  Open-vocabulary speech indexing for voice and video mail retrieval , 1997, MULTIMEDIA '96.

[31]  Mark J. F. Gales,et al.  Broadcast news transcription using HTK , 1997, 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing.