LARGE-VOCABULARY CHINESE TEXT/SPEECH INFORMATION RETRIEVAL USING MANDARIN SPEECH QUERIES

The network technology and the Internet are creating a completely new information era. It is believed that in the near future numerous of digital libraries and a great variety of multimedia databases, which consist of heterogeneous types of information including text, audio, image, video and so on, will be available worldwide via the Internet. This paper deals with the problem of Chinese text and Mandarin speech information retrieval with Mandarin speech queries. Instead of using the syllable-based information alone, the word-based information was also successfully incorporated to further improve the retrieving performance. A prototype system with an interface supporting some user-friendly functions was successfully implemented and the initial test results verified the feasibility of our approaches.

[1]  Keh-Jiann Chen,et al.  Unconstrained speech retrieval for Chinese document databases with very large vocabulary and unlimited domains , 1995, EUROSPEECH.

[2]  Lin-Shan Lee,et al.  Very-large-vocabulary Mandarin voice message file retrieval using speech queries , 1996, Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96.

[3]  Karen Spärck Jones,et al.  Experiments in Spoken Document Retrieval , 1996, Inf. Process. Manag..

[4]  Lin-Shan Lee,et al.  Syllable-based relevance feedback techniques for Mandarin voice record retrieval using speech queries , 1997, 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[5]  Lin-Shan Lee,et al.  Intelligent retrieval of dynamic networked information from mobile terminals using spoken natural language queries , 1998 .

[6]  Victor Zue,et al.  Phonetic recognition for spoken document retrieval , 1998, Proceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '98 (Cat. No.98CH36181).

[7]  Lin-Shan Lee,et al.  A*-admissible key-phrase spotting with sub-syllable level utterance verification , 1998, ICSLP.