In the future network era, a huge volume of information on all subject domains will be readily available via the network. This network information is dynamic, ever-changing and rapidly growing. Furthermore, many spoken language processing applications will have to deal with the content of this dynamic network information, which means that dynamic lexicons, language models and so on will be required. To cope with such a new network environment, automatic approaches for the collection, classification, indexing, organization and utilization of the linguistic data obtainable from the networks will be very important for language processing applications. On the one hand, high-performance spoken language technology can hopefully be developed based on such dynamic linguistic data on the network. On the other hand, it is also necessary that such spoken language technology be intelligently adapted to the content of the dynamic, ever-changing network information. Some basic concepts for live lexicons and dynamic corpora adapted to the network resources have been developed for Chinese spoken language processing applications and are briefly summarized in this paper. Although the major considerations here are for the Chinese language, the concepts may apply equally to other languages.