Association-Based Segmentation for Chinese-Crossed Query Expansion

The continually and high-rate growth of China's economy has attracted more and more international investors. These investors have an urgent need of identifying patterns in Chinese information, which are potentially useful in making competitive decisions. The first step of deeply understanding and analyzing Chinese information is how to effectively search those likely relevant to a user query. However, queries provided by users are often incomplete and inappropriate to the information systems, especially for retrieving Chinese-crossed information. In this paper, we present a segmentation based on actionable Chinese term-association analysis for better understanding user queries so as to automatically generate Chinese-crossed-query expansions. The semantics behind the actionable term-association rules is thus studied. Experiments conducted have shown that our approach is efficient and promising.

[1]  Osmar R. Zaïane,et al.  Text document categorization by term association , 2002, 2002 IEEE International Conference on Data Mining, 2002. Proceedings..

[2]  Beng Chin Ooi,et al.  Mining Term Association Rules for Global Query Expansion: A Case Study with Topic 202 from TREC4 , 2000 .

[3]  Yehuda Lindell,et al.  Text Mining at the Term Level , 1998, PKDD.

[4]  Chengqi Zhang,et al.  Identifying frequent terms in text databases by association semantics , 2003, Proceedings ITCC 2003. International Conference on Information Technology: Coding and Computing.

[5]  Mathias Géry,et al.  Knowledge Discovery for Automatic Query Expansion on the World-Wide Web , 1999, ER.

[6]  Chengqi Zhang,et al.  Post-mining: maintenance of association rules by weighting , 2003, Inf. Syst..

[7]  George A. Miller,et al.  WordNet: A Lexical Database for English , 1995, HLT.

[8]  Jean-Pierre Chevallet,et al.  Relations between Terms Discovered by Association Rules , 2000 .

[9]  Beng Chin Ooi,et al.  Mining term association rules for automatic global query expansion: methodology and preliminary results , 2000, Proceedings of the First International Conference on Web Information Systems Engineering.