论文信息 - High-performance FAQ retrieval using an automatic clustering method of query logs

High-performance FAQ retrieval using an automatic clustering method of query logs

To resolve some of lexical disagreement problems between queries and FAQs, we propose a reliable FAQ retrieval system using query log clustering. On indexing time, the proposed system clusters the logs of users' queries into predefined FAQ categories. To increase the precision and the recall rate of clustering, the proposed system adopts a new similarity measure using a machine readable dictionary. On searching time, the proposed system calculates the similarities between users' queries and each cluster in order to smooth FAQs. By virtue of the cluster-based retrieval technique, the proposed system could partially bridge lexical chasms between queries and FAQs. In addition, the proposed system outperforms the traditional information retrieval systems in FAQ retrieval.

Jungyun Seo | Harksoo Kim

[1] John D. Lafferty,et al. A study of smoothing methods for language models applied to Ad Hoc information retrieval , 2001, SIGIR '01.

[2] Hideki Kozima,et al. Similarity between Words Computed by Spreading Activation on an English Dictionary , 1993, EACL.

[3] Eriks Sneiders,et al. Automated FAQ Answering: Continued Experience with Shallow Language Understanding , 1999 .

[4] Peter Willett,et al. Comparison of Hierarchie Agglomerative Clustering Methods for Document Retrieval , 1989, Comput. J..

[5] Kristian J. Hammond,et al. Question Answering from Frequently Asked Question Files: Experiences with the FAQ FINDER System , 1997, AI Mag..

[6] Jörg P. Müller,et al. Doing business in the information marketplace: a case study , 1999, AGENTS '99.

[7] Kristian J. Hammond,et al. FAQ finder: a case-based approach to knowledge navigation , 1995, Proceedings the 11th Conference on Artificial Intelligence for Applications.

[8] W. Bruce Croft,et al. Document clustering: An evaluation of some experiments with the cranfield 1400 collection , 1975, Inf. Process. Manag..

[9] C. J. van Rijsbergen,et al. The use of hierarchic clustering in information retrieval , 1971, Inf. Storage Retr..

[10] Marek Świdziński,et al. The Design of a Universal Basic Dictionary of Contemporary Polish , 1990 .

[11] Michael McGill,et al. Introduction to Modern Information Retrieval , 1983 .