论文信息 - Query Similarity Computing Based on System Similarity Measurement

Query Similarity Computing Based on System Similarity Measurement

Query similarity computation is one of important factors in the process of query clustering. It has been used widely in the field of information processing. In this paper, a unified model for query similarity computation is presented based on system similarity. The novel approach of similarity computation uses the literal, semantic and statistical relative features of query. The method can take advantage of the normal approaches to improve the computation accuracy. Experiments show that the proposed method is an effective solution to the query similarity computation problem, and it can be generalized to measure the similarity of other components of text, such as sentences, paragraphs etc.

[1] Philip Resnik,et al. Semantic Similarity in a Taxonomy: An Information-Based Measure and its Application to Problems of Ambiguity in Natural Language , 1999, J. Artif. Intell. Res..

[2] Dekang Lin,et al. Automatic Retrieval and Clustering of Similar Words , 1998, ACL.

[3] Johanna D. Moore,et al. 36th Annual Meeting of the Association for Computational Linguistics and 17th International Conference on Computational Linguistics, COLING-ACL '98, August 10-14, 1998, Université de Montréal, Montréal, Quebec, Canada. Proceedings of the Conference. , 1998 .

[4] Zhou Meili. Some Concepts and Mathematical Consideration of Similarity System Theory , 1992 .

[5] Peter D. Turney. Mining the Web for Synonyms: PMI-IR versus LSA on TOEFL , 2001, ECML.

[6] Qun Liu,et al. Semantic computation in a Chinese Question-Answering system , 2002, Journal of Computer Science and Technology.

[7] Sergei Nirenburg,et al. Two Approaches to Matching in Example-Based Machine Translation , 1993, TMI.

[8] Charles Elkan,et al. The Field Matching Problem: Algorithms and Applications , 1996, KDD.

[9] Luc De Raedt,et al. Machine Learning: ECML 2001 , 2001, Lecture Notes in Computer Science.

[10] Carolyn J. Crouch,et al. An approach to the automatic construction of global thesauri , 1990, Inf. Process. Manag..