UVA: Language Modeling Techniques for Web People Search
暂无分享,去创建一个
In this paper we describe our participation in the SemEval 2007 Web People Search task. Our main aim in participating was to adapt language modeling tools for the task, and to experiment with various document representations. Our main finding is that single pass clustering, using title, snippet and body to represent documents, is the most effective setting.
[1] Julio Gonzalo,et al. The SemEval-2007 WePS Evaluation: Establishing a benchmark for the Web People Search Task , 2007, Fourth International Workshop on Semantic Evaluations (SemEval-2007).
[2] David M. Pennock,et al. Categories and Subject Descriptors , 2001 .
[3] Thomas Kalt,et al. A New Probabilistic Model of Text Classification and Retrieval , 1998 .
[4] Thomas Hofmann,et al. Probabilistic Latent Semantic Analysis , 1999, UAI.