Novelty Detection via Answer Updating

Abstract : The detection of new and novel information in a document stream is an important component of potential applications. This paper describes an answer updating approach to novelty detection at the sentence level. Specifically, we explore the use of question-answering techniques for novelty detection. New information is defined as new/previously unseen answers to questions representing a user's information need. A sentence is treated as novel sentence if the system believes that it may contain a previously unseen answer to the question. In our answer updating approach, there are two important steps: question formulation and new answer detection. Experiments were carried out on data from the TREC 2002 novelty track using the proposed approach. The results show that novelty detection via answer updating outperforms other novelty measures reported in the literature in terms of precision at low recall.

[1]  Yi Zhang,et al.  Novelty and redundancy detection in adaptive filtering , 2002, SIGIR '02.

[2]  Joe Carthy,et al.  First Story Detection using a Composite Document Representation , 2001, HLT.

[3]  Xiaoyan Li,et al.  Syntactic features in question answering , 2003, SIGIR.

[4]  Yiming Yang,et al.  A study of retrospective and on-line event detection , 1998, SIGIR '98.

[5]  W. Bruce Croft,et al.  Evaluating Question-Answering Techniques in Chinese , 2001, HLT.

[6]  Donna K. Harman,et al.  Overview of the TREC 2002 Novelty Track , 2002, TREC.

[7]  Thorsten Brants,et al.  A System for new event detection , 2003, SIGIR.

[8]  Kui-Lam Kwok,et al.  TREC 2002 Web, Novelty and Filtering Track Experiments using PIRCS , 2002, TREC.

[9]  James Allan,et al.  First story detection in TDT is hard , 2000, CIKM '00.

[10]  Ellen M. Voorhees,et al.  Overview of the TREC 2004 Novelty Track. , 2005 .

[11]  Dragomir R. Radev,et al.  The University of Michigan at TREC 2002: Question Answering and Novelty Tracks , 2002, TREC.

[12]  James Allan,et al.  Retrieval and novelty detection at the sentence level , 2003, SIGIR.

[13]  Yiming Yang,et al.  Topic-conditioned novelty detection , 2002, KDD.

[14]  S. Robertson The probability ranking principle in IR , 1997 .

[15]  Richard M. Schwartz,et al.  An Algorithm that Learns What's in a Name , 1999, Machine Learning.

[16]  Padmini Srinivasan,et al.  Novel Results and Some Answers - The University of Iowa TREC 11 Results , 2002, TREC.

[17]  Yiqun Liu,et al.  THU TREC 2002: Novelty Track Experiments , 2002, TREC.

[18]  Tsutomu Hirao,et al.  A Machine Learning Approach for QA and Novelty Tracks: NTT System Description , 2002, TREC.

[19]  James Allan,et al.  On-Line New Event Detection and Tracking , 1998, SIGIR.