Topic Detection and Tracking with Spatio-Temporal Evidence

Topic Detection and Tracking is an event-based information organization task where online news streams are monitored in order to spot new unreported events and link documents with previously detected events. The detection has proven to perform rather poorly with traditional information retrieval approaches. We present an approach that formalizes temporal expressions and augments spatial terms with ontological information and uses this data in the detection. In addition, instead using a single term vector as a document representation, we split the terms into four semantic classes and process and weigh the classes separately. The approach is motivated by experiments.

[1]  James Allan,et al.  Topic detection and tracking: event-based information organization , 2002 .

[2]  Yiming Yang,et al.  Learning approaches for detecting and tracking news events , 1999, IEEE Intell. Syst..

[3]  Helena Ahonen-Myka,et al.  Applying Semantic Classes in Event Detection and Tracking , 2002 .

[4]  Yiming Yang,et al.  Improving text categorization methods for event tracking , 2000, SIGIR '00.

[5]  James Allan,et al.  First story detection in TDT is hard , 2000, CIKM '00.

[6]  J. Allan,et al.  On-Line New Event Detection using Single Pass Clustering , 1998 .

[7]  James Allan,et al.  Explorations within topic tracking and detection , 2002 .

[8]  Yiming Yang,et al.  Topic-conditioned novelty detection , 2002, KDD.

[9]  Duane Szafron,et al.  Temporal Granularity: Completing the Puzzle , 2004, Journal of Intelligent Information Systems.

[10]  L. Baker,et al.  A Hierarchical Probabilistic Model for Novelty Detection in Text , 1999, NIPS 1999.

[11]  James Allan,et al.  Relevance models for topic detection and tracking , 2002 .

[12]  Frank Schilder,et al.  From Temporal Expressions To Temporal Information: Semantic Tagging Of News Messages , 2001, The Language of Time - A Reader.

[13]  C. J. van Rijsbergen,et al.  Information Retrieval , 1979, Encyclopedia of GIS.

[14]  Victor Lavrenko,et al.  Event Tracking , 1998 .

[15]  Klaus Krippendorff,et al.  On the Reliability of Unitizing Continuous Data , 1995 .

[16]  Helena Ahonen-Myka,et al.  Extraction of Temporal Expressions from Finnish News-feed , 2003 .

[17]  Ronald Rosenfeld,et al.  Large-Scale Topic Detection and Language Model Adaptation. , 1997 .

[18]  Lillian Lee,et al.  On the effectiveness of the skew divergence for statistical language analysis , 2001, AISTATS.

[19]  Yiming Yang,et al.  Topic Detection and Tracking Pilot Study Final Report , 1998 .

[20]  Dov M. Gabbay,et al.  Handbook of logic in artificial intelligence and logic programming (Vol. 4): epistemic and temporal reasoning , 1995 .

[21]  Larry Gillick,et al.  Text segmentation and topic tracking on broadcast news via a hidden Markov model approach , 1998, ICSLP.

[22]  Antony Galton Time and change for AI , 1995 .