A review of retrospective news event detection

Retrospective news event detection (RED) has been studied for many years as a way to discover previously unidentified events. Ongoing work aims to improve RED techniques, such as distance measures and clustering approaches, in order to overcome issues such as the high dimensionality of the data. This paper discusses the three major sequential stages of RED (data preprocessing, data representation, and data organization) and reports the limitations of each stage. Finally, we present a suggested RED approach for the crime domain.
