Event Information Organization Algorithm about Enterprises Based on Timeline

To organize process of a number of events containing common topics, the paper proposed an event information organization algorithm by two text clustering, topic clustering which combines Latent dirichlet allocation model (LDA model) and TF-IDF model and event Hierarchical Clustering based on timeline. Topic clustering is a topic clustering through dynamic K-means in which the document vector distance is calculated. It can be known that events sets have a common topic by this clustering method. And another clustering method event hierarchical clustering is a clustering events phase based on timeline. In a word, this paper organizes event information by combining the above two clustering methods.

[1]  Heng Ji,et al.  Refining Event Extraction through Cross-Document Inference , 2008, ACL.

[2]  Alistair Moffat,et al.  Effective document presentation with a locality-based similarity heuristic , 1999, SIGIR '99.

[3]  Qi He,et al.  A Model for Anticipatory Event Detection , 2006, ER.

[4]  J. MacQueen Some methods for classification and analysis of multivariate observations , 1967 .

[5]  Thorsten Brants,et al.  A System for new event detection , 2003, SIGIR.

[6]  Hwee Tou Ng,et al.  A maximum entropy approach to information extraction from semi-structured and free text , 2002, AAAI/IAAI.

[7]  Kuo Zhang,et al.  New event detection based on indexing-tree and named entity , 2007, SIGIR.

[8]  James Allan,et al.  On-Line New Event Detection and Tracking , 1998, SIGIR Forum.

[9]  Chris H. Q. Ding,et al.  Multi-document summarization via sentence-level semantic analysis and symmetric matrix factorization , 2008, SIGIR '08.

[10]  Joe Carthy,et al.  Combining semantic and syntactic document classifiers to improve first story detection , 2001, SIGIR '01.

[11]  Heng Ji,et al.  Language Specific Issue and Feature Exploration in Chinese Event Extraction , 2009, NAACL.

[12]  Yiming Yang,et al.  A study of retrospective and on-line event detection , 1998, SIGIR '98.

[13]  David Ahn,et al.  The stages of event extraction , 2006 .

[14]  Gerard Salton,et al.  Research and Development in Information Retrieval , 1982, Lecture Notes in Computer Science.

[15]  Wei Wang,et al.  Web Event Topic Analysis by Topic Feature Clustering and Extended LDA Model , 2014, J. Softw..