Event Detection for Supporting Environmental Scanning: An Information Extraction-based Approach

Environmental scanning, the acquisition and use of the information about events, trends, and relationships in an organization’s external environment, permits an organization to adapt to its environment and to develop effective responses to secure or improve the organization’s position in the future. Event detection technique that identifies the onset of new events from streams of news stories would facilitate the process of organization’s environmental scanning. However, traditional feature-based event detection techniques cannot capture the genuine properties of an event contained in a news story and cannot support event categorization and news stories filtering. In this study, we developed an information extraction-based event detection (NEED) technique that combines information extraction and text categorization techniques to address the problems inherent to traditional feature-based event detection techniques. Using a traditional feature-based event detection technique (INCR) as benchmarks, the empirical evaluation results showed that the proposed NEED technique improved the effectiveness of event detection measured by miss and false alarm rates.

[1]  Hwee Tou Ng,et al.  Feature selection, perceptron learning, and a usability case study for text categorization , 1997, SIGIR '97.

[2]  Yiming Yang,et al.  A re-examination of text categorization methods , 1999, SIGIR '99.

[3]  Alberto Maria Segre,et al.  Programs for Machine Learning , 1994 .

[4]  James Allan,et al.  On-Line New Event Detection and Tracking , 1998, SIGIR.

[5]  K. Minton Extraction Patterns for Information Extraction Tasks : A Survey , 1999 .

[6]  Ellen Riloff,et al.  Information extraction as a basis for high-precision text classification , 1994, TOIS.

[7]  Sholom M. Weiss,et al.  Automated learning of decision rules for text categorization , 1994, TOIS.

[8]  Andrew McCallum,et al.  Distributional clustering of words for text classification , 1998, SIGIR '98.

[9]  Yiming Yang,et al.  Expert network: effective and efficient learning from human decisions in text categorization and retrieval , 1994, SIGIR '94.

[10]  Line Eikvil,et al.  Information Extraction from World Wide Web - A Survey , 1999 .

[11]  Yiming Yang,et al.  Learning approaches for detecting and tracking news events , 1999, IEEE Intell. Syst..

[12]  Maria T. Pazienza,et al.  Information Extraction , 2002, Lecture Notes in Computer Science.

[13]  W. Bruce Croft,et al.  Combining classifiers in text categorization , 1996, SIGIR '96.

[14]  David D. Lewis,et al.  A comparison of two learning algorithms for text categorization , 1994 .

[15]  Atro Voutilainen,et al.  NPtool, a Detector of English Noun Phrases , 1995, VLC@ACL.

[16]  Niv Ahituv,et al.  Environmental scanning and information systems in relation to success in introducing new products , 1998, Inf. Manag..

[17]  Yoram Singer,et al.  Context-sensitive learning methods for text categorization , 1996, SIGIR '96.

[18]  Susan T. Dumais,et al.  Inductive learning algorithms and representations for text categorization , 1998, CIKM '98.

[19]  Yiming Yang,et al.  An example-based mapping method for text categorization and retrieval , 1994, TOIS.

[20]  David W. Embley,et al.  Ontology-based extraction and structuring of information from data-rich unstructured documents , 1998, CIKM '98.

[21]  David L. Waltz,et al.  Classifying news stories using memory based reasoning , 1992, SIGIR '92.

[22]  Peter Clark,et al.  Rule Induction with CN2: Some Recent Improvements , 1991, EWSL.

[23]  Hinrich Schütze,et al.  A comparison of classifiers and document representations for the routing problem , 1995, SIGIR '95.