Extracting Violent Events From On-Line News for Ontology Population

This paper presents NEXUS, an event extraction system developed at the Joint Research Center of the European Commission and used to populate violent incident knowledge bases. It automatically extracts security-related facts from on-line news articles. In particular, the paper focuses on a novel bootstrapping algorithm for weakly supervised acquisition of extraction patterns from clustered news, on cluster-level information fusion, and on the pattern specification language. Finally, a preliminary evaluation of NEXUS on real-world data is presented; it shows acceptable precision and strong application potential.
