Extraction of nationality from crime news

Most of the crimes committed today are reported on the Internet through news, blogs and social networking sites. These sources have provided a huge amount of crime data, presenting a need for a means to extract useful information. In this research, the evaluation of Direct and Indirect extraction of nationality from crime news, along with the additional references to identify the nationalities of suspects, victims and witnesses is presented. Named entity recognition using gazetteers and rule-based extraction are used, in addition to co-reference resolution to link references. The proposed approach was evaluated and compared to manual extraction system. The results indicate that the proposed approach is able to extract most information related to nationality from crimes news and identify the additional information references. Its performance proved good, with 55% precision, 96% recall, and 70% f-measure evaluation metrics.

[1]  Chih Hao Ku,et al.  Crime Information Extraction from Police and Witness Narrative Reports , 2008, 2008 IEEE Conference on Technologies for Homeland Security.

[2]  Xianpei Han,et al.  CASIANED: People Attribute Extraction based on Information Extraction , 2009 .

[3]  Marie-Francine Moens,et al.  Information Extraction: Algorithms and Prospects in a Retrieval Context , 2006, The Information Retrieval Series.

[4]  Hussein Zedan,et al.  Crime Type Document Classification from Arabic Corpus , 2009, 2009 Second International Conference on Developments in eSystems Engineering.

[5]  Gondy Leroy,et al.  Natural language processing and e-Government: crime information extraction from heterogeneous data sources , 2008, DG.O.

[6]  Lynette Hirschman,et al.  Evaluating Message Understanding Systems: An Analysis of the Third Message Understanding Conference (MUC-3) , 1993, CL.

[7]  C. Tang The Linkages among Inflation, Unemployment and Crime Rates in Malaysia , 2009 .

[8]  Claire Cardie,et al.  Evaluating an Information Extraction System , 1994 .

[9]  Hsinchun Chen,et al.  Using Coplink to Analyze Criminal-Justice Data , 2002, Computer.

[10]  Fabio Crestani,et al.  Towards an Automated Approach to Offender Profiling , 2008, 2008 International Conference on Computational Sciences and Its Applications.

[11]  Gondy Leroy,et al.  Natural Language Processing and e-Government: Extracting Reusable Crime Report Information , 2007, 2007 IEEE International Conference on Information Reuse and Integration.

[12]  Luo Xiao,et al.  Information Extraction from the Web: System and Techniques , 2004, Applied Intelligence.

[13]  Vladia Pinheiro,et al.  Natural Language Processing based on Semantic inferentialism for extracting crime information from text , 2010, 2010 IEEE International Conference on Intelligence and Security Informatics.

[14]  James H. Martin,et al.  Speech and Language Processing: An Introduction to Natural Language Processing, Computational Linguistics, and Speech Recognition , 2000 .

[15]  Ronen Feldman,et al.  The Text Mining Handbook: Information Extraction , 2006 .

[16]  Ellen Riloff,et al.  Exploiting Role-Identifying Nouns and Expressions for Information Extraction , 2007 .

[17]  Hussein Zedan,et al.  Automated dictionary construction from Arabic corpus for meaningful crime information extraction and document classification , 2010, 2010 International Conference on Computer Information Systems and Industrial Management Applications (CISIM).

[18]  Hinrich Schütze,et al.  Introduction to information retrieval , 2008 .

[19]  Information Extraction and Linking in a Retrieval Context , 2009, ECIR.

[20]  Ralph Grishman,et al.  Information Extraction: Techniques and Challenges , 1997, SCIE.

[21]  Fabio Crestani,et al.  An Approach to Indexing and Clustering News Stories Using Continuous Language Models , 2010, NLDB.

[22]  Thanaruk Theeramunkong,et al.  A Feature-Based Approach for Relation Extraction from Thai News Documents , 2009, PAISI.

[23]  Douglas E. Appelt,et al.  Introduction to Information Extraction Technology , 1999, IJCAI 1999.

[24]  Marie-Francine Moens,et al.  Intelligent information retrieval tools for police. Intelligence and security informatics. Proceedings , 2006 .

[25]  Rong Zheng,et al.  Crime Data Mining: An Overview and Case Studies , 2003, DG.O.

[26]  C. Lee Giles,et al.  Accessibility of information on the Web , 2000, INTL.

[27]  Hsinchun Chen,et al.  Extracting Meaningful Entities from Police Narrative Reports , 2002, DG.O.

[28]  Fabio Crestani,et al.  Mining Police Digital Archives to Link Criminal Styles with Offender Characteristics , 2007, ICADL.

[29]  Barry Smyth,et al.  Are people biased in their use of search engines? , 2008, CACM.

[30]  Fabio Crestani,et al.  Application of Language Models to Suspect Prioritisation and Suspect Likelihood in Serial Crimes , 2007, Third International Symposium on Information Assurance and Security.