Crime Information Extraction from Police and Witness Narrative Reports

To solve crimes, investigators often rely on interviews with witnesses, victims, or criminals themselves. The interviews are transcribed and the pertinent data is contained in narrative form. To solve one crime, investigators may need to interview multiple people and then analyze the narrative reports. There are several difficulties with this process: interviewing people is time consuming, the interviews - sometimes conducted by multiple officers - need to be combined, and the resulting information may still be incomplete. For example, victims or witnesses are often too scared or embarrassed to report or prefer to remain anonymous. We are developing an online reporting system that combines natural language processing with insights from the cognitive interview approach to obtain more information from witnesses and victims. We report here on information extraction from police and witness narratives. We achieved high precision, 94% and 96% and recall, 85% and 90%, for both narrative types.

[1]  Ge Yu,et al.  A Study on Information Extraction from PDF Files , 2005, ICMLC.

[2]  Shyam Varan Nath,et al.  Crime Pattern Detection Using Data Mining , 2006, 2006 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology Workshops.

[3]  Wendy G. Lehnert,et al.  Information extraction , 1996, CACM.

[4]  Günter Köhnken,et al.  Effects of the cognitive interview on the recall of familiar and unfamiliar events. , 1995 .

[5]  Hsinchun Chen,et al.  Extracting Meaningful Entities from Police Narrative Reports , 2002, DG.O.

[6]  R. Bull,et al.  The cognitive interview: Its origins, empirical support, evaluation and practical implications , 1991 .

[7]  Pu-Jen Cheng,et al.  Annotating text segments in documents for search , 2005, The 2005 IEEE/WIC/ACM International Conference on Web Intelligence (WI'05).

[8]  Roger Anderson,et al.  Homeland Security , 2004, Gov. Inf. Q..

[9]  Wei Li,et al.  Information Extraction Supported Question Answering , 1999, TREC.

[10]  Ronen Feldman,et al.  TEG—a hybrid approach to information extraction , 2005, Knowledge and Information Systems.

[11]  J. Py,et al.  A technique for enhancing memory in eye witness testimonies for use by police officers and judicial officials : the cognitive interview , 2001 .

[12]  Rong Zheng,et al.  Crime Data Mining: An Overview and Case Studies , 2003, DG.O.

[13]  P. Lejins Uniform Crime Reports , 1966 .

[14]  Hamish Cunningham,et al.  GATE-a General Architecture for Text Engineering , 1996, COLING.

[15]  Kalina Bontcheva,et al.  Rapid customization of an information extraction system for a surprise language , 2003, TALIP.

[16]  Steven Walczak,et al.  Exploiting the Information Web , 2007, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews).

[17]  Maria T. Pazienza,et al.  Information Extraction , 2002, Lecture Notes in Computer Science.

[18]  D.E. Brown,et al.  Uniform Crime Report "SuperClean" Data Cleaning Tool , 2006, 2006 IEEE Systems and Information Engineering Design Symposium.

[19]  Edith D. de Leeuw,et al.  Reducing missing data in surveys: an overview of methods , 2001 .

[20]  Gondy Leroy,et al.  Reporting On-Campus Crime Online: User Intention to Use , 2006, Proceedings of the 39th Annual Hawaii International Conference on System Sciences (HICSS'06).

[21]  Jari Björne,et al.  BioInfer: a corpus for information extraction in the biomedical domain , 2007, BMC Bioinformatics.

[22]  Gondy Leroy,et al.  Natural Language Processing and e-Government: Extracting Reusable Crime Report Information , 2007, 2007 IEEE International Conference on Information Reuse and Integration.

[23]  Yaping Lin,et al.  Using hidden Markov model for information extraction based on multiple templates , 2003, International Conference on Natural Language Processing and Knowledge Engineering, 2003. Proceedings. 2003.