Gaining insight in domestic violence with Emergent Self Organizing Maps

Topographic maps are an appealing exploratory instrument for discovering new knowledge from databases. During the past years, new types of Self Organizing Maps (SOM) were introduced in the literature, including the recent Emergent SOM. The ESOM tool is used here to analyze a large set of police reports describing a wide range of violent incidents that occurred during the year 2007 in the Amsterdam-Amstelland police region (the Netherlands). This article aims to demonstrate that the ESOM tool provides a valuable exploratory instrument for examining unstructured text in police reports. First, it is shown how ESOM was used to discover a range of new features that better distinguish domestic from non-domestic violence cases. Second, it is demonstrated how this resulted in a significant improvement in classification accuracy. Third, the ESOM tool facilitates an in-depth investigation of the nature and scope of domestic violence, which is particularly useful for the domain expert. Interestingly, it was discovered that the definition of domestic violence employed by the management was much broader than the definition employed by police officers. Fourth, the ESOM tool enables an accurate and automated assignment of either a domestic or a non-domestic violence label to unclassified cases. Finally, ESOM is a highly accurate and comprehensible case triage model for detecting and retrieving wrongly classified cases.

[1]  Ramasamy Uthurusamy,et al.  Evolving data into mining solutions for insights , 2002, CACM.

[2]  Marc M. Van Hulle,et al.  Faithful Representations and Topographic Maps: From Distortion- to Information-Based Self-Organization , 2000 .

[3]  Sergei M. Ananyan Crime pattern analysis through text mining , 2004, AMCIS.

[4]  Lior Rokach,et al.  Decomposition Methodology for Knowledge Discovery and Data Mining - Theory and Applications , 2005, Series in Machine Perception and Artificial Intelligence.

[5]  P. Collier Policing and the Intelligent Application of Knowledge , 2006 .

[6]  Bart Baesens,et al.  Comprehensible Credit Scoring Models Using Rule Extraction from Support Vector Machines , 2007, Eur. J. Oper. Res..

[7]  Edwin P. D. Pednault Representation is everything , 2000, CACM.

[8]  Lior Rokach,et al.  Decomposition Methodology for Knowledge Discovery and Data Mining , 2005, The Data Mining and Knowledge Discovery Handbook.

[9]  Ronald J. Brachman,et al.  The Process of Knowledge Discovery in Databases , 1996, Advances in Knowledge Discovery and Data Mining.

[10]  Christos Faloutsos,et al.  Data-driven evolution of data mining algorithms , 2002, CACM.

[11]  Teuvo Kohonen,et al.  Self-organized formation of topologically correct feature maps , 2004, Biological Cybernetics.

[12]  J. Edwards,et al.  Communicating knowledge about police performance , 2004 .

[13]  Weiguo Fan,et al.  Tapping the power of text mining , 2006, CACM.

[14]  Alfred Ultsch,et al.  The architecture of emergent self-organizing maps to reduce projection errors , 2005, ESANN.

[15]  Ramasamy Uthurusamy,et al.  EVOLVING DATA MINING INTO SOLUTIONS FOR INSIGHTS , 2002 .

[16]  Ron Kohavi,et al.  Emerging trends in business analytics , 2002, CACM.

[17]  Alfred Ultsch Density Estimation and Visualization for Data Containing Clusters of Unknown Structure , 2004, GfKl.

[18]  Wessel Kraaij,et al.  Maximizing classifier utility for a given accuracy , 2008 .

[19]  Fuhui Long,et al.  Feature selection based on mutual information criteria of max-dependency, max-relevance, and min-redundancy , 2003, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[20]  T. Kohonen Self-organized formation of topographically correct feature maps , 1982 .

[21]  Gary Marchionini,et al.  Exploratory search , 2006, Commun. ACM.

[22]  Gang Wang,et al.  Crime data mining: a general framework and some examples , 2004, Computer.

[23]  A. Ultsch Maps for the Visualization of high-dimensional Data Spaces , 2003 .

[24]  Shlomo Argamon,et al.  Toward meaningful computing , 2006, CACM.

[25]  F. Mörchen,et al.  ESOM-Maps : tools for clustering , visualization , and classification with Emergent SOM , 2005 .