A Case of Using Formal Concept Analysis in Combination with Emergent Self Organizing Maps for Detecting Domestic Violence

In this paper, we propose a framework for iterative knowledge discovery from unstructured text using Formal Concept Analysis and Emergent Self Organizing Maps. We apply the framework to a real life case study using data from the Amsterdam-Amstelland police. The case zooms in on the problem of distilling concepts for domestic violence from the unstructured text in police reports. Our human-centered framework facilitates the exploration of the data and allows for an efficient incorporation of prior expert knowledge to steer the discovery process. This exploration resulted in the discovery of faulty case labellings, common classification errors made by police officers, confusing situations, missing values in police reports, etc. The framework was also used for iteratively expanding a domain-specific thesaurus. Furthermore, we showed how the presented method was used to develop a highly accurate and comprehensible classification model that automatically assigns a domestic or non-domestic violence label to police reports.

[1]  F. Mörchen,et al.  ESOM-Maps : tools for clustering , visualization , and classification with Emergent SOM , 2005 .

[2]  Bernhard Ganter,et al.  Formal Concept Analysis: Mathematical Foundations , 1998 .

[3]  Alfred Ultsch,et al.  The architecture of emergent self-organizing maps to reduce projection errors , 2005, ESANN.

[4]  Rudolf Wille,et al.  Restructuring Lattice Theory: An Approach Based on Hierarchies of Concepts , 2009, ICFCA.

[5]  Deborah L. McGuinness,et al.  Integrated Support for Data Archeology , 1993, Int. J. Cooperative Inf. Syst..

[6]  Gerd Stumme,et al.  Conceptual Knowledge Discovery in Databases Using Formal Concept Analysis Methods , 1998, PKDD.

[7]  Gerd Stumme,et al.  Formal Concept Analysis on Its Way from Mathematics to Computer Science , 2002, ICCS.

[8]  Armand Hatchuel,et al.  A NEW APPROACH OF INNOVATIVE DESIGN : AN INTRODUCTION TO C-K THEORY. , 2003 .

[9]  A. Ultsch Maps for the Visualization of high-dimensional Data Spaces , 2003 .

[10]  Jonas Poelmans,et al.  An Exploration into the Power of Formal Concept Analysis for Domestic Violence Analysis , 2008, ICDM.

[11]  Ronald J. Brachman,et al.  The Process of Knowledge Discovery in Databases , 1996, Advances in Knowledge Discovery and Data Mining.

[12]  Marc M. Van Hulle,et al.  Faithful Representations and Topographic Maps: From Distortion- to Information-Based Self-Organization , 2000 .

[13]  Uta Priss Formal concept analysis in information science , 2006 .

[14]  Alfred Ultsch,et al.  Data Mining and Knowledge Discovery with Emergent Self-Organizing Feature Maps for Multivariate Time Series , 1999 .

[15]  Rudolf Wille,et al.  Why can concept lattices support knowledge discovery in databases? , 2002, J. Exp. Theor. Artif. Intell..

[16]  Alfred Ultsch Density Estimation and Visualization for Data Containing Clusters of Unknown Structure , 2004, GfKl.