Automated criminal link analysis based on domain knowledge

Link (association) analysis has been used in the criminal justice domain to search large datasets for associations between crime entities in order to facilitate crime investigations. However, link analysis still faces many challenging problems, such as information overload, high search complexity, and heavy reliance on domain knowledge. To address these challenges, this article proposes several techniques for automated, effective, and efficient link analysis. These techniques include the co-occurrence analysis, the shortest path algorithm, and a heuristic approach to identifying associations and determining their importance. We developed a prototype system called CrimeLink Explorer based on the proposed techniques. Results of a user study with 10 crime investigators from the Tucson Police Department showed that our system could help subjects conduct link analysis more efficiently than traditional single-level link analysis tools. Moreover, subjects believed that association paths found based on the heuristic approach were more accurate than those found based solely on the co-occurrence analysis and that the automated link analysis system would be of great help in crime investigations.

[1]  Gang Wang,et al.  Crime data mining: a general framework and some examples , 2004, Computer.

[2]  Lior Rokach,et al.  An Introduction to Decision Trees , 2007 .

[3]  William M. Pottenger,et al.  A semi-supervised active learning algorithm for information extraction from textual data: Research Articles , 2005 .

[4]  D. Swanson Fish Oil, Raynaud's Syndrome, and Undiscovered Public Knowledge , 2015, Perspectives in biology and medicine.

[5]  Hsinchun Chen,et al.  Introduction to the special topic issue: Intelligence and security informatics , 2005, J. Assoc. Inf. Sci. Technol..

[6]  David D. Jensen Statistical challenges to inductive inference in linked data , 1999, AISTATS.

[7]  Hsinchun Chen,et al.  Fighting organized crimes: using shortest-path algorithms to identify associations in criminal networks , 2004, Decis. Support Syst..

[8]  J. Ross Quinlan,et al.  C4.5: Programs for Machine Learning , 1992 .

[9]  Ted E. Senator,et al.  Restructuring Databases for Knowledge Discovery by Consolidation and Link Formation , 1995, KDD.

[10]  Shri K. Goyal,et al.  Compass: an expert system for telephone switch maintenance , 1985 .

[11]  Donald E. Brown,et al.  Criminal Incident Data Association Using the OLAP Technology , 2003, ISI.

[12]  Hsinchun Chen,et al.  Extracting Meaningful Entities from Police Narrative Reports , 2002, DG.O.

[13]  James Evans,et al.  Optimization algorithms for networks and graphs , 1992 .

[14]  Edsger W. Dijkstra,et al.  A note on two problems in connexion with graphs , 1959, Numerische Mathematik.

[15]  Sergey Brin,et al.  The Anatomy of a Large-Scale Hypertextual Web Search Engine , 1998, Comput. Networks.

[16]  Jesus Mena,et al.  Investigative Data Mining for Security and Criminal Detection , 2002 .

[17]  Neil R. Smalheiser,et al.  Artificial Intelligence An interactive system for finding complementary literatures : a stimulus to scientific discovery , 1995 .

[18]  David Heckerman,et al.  A Tutorial on Learning with Bayesian Networks , 1998, Learning in Graphical Models.

[19]  D. Mcandrew The Structural Analysis of Criminal Networks , 2021, The Social Psychology of Crime.

[20]  Ray Wild,et al.  Optimization Algorithms for Networks and Graphs , 1980 .

[21]  Erik M. van Mulligen,et al.  Constructing an associative concept space for literature-based discovery , 2004, J. Assoc. Inf. Sci. Technol..

[22]  Breck Baldwin Coreference as the Foundations for Link Analysis over Free Text Databases , 1998 .

[23]  Edward A. Fox,et al.  Connecting topics in document collections with stepping stones and pathways , 2005, CIKM '05.

[24]  Hsinchun Chen,et al.  Using Coplink to Analyze Criminal-Justice Data , 2002, Computer.

[25]  Fernando Gomide Fuzzy engineering expert systems with neural network applications , 2003 .

[26]  Sumit Sarkar,et al.  Bayesian Models for Early Warning of Bank Failures , 2001, Manag. Sci..

[27]  K. J. Lynch,et al.  Automatic construction of networks of concepts characterizing document databases , 1992, IEEE Trans. Syst. Man Cybern..

[28]  Ramesh Sharda,et al.  Bankruptcy prediction using neural networks , 1994, Decis. Support Syst..

[29]  Weiguo Fan,et al.  Literature-based discovery on the World Wide Web , 2002, TOIT.

[30]  Donald E. Brown,et al.  Data association methods with applications to law enforcement , 2003, Decis. Support Syst..

[31]  Lee S. Strickland,et al.  Technology, security, and individual privacy: New tools, new threats, and new public perceptions , 2005, J. Assoc. Inf. Sci. Technol..

[32]  Douglas H. Harris,et al.  The Application of Link Analysis to Police Intelligence , 1975 .

[33]  Michael D. Gordon,et al.  Literature-based discovery by lexical statistics , 1999 .

[34]  Stephen F. Smith,et al.  ISIS—a knowledge‐based system for factory scheduling , 1984 .

[35]  Pat Langley,et al.  Induction of Selective Bayesian Classifiers , 1994, UAI.

[36]  Malcolm K. Sparrow,et al.  The application of network analysis to criminal intelligence: An assessment of the prospects , 1991 .

[37]  E H Shorthffe,et al.  Computer-based medical consultations mycin , 1976 .

[38]  Henry G. Goldberg,et al.  Restructuring Transactional Data for Link Analysis in the FinCEN AI System , 1998 .

[39]  M. E. Maron,et al.  An evaluation of retrieval effectiveness for a full-text document-retrieval system , 1985, CACM.

[40]  Barbara M. Wildemuth,et al.  The effects of domain knowledge on search tactic formulation , 2004, J. Assoc. Inf. Sci. Technol..

[41]  James Martin,et al.  Building expert systems: a tutorial , 1988 .

[42]  Yang Xiang,et al.  Visualizing criminal relationships: comparison of a hyperbolic tree and a hierarchical list , 2005, Decis. Support Syst..

[43]  McLean Va,et al.  Automatic Information Extraction from Documents: A Tool for Intelligence and Law Enforcement Analysts , 1998 .

[44]  Christopher C. Yang,et al.  Automatic crosslingual thesaurus generated from the Hong Kong SAR Police Department Web corpus for crime analysis , 2005, J. Assoc. Inf. Sci. Technol..

[45]  William M. Pottenger,et al.  A semi-supervised active learning algorithm for information extraction from textual data , 2005, J. Assoc. Inf. Sci. Technol..

[46]  Jeffery L. Kennington,et al.  The one-to-one shortest-path problem: An empirical analysis with the two-tree Dijkstra algorithm , 1993, Comput. Optim. Appl..