Data mining can be used to model crime detection problems, detect unusual patterns, terrorist activities and fraudulent behaviour. Crimes are a social nuisance and cost our society dearly in several ways. Any research that can help in solving crimes faster will pay for itself. Here we look at use of clustering algorithm for a data mining approach to help detect the crimes patterns and speed up the process of solving crime. We will look at k-means clustering with some enhancements to aid in the process of identification of crime patterns. The k-means algorithm is one of the frequently used clustering method in data mining, due to its performance in clustering massive data sets. The final clustering result of the k-means clustering algorithm greatly depends upon the correctness of the initial centroids, which are selected randomly. The original k-means algorithm converges to local minimum, not the global optimum. Many improvements were already proposed to improve the performance of the k-means, but most of these require additional inputs like threshold values for the number of data points in a set. In this paper a new method is proposed for finding the better initial centroids and to provide an efficient way of assigning the data points to suitable clusters with reduced time complexity.
[1]
Gary LaFree,et al.
Introducing the Global Terrorism Database
,
2007
.
[2]
M. P. Sebastian,et al.
Improving the Accuracy and Efficiency of the k-means Clustering Algorithm
,
2009
.
[3]
Tamas Abraham,et al.
Investigative profiling with computer forensic log data and association rules
,
2002,
2002 IEEE International Conference on Data Mining, 2002. Proceedings..
[4]
Rong Zheng,et al.
Crime Data Mining: An Overview and Case Studies
,
2003,
DG.O.
[5]
I. D. Wilson,et al.
Predicting the geo-temporal variations of crime and disorder
,
2003
.
[6]
Walter A. Kosters,et al.
Data Mining Approaches to Criminal Career Analysis
,
2006,
Sixth International Conference on Data Mining (ICDM'06).
[7]
Wang Xue-ying.
K Mean Cluster Algorithm with Refined Initial Center Point
,
2009
.