Improving ATM coverage area using density based clustering algorithm and voronoi diagrams

Abstract Facility location is a problem of paramount importance and optimizing business operations without affecting customer service is very challenging. In the case of banking services, the location of bank branches and ATMs must match the service demands (turn around time for service, reachability etc) of the customers’ and the expected quality of service is determined by the socio-economic background of the customer. Therefore, it is necessary to formulate the optimization problem so as to reflect the customers’ expectations and tolerance for quality of service in a given geographical region. The ability to do so requires clustering people living in the region into several smaller areas called service areas. An ideal clustering algorithm should consider the social behavior of people living in the service area and the uncertainty associated with their social behavior. In this paper, we propose a modification to generalized density based clustering algorithm (GDBSCAN) to deal with fuzziness in the values describing the population demographics and the preferences for ATM location among customers utilizing the ATM services. The modified version of GDBSCAN clustering algorithm, which we call GFDBSCAN, is used to cluster people around key socio-economic parameters. GFDBSCAN can also be used to cluster geographical regions based on the requirement and preferences expressed by the customers for services like business outlets, ATMs, bank branch operations, public utilities, etc. We apply the proposed algorithm to cluster geo-spatial data based on personal traits of people living in the geographical area under study. We measure and compare the performance of GFDBSCAN with other popular clustering algorithms using Silhouette coefficient, Dunn index and Davies–Bouldin index. We plot the clustering results on Google maps for better visualization of results. We found that GFDBSCAN is better able to cope with fuzziness in the values of both spatial and non-spatial attributes. We finally use voronoi diagrams to identify the ideal locations to place ATMs so as to ensure that the customers’ preferences are served and, at the same time, the service area of each ATM is optimized.

[1]  J. MacQueen Some methods for classification and analysis of multivariate observations , 1967 .

[2]  Miodrag Potkonjak,et al.  Coverage problems in wireless ad-hoc sensor networks , 2001, Proceedings IEEE INFOCOM 2001. Conference on Computer Communications. Twentieth Annual Joint Conference of the IEEE Computer and Communications Society (Cat. No.01CH37213).

[3]  Qian Wang,et al.  Algorithms for a Facility Location Problem with Stochastic Customer Demand and Immobile Servers , 2002, Ann. Oper. Res..

[4]  Raghu Kisore Neelisetti,et al.  Improving Reliability Of Wireless Sensor Networks for Target Tracking using Wireless Acoustic Sensors , 2009 .

[5]  Mansour A. Aldajani,et al.  Location of banking automatic teller machines based on convolution , 2009, Comput. Ind. Eng..

[6]  Satish V. Ukkusuri,et al.  A Voronoi-Based Heuristic Algorithm for Locating Distribution Centers in Disasters , 2012 .

[7]  Donald W. Bouldin,et al.  A Cluster Separation Measure , 1979, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[8]  Justo Puerto,et al.  Location Theory - A Unified Approach , 2005 .

[9]  Hans-Peter Kriegel,et al.  OPTICS: ordering points to identify the clustering structure , 1999, SIGMOD '99.

[10]  Miodrag Potkonjak,et al.  Worst and best-case coverage in sensor networks , 2005, IEEE Transactions on Mobile Computing.

[11]  Hans-Peter Kriegel,et al.  Hierarchical density-based clustering of uncertain data , 2005, Fifth IEEE International Conference on Data Mining (ICDM'05).

[12]  Stephen L. Chiu,et al.  Fuzzy Model Identification Based on Cluster Estimation , 1994, J. Intell. Fuzzy Syst..

[13]  Dalila B. M. M. Fontes,et al.  Locating and Sizing Bank-Branches by Opening, Closing or Maintaining Facilities , 2005, OR.

[14]  Ron Shamir,et al.  A clustering algorithm based on graph connectivity , 2000, Inf. Process. Lett..

[15]  Gaël Varoquaux,et al.  Scikit-learn: Machine Learning in Python , 2011, J. Mach. Learn. Res..

[16]  Vipin Kumar,et al.  Similarity Measures for Categorical Data: A Comparative Evaluation , 2008, SDM.

[17]  Hokey Min,et al.  The Three-Hierarchical Location-Allocation of Banking Facilities with Risk and Uncertainty , 2001 .

[18]  J. Dunn Well-Separated Clusters and Optimal Fuzzy Partitions , 1974 .

[19]  Pavel Berkhin,et al.  A Survey of Clustering Data Mining Techniques , 2006, Grouping Multidimensional Data.

[20]  Indranil Sengupta,et al.  Maximal Breach in Wireless Sensor Networks: Geometric Characterization and Algorithms , 2007, ALGOSENSORS.

[21]  A. Mendes,et al.  Multi-outlet retail site location assessment , 2004 .

[22]  Karim Djouani,et al.  Extended fuzzy clustering algorithm based on an inclusion concept , 2003, The 12th IEEE International Conference on Fuzzy Systems, 2003. FUZZ '03..

[23]  Vipin Kumar,et al.  Introduction to Data Mining, (First Edition) , 2005 .

[24]  Jiawei Han,et al.  Data Mining: Concepts and Techniques , 2000 .

[25]  Ulrik Brandes,et al.  Experiments on Graph Clustering Algorithms , 2003, ESA.

[26]  Philip S. Yu,et al.  A Survey of Uncertain Data Algorithms and Applications , 2009, IEEE Transactions on Knowledge and Data Engineering.

[27]  Joshua Zhexue Huang,et al.  Extensions to the k-Means Algorithm for Clustering Large Data Sets with Categorical Values , 1998, Data Mining and Knowledge Discovery.

[28]  Hans-Peter Kriegel,et al.  A Density-Based Algorithm for Discovering Clusters in Large Spatial Databases with Noise , 1996, KDD.

[29]  Daniel T. Larose,et al.  Discovering Knowledge in Data: An Introduction to Data Mining , 2005 .

[30]  James C. Bezdek,et al.  Some new indexes of cluster validity , 1998, IEEE Trans. Syst. Man Cybern. Part B.

[31]  Sergei Vassilvitskii,et al.  k-means++: the advantages of careful seeding , 2007, SODA '07.

[32]  Hans-Peter Kriegel,et al.  Density-based clustering of uncertain data , 2005, KDD '05.

[33]  Frank Plastria S. Nickel and J. Puerto, Location Theory: A Unified Approach, Springer (2005) ISBN 3-540-24321-6 XXI+437p , 2007, Eur. J. Oper. Res..

[34]  Richard L. Church,et al.  Business Site Selection, Location Analysis and GIS , 2008 .

[35]  Rui Xu,et al.  Survey of clustering algorithms , 2005, IEEE Transactions on Neural Networks.

[36]  Michalis Vazirgiannis,et al.  Clustering validity checking methods: part II , 2002, SGMD.

[37]  Hans-Peter Kriegel,et al.  Density-Based Clustering in Spatial Databases: The Algorithm GDBSCAN and Its Applications , 1998, Data Mining and Knowledge Discovery.

[39]  J. Bezdek,et al.  FCM: The fuzzy c-means clustering algorithm , 1984 .