An Immune-Inspired Technique to Identify Heavy Goods Vehicles Incident Hot Spots

We report on the adaptation of an immune-inspired instance selection technique to solve a real-world big data problem of determining vehicle incident hot spots. The technique, which is inspired by the immune system self-regulation mechanism, was originally conceptualized to eliminate very similar instances in data classification tasks. We adapt the method to detect hot spots from a telematics dataset containing hundreds of thousands of data points indicating incident locations involving heavy goods vehicles (HGVs) across the United Kingdom. The objective is to provide HGV drivers with information regarding areas of high likelihood of incidents in order to continuously improve road safety and vehicle economy. The problem presents several challenges and constraints. An accurate view of the hot spots produced in a timely manner is necessary. In addition, the solution is required to be adaptable and dynamic, as thousands of new incidents are included in the database daily. Furthermore, the impact on hot spots after informing drivers about their existence has to be considered. Our solution successfully addresses these constraints. It is fast, robust, and applicable to all different incidents investigated. The method is also self-adjustable, which means that if the hot spots configuration changes with time, the algorithm automatically evolves to present the most current topology. Our solution has been implemented by a telematics company to improve HGV safety in the United Kingdom.

[1]  Robert Ivor John,et al.  Vehicle Incident Hot Spots Identification: An Approach for Big Data , 2017, 2017 IEEE Trustcom/BigDataSE/ICESS.

[2]  Wen Cheng,et al.  Experimental evaluation of hotspot identification methods. , 2005, Accident; analysis and prevention.

[3]  Nelson F. F. Ebecken,et al.  An immune-inspired instance selection mechanism for supervised classification , 2012, Memetic Comput..

[4]  Anthony K. H. Tung,et al.  Spatial clustering methods in data mining : A survey , 2001 .

[5]  H. Robbins An Empirical Bayes Approach to Statistics , 1956 .

[6]  Farhad Samadzadegan,et al.  A Geospatial Based Neuro-Fuzzy Modeling for Regional Transportation Corridors Hazardous Zones Identification , 2014 .

[7]  J. A. Hartigan,et al.  A k-means clustering algorithm , 1979 .

[8]  Nelson F. F. Ebecken,et al.  A Strategy for Training Set Selection in Text Classification Problems , 2013 .

[9]  Tessa K Anderson,et al.  Kernel density estimation and K-means clustering to profile road accident hotspots. , 2009, Accident; analysis and prevention.

[10]  Francisco Herrera,et al.  Using evolutionary algorithms as instance selection for data reduction in KDD: an experimental study , 2003, IEEE Trans. Evol. Comput..

[11]  Yordphol Tanaboriboon,et al.  APPLYING PUBLIC PARTICIPATION APPROACH TO BLACK SPOT IDENTIFICATION PROCESS: – A Case Study in Thailand – , 2006 .

[12]  Becky P Y Loo,et al.  Validating crash locations for quantitative spatial analysis: a GIS-based approach. , 2006, Accident; analysis and prevention.

[13]  Alfonso Montella,et al.  A comparative analysis of hotspot identification methods. , 2010, Accident; analysis and prevention.

[14]  Nelson F. F. Ebecken,et al.  The SUPRAIC Algorithm: A Suppression Immune Based Mechanism to Find a Representative Training Set in Data Classification Tasks , 2007, ICARIS.

[15]  Salissou Moutari,et al.  Hotspots identification and ranking for road safety improvement: an alternative approach. , 2013, Accident; analysis and prevention.

[16]  José Francisco Martínez Trinidad,et al.  A review of instance selection methods , 2010, Artificial Intelligence Review.

[17]  Tony R. Martinez,et al.  Reduction Techniques for Instance-Based Learning Algorithms , 2000, Machine Learning.

[18]  Tarek Sayed,et al.  Depth-based hotspot identification and multivariate ranking using the full Bayes approach. , 2013, Accident; analysis and prevention.

[19]  Peter J. Rousseeuw,et al.  Clustering by means of medoids , 1987 .

[20]  Hans-Peter Kriegel,et al.  A Density-Based Algorithm for Discovering Clusters in Large Spatial Databases with Noise , 1996, KDD.

[21]  G L Ada,et al.  The clonal-selection theory. , 1987, Scientific American.

[22]  Ying Wah Teh,et al.  Big Data Clustering: A Review , 2014, ICCSA.

[23]  C. Janeway Immunobiology: The Immune System in Health and Disease , 1996 .

[24]  Richard Andrášik,et al.  Identification of hazardous road locations of traffic accidents by means of kernel density estimation and cluster significance evaluation. , 2013, Accident; analysis and prevention.

[25]  N. Ebecken,et al.  A Support Vector Machine-Based Technique for Instance Selection , 2013 .

[26]  Glen Van Brummelen,et al.  Heavenly Mathematics: The Forgotten Art of Spherical Trigonometry , 2012 .

[27]  Ian Dickinson,et al.  Overview of Telematics-Based Prognostics and Health Management Systems for Commercial Vehicles , 2013, TST.