Road-User Specific Analysis of Traffic Accident Using Data Mining Techniques

Analysis of road accident is very important because it can expose the relationship between the different types of attributes that contributes to a road accident. Attributes that affect the road accident can be road attribute, environment attributes, traffic attributes etc. Analyzing road accident can provide the information about the contribution of these attributes which can be utilized to overcome the accident rate. Nowadays, Data mining is a popular technique for examining the road accident dataset. In this study, we have performed the classification of road accident on the basis of road user category. We have used Self Organizing map (SOM), K-modes clustering technique to group the data into homogeneous segments and then applied Support vector machine (SVM), Naive Bayes (NB) and Decision tree to classify the data. We have performed classification on data with and without clustering. The result illustrates that better classification accuracy can be achieved after segmentation of data using clustering.

[1]  Prayag Tiwari Improvement of ETL through integration of query cache and scripting method , 2016, 2016 International Conference on Data Science and Engineering (ICDSE).

[2]  Jianming Ma,et al.  Crash frequency and severity modeling using clustered data from Washington State , 2006, 2006 IEEE Intelligent Transportation Systems Conference.

[3]  Jiawei Han,et al.  Data Mining: Concepts and Techniques , 2000 .

[4]  Durga Toshniwal,et al.  Analysing road accident data using association rule mining , 2015, 2015 International Conference on Computing, Communication and Security (ICCCS).

[5]  Manoranjan Parida,et al.  A comparative analysis of heterogeneity in road accident data using data mining techniques , 2016, Evolving Systems.

[6]  Michael J. A. Berry,et al.  Data Mining Techniques: For Marketing, Sales, and Customer Relationship Management , 2004 .

[7]  So Young Sohn,et al.  Data fusion, ensemble and clustering to improve the classification accuracy for the severity of road traffic accidents in Korea , 2003 .

[8]  Chris Lee,et al.  Analysis of Crash Precursors on Instrumented Freeways , 2002 .

[9]  Sudhir Kumar Barai,et al.  Data mining applications in transportation engineering , 2003 .

[10]  Amedeo Napoli,et al.  Mining gene expression data with pattern structures in formal concept analysis , 2011, Inf. Sci..

[11]  M G Karlaftis,et al.  Heterogeneity considerations in accident modeling. , 1998, Accident; analysis and prevention.

[12]  Jonas Poelmans,et al.  Knowledge representation and processing with formal concept analysis , 2013, WIREs Data Mining Knowl. Discov..

[13]  Vivek Kumar,et al.  Implementation of n-gram Methodology for Rotten Tomatoes Review Dataset Sentiment Analysis , 2017, Int. J. Knowl. Discov. Bioinform..

[14]  Sergei O. Kuznetsov,et al.  Fitting Pattern Structures to Knowledge Discovery in Big Data , 2013, ICFCA.

[15]  Ian H. Witten,et al.  Data mining: practical machine learning tools and techniques, 3rd Edition , 1999 .

[16]  Prayag Tiwari,et al.  Performance Evaluation of Lazy, Decision Tree Classifier and Multilayer Perceptron on Traffic Accident Analysis , 2017, Informatica.

[17]  K. Vanhoof,et al.  Profiling of High-Frequency Accident Locations by Use of Association Rules , 2003 .

[18]  Durga Toshniwal,et al.  A novel framework to analyze road accident time series data , 2016, Journal of Big Data.

[19]  Sachin Kumar,et al.  A data mining approach to characterize road accident locations , 2016, Journal of Modern Transportation.

[20]  Prayag Tiwari Advanced ETL (AETL) by integration of PERL and scripting method , 2016, 2016 International Conference on Inventive Computation Technologies (ICICT).

[21]  Vijender Kumar Solanki,et al.  A Conjoint Analysis of Road Accident Data using K-modes Clustering and Bayesian Networks (Road Accident Analysis using clustering and classification) , 2017, RICE.

[22]  Vivek Kumar,et al.  Improved performance of data warehouse , 2017, 2017 International Conference on Inventive Communication and Computational Technologies (ICICCT).

[23]  Jonas Poelmans,et al.  Formal Concept Analysis in knowledge processing: A survey on models and techniques , 2013, Expert Syst. Appl..

[24]  Durga Toshniwal,et al.  Analysis of hourly road accident counts using hierarchical clustering and cophenetic correlation coefficient (CPCC) , 2016, Journal of Big Data.

[25]  Wonjong Rhee,et al.  Application of classification algorithms for analysis of road safety risk factor dependencies. , 2015, Accident; analysis and prevention.

[26]  Durga Toshniwal,et al.  A data mining framework to analyze road accident data , 2015, Journal of Big Data.

[27]  Prayag Tiwari Comparative Analysis of Big Data , 2016 .