Data Mining Approach of Accident Occurrences Identification with Effective Methodology and Implementation

Data mining is used in various domains of research to identify a new cause for tan effect in the society over the globe. This article includes the same reason for using the data mining to identify the Accident Occurrences in different regions and to identify the most valid reason for happening accidents over the globe. Data Mining and Advanced Machine Learning algorithms are used in this research approach and this article discusses about hyperline, classifications, pre-processing of the data, training the machine with the sample datasets which are collected from different regions in which we have structural and semi-structural data. We will dive into deep of machine learning and data mining classification algorithms to find or predict something novel about the accident occurrences over the globe. We majorly concentrate on two classification algorithms to minify the research and task and they are very basic and important classification algorithms. SVM (Support vector machine), CNB Classifier. This discussion will be quite interesting with WEKA tool for CNB classifier, Bag of Words Identification, Word Count and Frequency Calculation.

[1]  Russell Greiner,et al.  Computational learning theory and natural learning systems , 1997 .

[2]  Durga Toshniwal,et al.  Analysing road accident data using association rule mining , 2015, 2015 International Conference on Computing, Communication and Security (ICCCS).

[3]  Margaret H. Dunham,et al.  Data Mining: Introductory and Advanced Topics , 2002 .

[4]  Joachim M. Buhmann,et al.  Introduction to the special issue on neural networks for data mining and knowledge discovery , 2000 .

[5]  Deepak Dahiya,et al.  Performance Evaluation of Classification Algorithms on Different Data Sets , 2016 .

[6]  Joshua Zhexue Huang,et al.  Extensions to the k-Means Algorithm for Clustering Large Data Sets with Categorical Values , 1998, Data Mining and Knowledge Discovery.

[7]  M G Karlaftis,et al.  Heterogeneity considerations in accident modeling. , 1998, Accident; analysis and prevention.

[8]  M-Tech Student,et al.  A Survey on Decision Tree Based Approaches in Data Mining , 2015 .

[9]  Vijender Kumar Solanki,et al.  Analysis of Datamining Technique for Traffic Accident Severity Problem: A Review , 2017, RICE.

[10]  Zuhal Tanrikulu,et al.  A comparative analysis of classification algorithms in data mining for accuracy, speed and robustness , 2012, Information Technology and Management.

[11]  J. Ross Quinlan,et al.  C4.5: Programs for Machine Learning , 1992 .

[12]  L Mussone,et al.  An analysis of urban collisions using an artificial intelligence model. , 1999, Accident; analysis and prevention.

[13]  Richi Nayak,et al.  Data Mining For Lifetime Prediction of Metallic Components , 2006, AusDM.

[14]  Monica Chiarini Tremblay,et al.  Identifying fall-related injuries: Text mining the electronic medical record , 2009, Inf. Technol. Manag..

[15]  Vivek Agarwal,et al.  Survey on Classification Techniques for Data Mining , 2015 .

[16]  Ronen Feldman,et al.  The Data Mining and Knowledge Discovery Handbook , 2005 .

[17]  Vijender Kumar Solanki,et al.  A Novel Framework to Use Association Rule Mining for classification of traffic accident severity , 2017 .

[18]  Deepika Comparative Analysis of Classification Algorithms for the Prediction of Leukemia Cancer , 2015 .

[19]  David J. Spiegelhalter,et al.  Machine Learning, Neural and Statistical Classification , 2009 .

[20]  Leslie Pack Kaelbling,et al.  Associative methods in reinforcement learning: an empirical study , 1994, COLT 1994.

[21]  Griselda López,et al.  Analysis of traffic accidents on rural highways using Latent Class Clustering and Bayesian Networks. , 2013, Accident; analysis and prevention.

[22]  S. Vijayarani,et al.  Comparative Analysis of Bayes and Lazy Classification Algorithms , 2013 .

[23]  Joshua Zhexue Huang,et al.  A Fast Clustering Algorithm to Cluster Very Large Categorical Data Sets in Data Mining , 1997, DMKD.

[24]  Daniel T. Larose,et al.  Discovering Knowledge in Data: An Introduction to Data Mining , 2005 .

[25]  Tomasz Imielinski,et al.  Database Mining: A Performance Perspective , 1993, IEEE Trans. Knowl. Data Eng..

[26]  Li-Yen Chang,et al.  Data mining of tree-based models to analyze freeway accident frequency. , 2005, Journal of safety research.

[27]  Andrew H. Sung,et al.  A Novel Design Specification Distance (DSD) based K-Mean Clustering Performance Evaluation on Engineering Materials' Database , 2019 .

[28]  Petra Perner,et al.  Data Mining - Concepts and Techniques , 2002, Künstliche Intell..

[29]  Durga Toshniwal,et al.  A data mining framework to analyze road accident data , 2015, Journal of Big Data.

[30]  Durga Toshniwal,et al.  A novel framework to analyze road accident time series data , 2016, Journal of Big Data.

[31]  Lingjun Meng,et al.  Profiling novel classification algorithms Artificial Immune Systems , 2008, 2008 7th IEEE International Conference on Cybernetic Intelligent Systems.

[32]  Durga Toshniwal,et al.  Analysis of hourly road accident counts using hierarchical clustering and cophenetic correlation coefficient (CPCC) , 2016, Journal of Big Data.