Performance Evaluation of Lazy and Decision Tree Classifier: A Data Mining Approach for Global Celebrity's Death Analysis

In present world data is a valuable asset. The best utilization of this asset by means of technology gives an upper hand to an organization. Technologies like machine learning, data mining, and artificial intelligence are no exception to this either. Celebrities influence common people behaviors through biological, psychological and social processes to a great extent. Simultaneously there is vast disparity amongst their demise over the cause, region, and age. Therefore, it is a challenging and interesting endorsement to work upon. The objective of this work is to come up with the comprehensive result to understand the celebrity deaths by investigating the incidence happened over the decade. The database for training is created from the public and open access databases for years 2006–2016 comprising of 11, 200 reported deaths over the globe. Findings of the work are year by year extraction of death, the cause behind it, age, gender, and place. Lazy and decision tree classifier model of data mining is being used for the analysis based on the profession as evaluation class.

[1]  Vivek Kumar,et al.  Implementation of n-gram Methodology for Rotten Tomatoes Review Dataset Sentiment Analysis , 2017, Int. J. Knowl. Discov. Bioinform..

[2]  Jeffrey Dean,et al.  Efficient Estimation of Word Representations in Vector Space , 2013, ICLR.

[3]  Michael Hehenberger,et al.  Text-based knowledge discovery: search and mining of life-sciences documents. , 2002, Drug discovery today.

[4]  M. Bleckwenn,et al.  Drug-related celebrity deaths: A cross-sectional study , 2016, Substance Abuse Treatment, Prevention, and Policy.

[5]  R. Suganya,et al.  Data Mining Concepts and Techniques , 2010 .

[6]  Hua Xu,et al.  Chinese comments sentiment classification based on word2vec and SVMperf , 2015, Expert Syst. Appl..

[7]  Marti A. Hearst Untangling Text Data Mining , 1999, ACL.

[8]  Yu-N Cheah,et al.  Paraphrase detection using semantic relatedness based on Synset Shortest Path in WordNet , 2016, 2016 International Conference On Advanced Informatics: Concepts, Theory And Application (ICAICTA).

[9]  K. Mouthami,et al.  Sentiment analysis and classification based on textual reviews , 2013, 2013 International Conference on Information Communication and Embedded Systems (ICICES).

[10]  Vivek Kumar,et al.  Unsupervised learning dimensionality reduction algorithm PCA for face recognition , 2017, 2017 International Conference on Computing, Communication and Automation (ICCCA).

[11]  Tetsuya Nasukawa,et al.  Text analysis and knowledge mining system , 2001, IBM Syst. J..

[12]  Thorsten Joachims,et al.  Training linear SVMs in linear time , 2006, KDD '06.

[13]  Vijender Kumar Solanki,et al.  Data Mining Approach of Accident Occurrences Identification with Effective Methodology and Implementation , 2018, International Journal of Electrical and Computer Engineering (IJECE).

[14]  Petra Perner,et al.  Data Mining - Concepts and Techniques , 2002, Künstliche Intell..

[15]  Viju Raghupathi,et al.  Big data analytics in healthcare: promise and potential , 2014, Health Information Science and Systems.

[16]  Vijender Kumar Solanki,et al.  A Novel Framework to Use Association Rule Mining for classification of traffic accident severity , 2017 .

[17]  George K. Karagiannidis,et al.  Efficient Machine Learning for Big Data: A Review , 2015, Big Data Res..