Dengue Prediction Using Hierarchical Clustering Methods

The incidence of dengue is increasing rapidly every year. For the welfare of the public, a detailed study of the areas affected by dengue and of its intensity is essential for controlling the disease. This paper uses hierarchical clustering to group the data on reported dengue cases and deaths in various states of India. Agglomerative clustering with Ward's method is used. The outcomes are plotted on a map of India using a shapefile in RStudio. Values for 2018 are predicted by fitting linear regression models to log-transformed data, and the K-Nearest Neighbour algorithm is used to predict the cluster assignments for 2018. The results indicate that the frequency or intensity of dengue is considerably reduced in many states.
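The three steps named in the abstract (Ward clustering, log-linear forecasting, and K-Nearest Neighbour assignment) can be illustrated with a minimal sketch. This is not the authors' code, which was written in R. The function names, the choice of log(count + 1) as the transformation, the use of (log cases, log deaths) as features, and all numbers are assumptions made for illustration only.

```python
import math
from collections import Counter


def ward_clusters(points, k):
    """Greedy agglomerative clustering with Ward's criterion; returns k index lists."""
    clusters = [[i] for i in range(len(points))]

    def centroid(members):
        return [sum(points[i][d] for i in members) / len(members)
                for d in range(len(points[0]))]

    def merge_cost(a, b):
        # Increase in within-cluster sum of squares if clusters a and b are merged.
        d2 = sum((x - y) ** 2 for x, y in zip(centroid(a), centroid(b)))
        return len(a) * len(b) / (len(a) + len(b)) * d2

    while len(clusters) > k:
        _, i, j = min((merge_cost(clusters[i], clusters[j]), i, j)
                      for i in range(len(clusters))
                      for j in range(i + 1, len(clusters)))
        clusters[i] = clusters[i] + clusters[j]
        del clusters[j]
    return clusters


def log_linear_forecast(years, counts, target_year):
    """Fit log(count + 1) = a + b * year by least squares, then extrapolate."""
    logs = [math.log(c + 1) for c in counts]
    n = len(years)
    mean_x = sum(years) / n
    mean_y = sum(logs) / n
    sxx = sum((x - mean_x) ** 2 for x in years)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(years, logs))
    b = sxy / sxx
    a = mean_y - b * mean_x
    return math.exp(a + b * target_year) - 1  # undo the log(count + 1) transform


def knn_predict(train_points, train_labels, query, k=3):
    """Assign the majority label among the k nearest training points (Euclidean)."""
    order = sorted(range(len(train_points)),
                   key=lambda i: math.dist(train_points[i], query))
    votes = Counter(train_labels[i] for i in order[:k])
    return votes.most_common(1)[0][0]


# Made-up (log cases, log deaths) features for four hypothetical states.
features = [(1.0, 1.0), (1.2, 0.9), (5.0, 5.1), (5.2, 4.9)]
clusters = ward_clusters(features, k=2)

# Turn the cluster index lists into one label per state.
labels = [0] * len(features)
for label, members in enumerate(clusters):
    for i in members:
        labels[i] = label

# Made-up yearly case counts for one hypothetical state, extrapolated to 2018.
forecast_2018 = log_linear_forecast([2015, 2016, 2017], [120, 150, 190], 2018)

# Place a hypothetical forecast feature vector into one of the existing clusters.
predicted_cluster = knn_predict(features, labels, (4.8, 5.0), k=3)
```

The pairwise search in `ward_clusters` is cubic in the number of points, which is acceptable for a few dozen states; a library routine such as R's `hclust` or SciPy's `linkage` would be the practical choice for larger data.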
