Dendrograms for Clustering in Multivariate Analysis: Applications for COVID-19 Vaccination Infodemic Data in Brazil

Since December 2019, with the discovery of a new coronavirus, humanity has been exposed to a large amount of information from different media. Information is not always true and official. Known as an infodemic, false information can increase the negative effects of the pandemic by impairing data readability and disease control. The paper aims to find similar patterns of behavior of the Brazilian population during 2021 in two analyses: with vaccination data of all age groups and using the age group of 64 years or more, representing 13% of the population, using the multivariate analysis technique. Infodemic vaccination information and pandemic numbers were also used. Dendrograms were used as a cluster visualization technique. The result of the generated clusters was verified by two algorithms: the cophenetic correlation coefficient, which obtained satisfactory results above 0.7, and the elbow method, which corroborated the number of clusters found. In the result of the analysis with all age groups, more homogeneous divisions were perceived among Brazilian states, while the second analysis resulted in more heterogeneous clusters, recalling that at the start of vaccinations they could have had fear, doubts, and significant belief in the infodemic.

[1]  F. Jarad,et al.  Comparative study of artificial neural network versus parametric method in COVID-19 data analysis , 2022, Results in Physics.

[2]  N. Vijaykumar,et al.  An Analysis of the Deleterious Impact of the Infodemic during the COVID-19 Pandemic in Brazil: A Case Study Considering Possible Correlations with Socioeconomic Aspects of Brazilian Demography , 2022, International journal of environmental research and public health.

[3]  T. Sindhu,et al.  A novel extended model with versatile shaped failure rate: Statistical inference with Covid -19 applications , 2022, Results in Physics.

[4]  H. Bondell,et al.  Comparing the dynamics of COVID-19 infection and mortality in the United States, India, and Brazil , 2021, Physica D: Nonlinear Phenomena.

[5]  Minjung Lee,et al.  Direct and Indirect Associations of Media Use With COVID-19 Vaccine Hesitancy in South Korea: Cross-sectional Web-Based Survey , 2021, Journal of medical Internet research.

[6]  P. Smittenaar,et al.  The Pace and Pulse of the Fight against Coronavirus across the US, A Google Trends Approach. , 2021, Journal of medical Internet research.

[7]  M. Custodio,et al.  Surface Water Quality in the Mantaro River Watershed Assessed after the Cessation of Anthropogenic Activities Due to the COVID-19 Pandemic , 2021 .

[8]  Gabriela Gruszynski Sanseverino,et al.  COVID-19 fake news diffusion across Latin America , 2021, Social Network Analysis and Mining.

[9]  C. Manchein,et al.  Scrutinizing the heterogeneous spreading of COVID-19 outbreak in large territorial countries , 2020, Physical biology.

[10]  B. Nunes,et al.  Protective behaviors for COVID-19 among Brazilian adults and elderly living with multimorbidity: the ELSI-COVID-19 initiative. , 2020, Cadernos de saude publica.

[11]  Nick James,et al.  COVID-19 in the United States: Trajectories and second surge behavior , 2020, Chaos.

[12]  A. Bhagavathula,et al.  Global Infodemiology of COVID-19: Analysis of Google Web Searches and Instagram Hashtags , 2020, Journal of medical Internet research.

[13]  Adalto Moreira Braz,et al.  CLUSTER ANALYSIS FOR LANDSCAPE TYPOLOGY , 2020 .

[14]  A. Bhagavathula,et al.  COVID-19-Related Web Search Behaviors and Infodemic Attitudes in Italy: Infodemiological Study , 2020, JMIR public health and surveillance.

[15]  Lizandra Brasil Estabel,et al.  Idosos, fake news e letramento informacional , 2020 .

[16]  D. Wang,et al.  The origin, transmission and clinical therapies on coronavirus disease 2019 (COVID-19) outbreak – an update on the status , 2020, Military Medical Research.

[17]  Jabra Zarka,et al.  Coronavirus Goes Viral: Quantifying the COVID-19 Misinformation Epidemic on Twitter , 2020, Cureus.

[18]  Martin McKee,et al.  Google Trends: Opportunities and limitations in health and health policy research. , 2019, Health policy.

[19]  R. Machado Desenvolvimento de um algoritmo imunológico para agrupamento de dados , 2017 .

[20]  J. Brownstein,et al.  Early detection of disease outbreaks using the Internet , 2009, Canadian Medical Association Journal.

[21]  F. Rohlf Adaptive Hierarchical Clustering Schemes , 1970 .

[22]  Ester Paiva Souto,et al.  Hesitação vacinal e os desafios para enfrentamento da pandemia de COVID-19 em idosos no Brasil , 2020 .

[23]  C. E. WHO Coronavirus Disease (COVID-19) Dashboard , 2020 .

[24]  A. Yabrude,et al.  Desafios das Fake News com Idosos durante Infodemia sobre Covid-19: Experiência de Estudantes de Medicina , 2020 .

[25]  Trupti M. Kodinariya,et al.  Review on determining number of Cluster in K-Means Clustering , 2013 .

[26]  Rupa G. Mehta,et al.  Impact of Outlier Removal and Normalization Approach in Modified k-Means Clustering Algorithm , 2011 .