Different Patterns of the SARS-CoV-2 Epidemic across Italian Regions: a Hierarchical Clustering on Principal Components approach

Italy has experienced the epidemic of severe acute respiratory syndrome coronavirus 2, which spread at different times and with different intensities throughout its territory. We aimed to identify clusters with similar epidemic patterns across Italian regions. To do that, we defined a set of regional indicators reflecting different domains and employed a hierarchical clustering on principal component approach to obtain an optimal cluster solution. As of 24 April 2020, Lombardy was the worst hit Italian region and entirely separated from all the others. Sensitivity analysis - by excluding data from Lombardy - partitioned the remaining regions into four clusters. Although cluster 1 (i.e. Veneto) and 2 (i.e. Piedmont and Emilia-Romagna) included the most hit regions beyond Lombardy, this partition reflected differences in the efficacy of restrictions and testing strategies. Cluster 3 was heterogeneous and comprised regions where the epidemic started later and/or where it spread with the lowest intensity. Regions within cluster 4 were those where the epidemic started slightly after Veneto, Emilia-Romagna and Piedmont, favoring timely adoption of control measures. Our findings provide policymakers with a snapshot of the epidemic in Italy, which might help guiding the adoption of countermeasures in accordance with the situation at regional level.

[1]  S. Battiato,et al.  Modeling the Novel Coronavirus (SARS-CoV-2) Outbreak in Sicily, Italy , 2020, International journal of environmental research and public health.

[2]  C. Signorelli,et al.  COVID-19 in Italy: impact of containment measures and prevalence estimates of infection in the general population , 2020, Acta bio-medica : Atenei Parmensis.

[3]  S. Battiato,et al.  Estimation of Unreported Novel Coronavirus (SARS-CoV-2) Infections from Reported Deaths: A Susceptible–Exposed–Infectious–Recovered–Dead Model , 2020, Journal of clinical medicine.

[4]  Marco Massa,et al.  Covid-19 epidemic in Italy: evolution, projections and impact of government measures , 2020, European Journal of Epidemiology.

[5]  G. Onder,et al.  Case-Fatality Rate and Characteristics of Patients Dying in Relation to COVID-19 in Italy. , 2020, JAMA.

[6]  Franco Blanchini,et al.  Modelling the COVID-19 epidemic and implementation of population-wide interventions in Italy , 2020, Nature Medicine.

[7]  D. Fisman,et al.  Estimation of COVID-19 outbreak size in Italy , 2020, The Lancet Infectious Diseases.

[8]  M. Day Covid-19: Italy confirms 11 deaths as cases spread from north , 2020, BMJ.

[9]  Yunpeng Ji,et al.  Potential association between COVID-19 mortality and health-care resource availability , 2020, The Lancet Global Health.

[10]  M. Vinciguerra,et al.  How dietary patterns affect left ventricular structure, function and remodelling: evidence from the Kardiovize Brno 2030 study , 2019, Scientific Reports.

[11]  A. Maugeri,et al.  Dietary Patterns are Associated with Leukocyte LINE-1 Methylation in Women: A Cross-Sectional Study in Southern Italy , 2019, Nutrients.

[12]  A. Maugeri,et al.  Maternal Dietary Patterns Are Associated with Pre-Pregnancy Body Mass Index and Gestational Weight Gain: Results from the “Mamma & Bambino” Cohort , 2019, Nutrients.

[13]  M. Vinciguerra,et al.  Association of Dietary Patterns with Metabolic Syndrome: Results from the Kardiovize Brno 2030 Study , 2018, Nutrients.

[14]  A. Maugeri,et al.  The Association of Dietary Patterns with High-Risk Human Papillomavirus Infection and Cervical Cancer: A Cross-Sectional Study in Italy , 2018, Nutrients.

[15]  P. Gemperline,et al.  Principal Component Analysis , 2003, Encyclopedia of Machine Learning.

[16]  Julie Josse,et al.  Principal component methods - hierarchical clustering - partitional clustering: why would we need to choose for visualizing data? , 2010 .