Data-Driven versus Köppen–Geiger Systems of Climate Classification

Climate zone classification promotes our understanding of the climate and provides a framework for analyzing a range of environmental and socioeconomic data and phenomena. The Köppen–Geiger classification system is the most widely used climate classification scheme. In this study, we compared the climate zones objectively defined using data-driven methods with Köppen–Geiger rule-based classification. Cluster analysis was used to objectively delineate the world’s climatic regions. We applied three clustering algorithms—k-means, ISODATA, and unsupervised random forest classification—to a dataset comprising 10 climatic variables and elevation; we then compared the obtained results with those from the Köppen–Geiger classification system. Results from both the systems were similar for some climatic regions, especially extreme temperature ones such as the tropics, deserts, and polar regions. Data-driven classification identified novel climatic regions that the Köppen–Geiger classification could not. Refinements to the Köppen–Geiger classification, such as precipitation-based subdivisions to existing Köppen–Geiger climate classes like tropical rainforest (Af) and warm summer continental (Dfb), have been suggested based on clustering results. Climatic regions objectively defined by data-driven methods can further the current understanding of climate divisions. On the other hand, rule-based systems, such as the Köppen–Geiger classification, have an advantage in characterizing individual climates. In conclusion, these two approaches can complement each other to form a more objective climate classification system, wherein finer details can be provided by data-driven classification and supported by the intuitive structure of rule-based classification.

[1]  A. Ellis,et al.  Delineating Precipitation Regions of the Contiguous United States from Cluster Analyzed Gridded Data , 2020 .

[2]  F. Méndez-Arriaga,et al.  The temperature and regional climate effects on communitarian COVID-19 contagion in Mexico throughout phase 1 , 2020, Science of The Total Environment.

[3]  P. Jones,et al.  Version 4 of the CRU TS monthly high-resolution gridded multivariate climate dataset , 2020, Scientific Data.

[4]  J. Im,et al.  Delineation of high resolution climate regions over the Korean Peninsula using machine learning approaches , 2019, PloS one.

[5]  Armando Barreto,et al.  A comprehensive survey on impulse and Gaussian denoising filters for digital images , 2019, Signal Process..

[6]  A. Berg,et al.  Present and future Köppen-Geiger climate classification maps at 1-km resolution , 2018, Scientific Data.

[7]  Anil K. Jain Fundamentals of Digital Image Processing , 2018, Control of Color Imaging Systems.

[8]  Tomasz F. Stepinski,et al.  Spatial association between regionalizations using the information-theoretical V-measure , 2018, Int. J. Geogr. Inf. Sci..

[9]  D. Sathiaraj,et al.  Predicting climate types for the Continental United States using unsupervised clustering techniques , 2018, Environmetrics.

[10]  Jim E. Freer,et al.  A Quantitative Hydrological Climate Classification Evaluated With Independent Streamflow Data , 2018, Water Resources Research.

[11]  G. Skok,et al.  Objective climate classification of Slovenia , 2017 .

[12]  Todd R. Yokley,et al.  Absolute humidity and the human nose: A reanalysis of climate zones and their influence on nasal form and function. , 2016, American journal of physical anthropology.

[13]  Tomasz F. Stepinski,et al.  On using a clustering approach for global climate classi , 2015 .

[14]  Hassan A. Kingravi,et al.  Linear, Deterministic, and Order-Invariant Initialization Methods for the K-Means Clustering Algorithm , 2014, ArXiv.

[15]  Maya B. Mathur,et al.  Seasonal Patterns in Human A (H5N1) Virus Infection: Analysis of Global Cases , 2014, PloS one.

[16]  P. Calanca,et al.  Quarantine arthropod invasions in Europe: the role of climate, hosts and propagule pressure , 2014 .

[17]  Peter Carey,et al.  Environmental stratifications as the basis for national, European and global ecological monitoring , 2013 .

[18]  Rob H. G. Jongman,et al.  A high-resolution bioclimate map of the world: a unifying framework for global biodiversity research and monitoring , 2013 .

[19]  S. Gray,et al.  Climate Zone Delineation: Evaluating Approaches for Use in Natural Resource Management , 2012, Environmental Management.

[20]  Bruce L. Webber,et al.  CliMond: global high‐resolution historical and future scenario climate surfaces for bioclimatic modelling , 2012 .

[21]  T. Wrbka,et al.  The potential for integration of environmental data from regional stratifications into a European monitoring framework , 2012 .

[22]  M. Kottek,et al.  Comments on: The thermal zones of the Earth by Wladimir Köppen (1884) , 2011 .

[23]  L. Stroosnijder,et al.  A new agro-climatic classification for crop suitability zoning in northern semi-arid Ethiopia , 2010 .

[24]  Franz Rubel,et al.  Observed and projected climate shifts 1901-2100 depicted by world maps of the Köppen-Geiger climate classification , 2010 .

[25]  J. R. Jensen,et al.  Delineation of climate regions using in-situ and remotely-sensed data for the Carolinas , 2008 .

[26]  T. McMahon,et al.  Updated world map of the Köppen-Geiger climate classification , 2007 .

[27]  David M. Mount,et al.  A Fast Implementation of the Isodata Clustering Algorithm , 2007, Int. J. Comput. Geom. Appl..

[28]  Sergei Vassilvitskii,et al.  k-means++: the advantages of careful seeding , 2007, SODA '07.

[29]  Joydeep Ghosh,et al.  Similarity-Based Text Clustering: A Comparative Study , 2006, Grouping Multidimensional Data.

[30]  B. Rudolf,et al.  World Map of the Köppen-Geiger climate classification updated , 2006 .

[31]  W. Hargrove,et al.  Using Clustered Climate Regimes to Analyze and Compare Predictions from Fully Coupled General Circulation Models , 2005 .

[32]  J. Feddema A Revised Thornthwaite-Type Global Climate Classification , 2005 .

[33]  Jiebo Luo,et al.  Learning multi-label scene classification , 2004, Pattern Recognit..

[34]  Alain Carbonneau,et al.  A multicriteria climatic classification system for grape-growing regions worldwide , 2004 .

[35]  Eric R. Ziegel,et al.  The Elements of Statistical Learning , 2003, Technometrics.

[36]  Yurdanur S. Unal,et al.  Redefining the climate zones of Turkey using cluster analysis , 2003 .

[37]  Arthur T. DeGaetano,et al.  Spatial grouping of United States climate stations using a hybrid clustering approach , 2001 .

[38]  Pedro Larrañaga,et al.  An empirical comparison of four initialization methods for the K-Means algorithm , 1999, Pattern Recognit. Lett..

[39]  M. Sanderson The Classification of Climates from Pythagoras to Koeppen , 1999 .

[40]  Robert G. Fovell,et al.  Consensus Clustering of U.S. Temperature and Precipitation Data , 1997 .

[41]  Robert G. Quayle,et al.  A Historical Perspective of U.S. Climate Divisions , 1996 .

[42]  A. Tsonis,et al.  Assessing the ability of the Köppen system to delineate the general world pattern of climates , 1994 .

[43]  Donald A. Jackson STOPPING RULES IN PRINCIPAL COMPONENTS ANALYSIS: A COMPARISON OF HEURISTICAL AND STATISTICAL APPROACHES' , 1993 .

[44]  R. Fovell,et al.  Climate zones of the conterminous United States defined using cluster analysis , 1993 .

[45]  David E. Stooksbury,et al.  Cluster analysis of Southeastern U.S. climate stations , 1991 .

[46]  G. Trewartha An introduction to climate. , 1980 .

[47]  A. W. Moore,et al.  Classification of climate by pattern analysis with Australasian and southern African data as an example , 1976 .

[48]  P. Jaccard THE DISTRIBUTION OF THE FLORA IN THE ALPINE ZONE.1 , 1912 .

[49]  José Gómez-Zotano,et al.  Applying the Technique of Image Classification to Climate Science: The Case of Andalusia (Spain) , 2016 .

[50]  Greg Hamerly,et al.  Accelerating Lloyd’s Algorithm for k -Means Clustering , 2015 .

[51]  Adil M. Bagirov,et al.  Nonsmooth Optimization Based Algorithms in Cluster Analysis , 2015 .

[52]  Stefan Harmeling,et al.  Climate Classifications: the Value of Unsupervised Clustering , 2012, ICCS.

[53]  Jitendra Kumar,et al.  Parallel k-Means Clustering for Quantitative Ecoregion Delineation Using Large Data Sets , 2011, ICCS.

[54]  D. Gesch,et al.  Global multi-resolution terrain elevation data 2010 (GMTED2010) , 2011 .

[55]  Andy Liaw,et al.  Classification and Regression by randomForest , 2007 .

[56]  H. Frischkorn,et al.  Water Resources of Ceará and Piauí , 2003 .

[57]  T. Caliński,et al.  A dendrite method for cluster analysis , 1974 .

[58]  J. MacQueen Some methods for classification and analysis of multivariate observations , 1967 .

[59]  Wolfgang Pohl,et al.  Eine neue Wandkarte der Klimagebiete der Erde , 1954 .

[60]  Milutin Milanković,et al.  Handbuch der Klimatologie , 1898, Nature.