Analysis of clustering and selection algorithms for the study of multivariate wave climate

Abstract Recent wave reanalysis databases require the application of techniques capable of managing huge amounts of information. In this paper, several clustering and selection algorithms: K-Means (KMA), self-organizing maps (SOM) and Maximum Dissimilarity (MDA) have been applied to analyze trivariate hourly time series of met-ocean parameters (significant wave height, mean period, and mean wave direction). A methodology has been developed to apply the aforementioned techniques to wave climate analysis, which implies data pre-processing and slight modifications in the algorithms. Results show that: a) the SOM classifies the wave climate in the relevant “wave types” projected in a bidimensional lattice, providing an easy visualization and probabilistic multidimensional analysis; b) the KMA technique correctly represents the average wave climate and can be used in several coastal applications such as longshore drift or harbor agitation; c) the MDA algorithm allows selecting a representative subset of the wave climate diversity quite suitable to be implemented in a nearshore propagation methodology.

[1]  L. A. Stone,et al.  Computer Aided Design of Experiments , 1969 .

[2]  Leo H. Holthuijsen,et al.  Waves in Oceanic and Coastal Waters , 2007 .

[3]  Marcel Zijlema,et al.  WAVE TRANSFORMATION IN FRONT OF THE DUTCH COAST , 2007 .

[4]  Trevor Hastie,et al.  The Elements of Statistical Learning , 2001 .

[5]  Robert H. Weisberg,et al.  Patterns of ocean current variability on the West Florida Shelf using the self-organizing map , 2005 .

[6]  Antonio S. Cofiño,et al.  Clustering Methods for Statistical Downscaling in Short-Range Weather Forecasts , 2004 .

[7]  D. Jacob,et al.  Sensitivity studies with the regional climate model REMO , 1997 .

[8]  C. Guedes Soares,et al.  Hindcast of the wave conditions along the west Iberian coast , 2008 .

[9]  Stéphane Abadie,et al.  Wave climate and longshore drift on the South Aquitaine coast , 2006 .

[10]  Tereza Cavazos Downscaling large-scale circulation to local winter rainfall in north-eastern Mexico , 1997 .

[11]  C. Sordo,et al.  Analysis and downscaling multi-model seasonal forecasts in Peru using self-organizing maps , 2005 .

[12]  Gianpiero Cossarini,et al.  Understanding dynamic of biogeochemical properties in the northern Adriatic Sea by using self‐organizing maps and k‐means clustering , 2007 .

[13]  Gwo-Fong Lin,et al.  Identification of homogeneous regions for regional frequency analysis using the self-organizing map , 2006 .

[14]  Marcos García Sotillo,et al.  A 44-year high-resolution ocean and atmospheric hindcast for the Mediterranean Basin developed within the HIPOCAS Project , 2008 .

[15]  P Willett,et al.  Comparison of algorithms for dissimilarity-based compound selection. , 1997, Journal of molecular graphics & modelling.

[16]  G. Dodet,et al.  Wave climate variability in the North-East Atlantic Ocean over the last six decades , 2010 .

[17]  Paula Camus,et al.  Statistical and dynamical downscaling to transfer wave climate to coastal areas , 2010 .

[18]  C. Guedes Soares,et al.  44-year wave hindcast for the North East Atlantic European coast , 2008 .

[19]  Gerassimos A. Athanassoulis,et al.  Probabilistic description of metocean parameters by means of kernel density models 1. Theoretical background and first results , 2002 .

[20]  Peter Willett,et al.  Molecular diversity techniques for chemical databases , 1996, Information Research.

[21]  Peter Stansby,et al.  Long-term prediction of nearshore wave climate with an application to cliff erosion , 2006 .