Euclidean Distance as a Similarity Metric for Principal Component Analysis

Abstract Eigentechniques, in particular principal component analysis (PCA), have been widely used in meteorological analyses since the early 1950s. Traditionally, choices for the parent similarity matrix, which are diagonalized, have been limited to correlation, covariance, or, rarely, cross products. Whereas each matrix has unique characteristic benefits, all essentially identify parameters that vary together. Depending on what underlying structure the analyst wishes to reveal, similarity matrices can be employed, other than the aforementioned, to yield different results. In this work, a similarity matrix based upon Euclidean distance, commonly used in cluster analysis, is developed as a viable alternative. For PCA, Euclidean distance is converted into Euclidean similarity. Unlike the variance-based similarity matrices, a PCA performed using Euclidean similarity identifies parameters that are close to each other in a Euclidean distance sense. Rather than identifying parameters that change together, the r...

[1]  M. Richman,et al.  Relationships between the Definition of the Hyperplane Width to the Fidelity of Principal Component Loading Patterns. , 1999 .

[2]  Jeffrey L. Anderson A Method for Producing and Evaluating Probabilistic Forecasts from Ensemble Model Integrations , 1996 .

[3]  X. Cheng,et al.  Robustness of Low-Frequency Circulation Patterns Derived from EOF and Rotated EOF Analyses , 1995 .

[4]  Michael B. Richman,et al.  On the Application of Cluster Analysis to Growing Season Precipitation Data in North America East of the Rockies , 1995 .

[5]  Michael D. Eilts,et al.  The Oklahoma Mesonet: A Technical Overview , 1995 .

[6]  I. Jolliffe Rotation of principal components: choice of normalization constraints , 1995 .

[7]  M. Richman,et al.  Rotation of principal components , 1986 .

[8]  Precipitation over Northern Italy: a description by means of principal component analysis , 1983 .

[9]  Thomas R. Karl,et al.  Drought in the United States: 1895–1981 , 1982 .

[10]  Robert F. Cahalan,et al.  Sampling Errors in the Estimation of Empirical Orthogonal Functions , 1982 .

[11]  Recent Secular Variations in Mid-Atlantic Winter Extratropical Storm Climate , 1975 .

[12]  Michael R. Anderberg,et al.  Cluster Analysis for Applications , 1973 .

[13]  J. M. Craddock,et al.  Eigenvectors for representing the 500 mb geopotential surface over the Northern Hemisphere , 1969 .

[14]  John E. Kutzbach,et al.  Empirical Eigenvectors of Sea-Level Pressure, Surface Temperature and Precipitation Complexes over North America , 1967 .

[15]  J. Craddock A Meteorological Application of Principal Component Analysis , 1965 .

[16]  Harry R. Glahn,et al.  Objective Weather Forecasting by Statistical Methods , 1965 .

[17]  H. Harman Modern factor analysis , 1961 .

[18]  H. Kaiser The varimax criterion for analytic rotation in factor analysis , 1958 .

[19]  J. Tukey,et al.  Multiple-Factor Analysis , 1947 .

[20]  H. Hotelling Analysis of a complex of statistical variables into principal components. , 1933 .