Random projections in reducing the dimensionality of climate simulation data

Random projection (RP) is a dimensionality reduction method that has been earlier applied to high-dimensional data sets, for instance, in image processing. This study presents experimental results of RP applied to simulated global surface temperature data. Principal component analysis (PCA) is utilised to analyse how RP preserves structures when the original data set is compressed down to 10% or 1% of its original volume. Our experiments show that, although information is naturally lost in RP, the main spatial patterns (the principal component loadings) and temporal signatures (spectra of the principal component scores) can nevertheless be recovered from the randomly projected low-dimensional subspaces. Our results imply that RP could be used as a pre-processing step before analysing the structure of high-dimensional climate data sets having many state variables, time steps and spatial locations.

[1]  Michael Botzet,et al.  Ocean Circulation and Tropical Variability in the Coupled Model ECHAM5/MPI-OM , 2006 .

[2]  George Bebis,et al.  Face recognition experiments with random projection , 2005, SPIE Defense + Commercial Sensing.

[3]  H. Storch,et al.  Statistical Analysis in Climate Research , 2000 .

[4]  D. Thomson,et al.  Spectrum estimation and harmonic analysis , 1982, Proceedings of the IEEE.

[5]  E. Guilyardi,et al.  ENSO representation in climate models: from CMIP3 to CMIP5 , 2013, Climate Dynamics.

[6]  Jonathan M. Lees,et al.  Robust estimation of background noise and signal detection in climatic time series , 1996 .

[7]  Jonathon Shlens,et al.  A Tutorial on Principal Component Analysis , 2014, ArXiv.

[8]  Emmanuel J. Candès,et al.  Robust uncertainty principles: exact signal reconstruction from highly incomplete frequency information , 2004, IEEE Transactions on Information Theory.

[9]  J. Rinne,et al.  Empirical orthogonal functions of 500 mb height in the northern hemisphere determined from a large data sample , 1979 .

[10]  Dimitris Achlioptas,et al.  Database-friendly random projections: Johnson-Lindenstrauss with binary coins , 2003, J. Comput. Syst. Sci..

[11]  J. Jungclaus MPI-M Earth System Modelling Framework: millennium control experiment , 2007 .

[12]  David L Donoho,et al.  Compressed sensing , 2006, IEEE Transactions on Information Theory.

[13]  Filipe Aires,et al.  Independent component analysis of multivariate time series: Application to the tropical SST variability , 2000 .

[14]  Xiaojin Zhu Random Projection , .

[15]  Heikki Mannila,et al.  Random projection in dimensionality reduction: applications to image and text data , 2001, KDD '01.

[16]  E.J. Candes,et al.  An Introduction To Compressive Sampling , 2008, IEEE Signal Processing Magazine.

[17]  W. B. Johnson,et al.  Extensions of Lipschitz mappings into Hilbert space , 1984 .

[18]  Peter Frankl,et al.  The Johnson-Lindenstrauss lemma and the sphericity of some graphs , 1987, J. Comb. Theory B.

[19]  Kurt Bryan,et al.  Making Do with Less: An Introduction to Compressed Sensing , 2013, SIAM Rev..

[20]  Ian T. Jolliffe,et al.  Rotation of ill-defined principal components , 1989 .

[21]  Kevin E. Trenberth,et al.  The Southern Oscillation Revisited: Sea Level Pressures, Surface Temperatures, and Precipitation , 2000 .

[22]  Abdel Hannachi,et al.  Pattern hunting in climate: a new method for finding trends in gridded climate data , 2007 .

[23]  Peter Filzmoser,et al.  Introduction to Multivariate Statistical Analysis in Chemometrics , 2009 .

[24]  Michael Ghil,et al.  ADVANCED SPECTRAL METHODS FOR CLIMATIC TIME SERIES , 2002 .

[25]  A. Stewart Fotheringham,et al.  Principal Component Analysis on Spatial Data: An Overview , 2013 .

[26]  Hanchao Qi,et al.  Invariance of principal components under low-dimensional random projection of the data , 2012, 2012 19th IEEE International Conference on Image Processing.

[27]  Sanjoy Dasgupta,et al.  An elementary proof of a theorem of Johnson and Lindenstrauss , 2003, Random Struct. Algorithms.

[28]  I. Jolliffe,et al.  ON RELATIONSHIPS BETWEEN UNCENTRED AND COLUMN-CENTRED PRINCIPAL COMPONENT ANALYSIS , 2009 .

[29]  Ian T. Jolliffe,et al.  Empirical orthogonal functions and related techniques in atmospheric science: A review , 2007 .