Fast Retrieval of Weather Analogues in a Multi-petabytes Archive Using Wavelet-Based Fingerprints

Very large climate data repositories provide a consistent view of weather conditions over long time periods. In some applications and studies, given a current weather pattern (e.g. today’s weather), it is useful to identify similar ones (weather analogues) in the past. Looking for similar patterns in an archive using a brute force approach requires data to be retrieved from the archive and then compared to the query, using a chosen similarity measure. Such operation would be very long and costly. In this work, a wavelet-based fingerprinting scheme is proposed to index all weather patterns from the archive. The scheme allows to answer queries by computing the fingerprint of the query pattern, then comparing them to the index of all fingerprints more efficiently, in order to then retrieve only the corresponding selected data from the archive. The experimental analysis is carried out on the ECMWF’s ERA-Interim reanalyses data representing the global state of the atmosphere over several decades. Results shows that 32 bits fingerprints are sufficient to represent meteorological fields over a 1700 km \({\times }\) 1700 km region and allow the quasi instantaneous retrieval of weather analogues.

[1]  Piotr Porwik,et al.  The Haar – Wavelet Transform in Digital Image Processing : Its Status and Achievements , 2004 .

[2]  J. Thepaut,et al.  Toward a Consistent Reanalysis of the Climate System , 2014 .

[3]  Piotr Indyk,et al.  Nearest-neighbor-preserving embeddings , 2007, TALG.

[4]  John D. Hunter,et al.  Matplotlib: A 2D Graphics Environment , 2007, Computing in Science & Engineering.

[5]  David Salesin,et al.  Wavelets for computer graphics: theory and applications , 1996 .

[6]  Carlo Cattani,et al.  Wavelet clustering in time series analysis , 2005 .

[7]  Michael Unser,et al.  Extension of wavelet compression algorithms to 3D and 4D image data: exploitation of data coherence in higher dimensions allows very high compression ratios , 2001, SPIE Optics + Photonics.

[8]  Renée J. Miller,et al.  Similarity search over time-series data using wavelets , 2002, Proceedings 18th International Conference on Data Engineering.

[9]  James S. Walker,et al.  Wavelet-based Image Compression , 2003 .

[10]  Robert Sausen,et al.  Identification of anthropogenic climate change using a second-generation reanalysis , 2004 .

[11]  Nassir Navab,et al.  Wavelet energy map: A robust support for multi-modal registration of medical images , 2009, CVPR.

[12]  Agma J. M. Traina,et al.  MultiWaveMed: a system for medical image retrieval through wavelets transformations , 2003, 16th IEEE Symposium Computer-Based Medical Systems, 2003. Proceedings..

[13]  I. Daubechies Orthonormal bases of compactly supported wavelets , 1988 .

[14]  Paul H. Whitfield,et al.  Application Potential of Four Nontraditional Similarity Metrics in Hydrometeorology , 2014 .

[15]  Jerome M. Shapiro,et al.  Embedded image coding using zerotrees of wavelet coefficients , 1993, IEEE Trans. Signal Process..

[16]  Srinivasan Parthasarathy,et al.  Structure-based querying of proteins using wavelets , 2006, CIKM '06.

[17]  Luca Delle Monache,et al.  Probabilistic Weather Prediction with an Analog Ensemble , 2013 .

[18]  Gaël Varoquaux,et al.  The NumPy Array: A Structure for Efficient Numerical Computation , 2011, Computing in Science & Engineering.

[19]  Nicola Orio,et al.  Music Retrieval: A Tutorial and Review , 2006, Found. Trends Inf. Retr..

[20]  Hans-Peter Kriegel,et al.  State-of-the-Art in Content-Based Image and Video Retrieval , 2001, Computational Imaging and Vision.

[21]  H. Storch,et al.  The Analog Method as a Simple Statistical Downscaling Technique: Comparison with More Complicated Methods , 1999 .

[22]  David Salesin,et al.  Wavelets for computer graphics: a primer. 2 , 1995, IEEE Computer Graphics and Applications.

[23]  Michael S. Evans,et al.  A Historical Analog-Based Severe Weather Checklist for Central New York and Northeastern Pennsylvania , 2014 .

[24]  David Salesin,et al.  Fast multiresolution image querying , 1995, SIGGRAPH.

[25]  Shumeet Baluja,et al.  Waveprint: Efficient wavelet-based audio fingerprinting , 2008, Pattern Recognit..

[26]  H. M. van den Dool,et al.  A New Look at Weather Forecasting through Analogues , 1989 .

[27]  Shahram Latifi,et al.  A wavelet-based technique for image similarity estimation , 2000, Proceedings International Conference on Information Technology: Coding and Computing (Cat. No.PR00540).

[28]  Andrew Quinn,et al.  Analogues for the railway network of Great Britain , 2016 .

[29]  Brian E. Granger,et al.  IPython: A System for Interactive Scientific Computing , 2007, Computing in Science & Engineering.

[30]  İlkay Darilmaz,et al.  Wavelet based similarity measurement algorithm for seafloor morphology , 2006 .

[31]  A. Walden,et al.  Wavelet Methods for Time Series Analysis , 2000 .

[32]  David Salesin,et al.  Wavelets for computer graphics: a primer.1 , 1995, IEEE Computer Graphics and Applications.

[33]  H. V. D. Dool,et al.  Searching for analogues, how long must we wait? , 1994 .

[34]  James S. Walker,et al.  A Primer on Wavelets and Their Scientific Applications , 1999 .

[35]  Mark C. Serreze,et al.  Climate change and variability using European Centre for Medium‐Range Weather Forecasts reanalysis (ERA‐40) temperatures on the Tibetan Plateau , 2005 .

[36]  M. Ozdemir,et al.  Comparison of statistical methods and wavelet energy coefficients for determining two common PQ disturbances: Sag and swell , 2009, 2009 International Conference on Electrical and Electronics Engineering - ELECO 2009.

[37]  J. Thepaut,et al.  The ERA‐Interim reanalysis: configuration and performance of the data assimilation system , 2011 .

[38]  Eric Jones,et al.  SciPy: Open Source Scientific Tools for Python , 2001 .