Exploring Machine Learning to Correct Satellite-Derived Sea Surface Temperatures

Machine learning techniques are attractive tools to establish statistical models with a high degree of non linearity. They require a large amount of data to be trained and are therefore particularly suited to analysing remote sensing data. This work is an attempt at using advanced statistical methods of machine learning to predict the bias between Sea Surface Temperature (SST) derived from infrared remote sensing and ground “truth” from drifting buoy measurements. A large dataset of collocation between satellite SST and in situ SST is explored. Four regression models are used: Simple multi-linear regression, Least Square Shrinkage and Selection Operator (LASSO), Generalised Additive Model (GAM) and random forest. In the case of geostationary satellites for which a large number of collocations is available, results show that the random forest model is the best model to predict the systematic errors and it is computationally fast, making it a good candidate for operational processing. It is able to explain nearly 31% of the total variance of the bias (in comparison to about 24% for the multi-linear regression model).

[1]  Amir Hossein Alavi,et al.  Machine learning in geosciences and remote sensing , 2016 .

[2]  Christopher J. Merchant,et al.  Saharan dust in nighttime thermal imagery: Detection and reduction of related biases in retrieved sea surface temperature , 2006 .

[3]  R. Tibshirani Regression Shrinkage and Selection via the Lasso , 1996 .

[4]  Christopher J. Merchant,et al.  Estimation of Sea Surface Temperature from the Spinning Enhanced Visible and Infrared Imager, improved using numerical weather prediction , 2011 .

[5]  Alexander Ignatov,et al.  JP 1 . 13 TOWARDS CONTINUOUS ERROR CHARACTERIZATION OF SEA SURFACE TEMPERATURE IN THE ADVANCED CLEAR-SKY PROCESSOR FOR OCEANS , 2009 .

[6]  Alexander Ignatov,et al.  Sensor-Specific Error Statistics for SST in the Advanced Clear-Sky Processor for Oceans , 2016 .

[7]  Jean Tournadre,et al.  A Multivariate Regression Approach to Adjust AATSR Sea Surface Temperature to In Situ Measurements , 2009, IEEE Geoscience and Remote Sensing Letters.

[8]  R. Tibshirani,et al.  Generalized Additive Models , 1991 .

[9]  Christopher J. Merchant,et al.  Retrievals of sea surface temperature from infrared imagery: origin and form of systematic errors , 2006 .

[10]  H. Roquet,et al.  Six years of OSI-SAF METOP-A AVHRR sea surface temperature , 2015 .

[11]  William J. Emery,et al.  Error characterization of infrared and microwave satellite sea surface temperature products for merging and analysis , 2008 .

[12]  Bo Du,et al.  Deep Learning for Remote Sensing Data: A Technical Tutorial on the State of the Art , 2016, IEEE Geoscience and Remote Sensing Magazine.

[13]  Wolfgang Wagner,et al.  Uncertainty information in climate data records from Earth observation , 2017 .

[14]  C. Donlon,et al.  Toward Improved Validation of Satellite Sea Surface Skin Temperature Measurements for Climate Research , 2002 .

[15]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[16]  C. Bulgin,et al.  Independent uncertainty estimates for coefficient based sea surface temperature retrieval from the Along-Track Scanning Radiometer instruments , 2016 .