Gaussian processes uncertainty estimates in experimental Sentinel-2 LAI and leaf chlorophyll content retrieval

Abstract ESA’s upcoming Sentinel-2 (S2) Multispectral Instrument (MSI) foresees to provide continuity to land monitoring services by relying on optical payload with visible, near infrared and shortwave infrared sensors with high spectral, spatial and temporal resolution. This unprecedented data availability leads to an urgent need for developing robust and accurate retrieval methods, which ideally should provide uncertainty intervals for the predictions. Statistical learning regression algorithms are powerful candidats for the estimation of biophysical parameters from satellite reflectance measurements because of their ability to perform adaptive, nonlinear data fitting. In this paper, we focus on a new emerging technique in the field of Bayesian nonparametric modeling. We exploit Gaussian process regression (GPR) for retrieval, which is an accurate method that also provides uncertainty intervals along with the mean estimates. This distinct feature is not shared by other machine learning approaches. In view of implementing the regressor into operational monitoring applications, here the portability of locally trained GPR models was evaluated. Experimental data came from the ESA-led field campaign SPARC (Barrax, Spain). For various simulated S2 configurations (S2-10m, S2-20m and S2-60m) two important biophysical parameters were estimated: leaf chlorophyll content (LCC) and leaf area index (LAI). Local evaluation of an extended training dataset with more variation over bare soil sites led to improved LCC and LAI mapping with reduced uncertainties. GPR reached the 10% precision required by end users, with for LCC a NRMSE of 3.5–9.2% ( r 2 : 0.95–0.99) and for LAI a NRMSE of 6.5–7.3% ( r 2 : 0.95–0.96). The developed GPR models were subsequently applied to simulated Sentinel images over various sites. The associated uncertainty maps proved to be a good indicator for evaluating the robustness of the retrieval performance. The generally low uncertainty intervals over vegetated surfaces suggest that the locally trained GPR models are portable to other sites and conditions.

[1]  Michael E. Tipping The Relevance Vector Machine , 1999, NIPS.

[2]  L. Alonso,et al.  ADVANCES AND LIMITATIONS IN A PARAMETRIC GEOMETRIC CORRECTION OF CHRIS/PROBA DATA , 2005 .

[3]  S. Running,et al.  Synergistic algorithm for estimating vegetation canopy leaf area index and fraction of absorbed photosynthetically active , 1998 .

[4]  C. Atzberger Object-based retrieval of biophysical canopy variables using artificial neural nets and radiative transfer models , 2004 .

[5]  R. Colombo,et al.  Retrieval of leaf area index in different vegetation types using high resolution satellite data , 2003 .

[6]  Luis Alonso,et al.  Evaluation of Sentinel-2 Red-Edge Bands for Empirical Estimation of Green LAI and Chlorophyll Content , 2011, Sensors.

[7]  Gustavo Camps-Valls,et al.  Retrieval of oceanic chlorophyll concentration with relevance vector machines , 2006 .

[8]  José F. Moreno,et al.  Gaussian Process Retrieval of Chlorophyll Content From Imaging Spectroscopy Data , 2013, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing.

[9]  O. Hagolle,et al.  LAI, fAPAR and fCover CYCLOPES global products derived from VEGETATION: Part 1: Principles of the algorithm , 2007 .

[10]  Carl E. Rasmussen,et al.  Gaussian processes for machine learning , 2005, Adaptive computation and machine learning.

[11]  Gustavo Camps-Valls,et al.  Retrieval of Biophysical Parameters With Heteroscedastic Gaussian Processes , 2014, IEEE Geoscience and Remote Sensing Letters.

[12]  F. Baret,et al.  Neural network estimation of LAI, fAPAR, fCover and LAI×Cab, from top of canopy MERIS reflectance data : Principles and validation , 2006 .

[13]  Nadine Gobron,et al.  Exploiting the MODIS albedos with the Two-stream Inversion Package (JRC-TIP): 1. Effective leaf area index, vegetation, and soil properties , 2011 .

[14]  Matthias Drusch,et al.  Sentinel-2: ESA's Optical High-Resolution Mission for GMES Operational Services , 2012 .

[15]  Luis Alonso,et al.  Retrieval of Vegetation Biophysical Parameters Using Gaussian Process Techniques , 2012, IEEE Transactions on Geoscience and Remote Sensing.

[16]  S. Leblanc,et al.  A Shortwave Infrared Modification to the Simple Ratio for LAI Retrieval in Boreal Forests: An Image and Model Analysis , 2000 .

[17]  R. Dickinson,et al.  Evaluation of the Utility of Satellite-Based Vegetation Leaf Area Index Data for Climate Simulations , 2001 .

[18]  Luis Gómez-Chova,et al.  Semi-Supervised Support Vector Biophysical Parameter Estimation , 2008, IGARSS 2008 - 2008 IEEE International Geoscience and Remote Sensing Symposium.

[19]  Lammert Kooistra,et al.  Mapping Vegetation Density in a Heterogeneous River Floodplain Ecosystem Using Pointable CHRIS/PROBA Data , 2012, Remote. Sens..

[20]  R. Colombo,et al.  Inversion of a radiative transfer model with hyperspectral observations for LAI mapping in poplar plantations , 2004 .

[21]  Luis Gómez-Chova,et al.  Learning with the kernel signal to noise ratio , 2012, 2012 IEEE International Workshop on Machine Learning for Signal Processing.

[22]  Yee Whye Teh,et al.  Bayesian Nonparametric Models , 2010, Encyclopedia of Machine Learning.

[23]  Changbao Wu,et al.  Jackknife, Bootstrap and Other Resampling Methods in Regression Analysis , 1986 .

[24]  Klaus Scipal,et al.  Theoretical uncertainty analysis of global MODIS, CYCLOPES, and GLOBCARBON LAI products using a triple collocation method , 2012 .

[25]  L. Alonso,et al.  A red-edge spectral index for remote sensing estimation of green LAI over agroecosystems , 2013 .

[26]  Luis Alonso,et al.  Machine learning regression algorithms for biophysical parameter retrieval: Opportunities for Sentinel-2 and -3 , 2012 .

[27]  F. Baret,et al.  LAI and fAPAR CYCLOPES global products derived from VEGETATION. Part 2: validation and comparison with MODIS collection 4 products , 2007 .

[28]  Frédéric Baret,et al.  Validation of global moderate-resolution LAI products: a framework proposed within the CEOS land product validation subgroup , 2006, IEEE Transactions on Geoscience and Remote Sensing.

[29]  F. Baret,et al.  Estimating Canopy Characteristics from Remote Sensing Observations: Review of Methods and Associated Problems , 2008 .

[30]  Michael E. Tipping Sparse Bayesian Learning and the Relevance Vector Machine , 2001, J. Mach. Learn. Res..

[31]  Lorenzo Bruzzone,et al.  Kernel methods for remote sensing data analysis , 2009 .

[32]  Luis Alonso,et al.  A method for the surface reflectance retrieval from PROBA/CHRIS data over land: application to ESA SPARC campaigns , 2005, IEEE Transactions on Geoscience and Remote Sensing.

[33]  Luis Gómez-Chova,et al.  Biophysical parameter estimation with adaptive Gaussian Processes , 2009, 2009 IEEE International Geoscience and Remote Sensing Symposium.

[34]  Byron Hall Bayesian Inference , 2011 .