Uncertainty analysis of gross primary production upscaling using Random Forests, remote sensing and eddy covariance data

Abstract The accurate quantification of carbon fluxes at continental spatial scale is important for future policy decisions in the context of global climate change. However, many elements contribute to the uncertainty of such estimate. In this study, the uncertainties of eight days gross primary production (GPP) predicted by Random Forest (RF) machine learning models were analysed at the site, ecosystem and European spatial scales. At the site level, the uncertainties caused by the missing of key drivers were evaluated. The most accurate predictions of eight days GPP were obtained when all available drivers were used (Pearson's correlation coefficient, ρ ~ 0.84; Root Mean Square Error (RMSE) ~ 1.8 g C m−2 d−1). However, when predictions were based on only remotely sensed data the accuracy was close to the optimum (ρ ~ 0.8; RMSE ~ 1.9 g C m−2 d−1) and to a commonly used light use efficiency model (MOD17) with parameters optimised for the applied study sites (the MOD17 +, ρ ~ 0.79; RMSE ~ 2.04 g C m−2 d−1). Remotely sensed data were key drivers for the accurate prediction of GPP in ecosystems with high variability of green biomass over the phenological cycle (e.g., deciduous broad-leaved forests) or highly affected by the human management (e.g. croplands). In contrast, in the ecosystems with low variability of greenness (e.g., evergreen broad-leaved forests), the predictions were poor when meteorological information were not used. At a European spatial scale, when modelled grids of meteorological, land cover and fPAR data were used as inputs, the propagation of their uncertainty, not accounted in the models training, had significant effects on the uncertainty of the mean annual GPP. At this scale, the effects of meteorological uncertainty were higher than the misclassification error. These findings suggested that a strategy based on satellite-measured data could be a favourable improvement for the spatial upscaling of GPP, because avoiding the propagation of the uncertainties of the modelled grids.

[1]  Zhao-Liang Li,et al.  Validation of the land-surface temperature products retrieved from Terra Moderate Resolution Imaging Spectroradiometer data , 2002 .

[2]  K. Hibbard,et al.  A Global Terrestrial Monitoring Network Integrating Tower Fluxes, Flask Sampling, Ecosystem Modeling and EOS Satellite Data , 1999 .

[3]  Chandra Giri,et al.  A comparative analysis of the Global Land Cover 2000 and MODIS land cover data sets , 2005 .

[4]  S. Running,et al.  Global Terrestrial Gross and Net Primary Productivity from the Earth Observing System , 2000 .

[5]  Martin Jung,et al.  A Guided Hybrid Genetic Algorithm for Feature Selection with Expensive Cost Functions , 2013, ICCS.

[6]  M. Pal,et al.  Random forests for land cover classification , 2003, IGARSS 2003. 2003 IEEE International Geoscience and Remote Sensing Symposium. Proceedings (IEEE Cat. No.03CH37477).

[7]  Trevor Hastie,et al.  The Elements of Statistical Learning , 2001 .

[8]  Random Forests for Classific ation in Ecology , 2007 .

[9]  D. Sims,et al.  Potential of MODIS EVI and surface temperature for directly estimating per‐pixel ecosystem C fluxes , 2005 .

[10]  W. Oechel,et al.  A continuous measure of gross primary production for the conterminous United States derived from MODIS and AmeriFlux data , 2010, Remote Sensing of Environment.

[11]  A. Arneth,et al.  Separation of net ecosystem exchange into assimilation and respiration using a light response curve approach: critical issues and global evaluation , 2010 .

[12]  A. Bouwman,et al.  Human alteration of the global nitrogen and phosphorus soil balances for the period 1970–2050 , 2009 .

[13]  S. Wofsy,et al.  Modeling gross primary production of temperate deciduous broadleaf forest using satellite images and climate data , 2004 .

[14]  Tin Kam Ho,et al.  The Random Subspace Method for Constructing Decision Forests , 1998, IEEE Trans. Pattern Anal. Mach. Intell..

[15]  A. Arneth,et al.  Global patterns of land-atmosphere fluxes of carbon dioxide, latent heat, and sensible heat derived from eddy covariance, satellite, and meteorological observations , 2011 .

[16]  W. Oechel,et al.  A new model of gross primary productivity for North American ecosystems based solely on the enhanced vegetation index and land surface temperature from MODIS , 2008 .

[17]  T. Vesala,et al.  On the separation of net ecosystem exchange into assimilation and ecosystem respiration: review and improved algorithm , 2005 .

[18]  J. Pereira,et al.  Modeling spatial patterns of fire occurrence in Mediterranean Europe using Multiple Regression and Random Forest , 2012 .

[19]  Jean-Michel Poggi,et al.  Variable selection using random forests , 2010, Pattern Recognit. Lett..

[20]  Markus Reichstein,et al.  Cross-site evaluation of eddy covariance GPP and RE decomposition techniques , 2008 .

[21]  Anthony O'Hagan,et al.  Probabilistic uncertainty specification: Overview, elaboration techniques and their application to a mechanistic model of carbon flux , 2012, Environ. Model. Softw..

[22]  I. C. Prentice,et al.  A dynamic global vegetation model for studies of the coupled atmosphere‐biosphere system , 2005 .

[23]  I. C. Prentice,et al.  Evaluation of ecosystem dynamics, plant geography and terrestrial carbon cycling in the LPJ dynamic global vegetation model , 2003 .

[24]  Johannes R. Sveinsson,et al.  Random Forests for land cover classification , 2006, Pattern Recognit. Lett..

[25]  D. R. Cutler,et al.  Utah State University From the SelectedWorks of , 2017 .

[26]  B. Gao NDWI—A normalized difference water index for remote sensing of vegetation liquid water from space , 1996 .

[27]  N. Gobron,et al.  Diagnostic assessment of European gross primary production , 2008 .

[28]  Maosheng Zhao,et al.  Sensitivity of Moderate Resolution Imaging Spectroradiometer (MODIS) terrestrial primary production to the accuracy of meteorological reanalyses , 2006 .

[29]  A. Bondeau,et al.  Towards global empirical upscaling of FLUXNET eddy covariance observations: validation of a model tree ensemble approach using a biosphere model , 2009 .

[30]  Lin Sun,et al.  Validation of the land surface temperature derived from HJ-1B/IRS data with ground measurements , 2011, 2011 IEEE International Geoscience and Remote Sensing Symposium.

[31]  Markus Reichstein,et al.  Temporal and among‐site variability of inherent water use efficiency at the ecosystem level , 2009 .

[32]  F. Woodward,et al.  Terrestrial Gross Carbon Dioxide Uptake: Global Distribution and Covariation with Climate , 2010, Science.

[33]  Maosheng Zhao,et al.  Improvements of the MODIS terrestrial gross and net primary production global data set , 2005 .

[34]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[35]  Christopher Potter,et al.  Simulating the impacts of disturbances on forest carbon cycling in North America: processes, data, models, and challenges , 2011 .

[36]  S. Running,et al.  Global products of vegetation leaf area and fraction absorbed PAR from year one of MODIS data , 2002 .

[37]  Markus Reichstein,et al.  Mean annual GPP of Europe derived from its water balance , 2007 .

[38]  Jonas Ardö,et al.  Exploring the potential of MODIS EVI for modeling gross primary production across African ecosystems , 2011 .

[39]  J. Evans,et al.  Gradient modeling of conifer species using random forests , 2009, Landscape Ecology.

[40]  O Anthony,et al.  Probabilistic uncertainty speci cation : Overview , elaboration techniques and their application to a mechanistic model of carbon ux , 2011 .

[41]  Damien Sulla-Menashe,et al.  MODIS Collection 5 global land cover: Algorithm refinements and characterization of new datasets , 2010 .

[42]  R. Valentini,et al.  A new assessment of European forests carbon exchanges by eddy fluxes and artificial neural network spatialization , 2003 .

[43]  V. Brovkin,et al.  Modeling the influence of Greenland ice sheet melting on the Atlantic meridional overturning circulation during the next millennia , 2007 .

[44]  T. Vesala,et al.  Towards a standardized processing of Net Ecosystem Exchange measured with eddy covariance technique: algorithms and uncertainty estimation , 2006 .

[45]  I. C. Prentice,et al.  BIOME3: An equilibrium terrestrial biosphere model based on ecophysiological constraints, resource availability, and competition among plant functional types , 1996 .

[46]  Markus Reichstein,et al.  Uncertainties of modeling gross primary productivity over Europe: A systematic study on the effects of using different drivers and terrestrial biosphere models , 2007 .

[47]  Jonas Ardö,et al.  Evaluation of MODIS gross primary productivity for Africa using eddy covariance data , 2013 .

[48]  Dario Papale,et al.  Eddy Covariance: A Practical Guide to Measurement and Data Analysis , 2012 .

[49]  N. Jones,et al.  Seasonal variation of carbon monoxide in northern Japan: Fourier transform IR measurements and source-labeled model calculations , 2006 .

[50]  A-Xing Zhu,et al.  Developing a continental-scale measure of gross primary production by combining MODIS and AmeriFlux data through Support Vector Machine approach , 2007 .

[51]  Markus Reichstein,et al.  Assessing the ability of three land ecosystem models to simulate gross carbon uptake of forests from boreal to Mediterranean climate in Europe , 2007 .

[52]  J. Lindström,et al.  Towards operational remote sensing of forest carbon balance across Northern Europe , 2007 .

[53]  Ramakrishna R. Nemani,et al.  Evaluation of remote sensing based terrestrial productivity from MODIS using regional tower eddy flux network observations , 2006, IEEE Transactions on Geoscience and Remote Sensing.

[54]  Clemens Beckstein,et al.  Characterization of ecosystem responses to climatic controls using artificial neural networks , 2010 .

[55]  W. Parton,et al.  Equilibration of the terrestrial water, nitrogen, and carbon cycles. , 1997, Proceedings of the National Academy of Sciences of the United States of America.

[56]  Hans Tømmervik,et al.  Prediction of the distribution of Arctic‐nesting pink‐footed geese under a warmer climate scenario , 2007 .