Assessing edge pixel classification and growing stock volume estimation in forest stands using a machine learning algorithm and Sentinel-2 data

Sustainable forest management requires accurate and up-to-date baseline data regarding forest structural parameters and the definition of forest stand units. A crucial component in stand characterization is the identification of tree species and to correctly define the stand boundaries. The most common procedure used for this is land cover classification using remote sensing data. However, when classifying forests, misclassification of edge areas can be substantial, yet classification studies often exclude these areas from the accuracy assessment. With the launch of the Sentinel-2 (S2) mission, which provides medium-high spatial resolution satellite imagery, global coverage and high revisit times, new methodologies for defining and estimating stand parameters are being developed. In combination with machine learning algorithms, the use of S2 data to predict forest variables has demonstrated to generate highly accurate results. Random Forest (RF) is one of these algorithms, and has become increasingly popular in environmental studies during the last decade. The combination of the multi-temporal higher resolution S2 images with the ability of the RF algorithm to detect outliers may contribute to improve the classification accuracy of forest edge areas. This study presents a methodology based on a combination of field data and S2 multi-temporal imagery that were analyzed using the RF algorithm in southern Sweden. The aim was to perform a land cover classification to identify forest patches of three tree species (Scots pine, Norway spruce and birch), and to test the inclusion of edge pixels as training data to improve the accuracy at edge areas. For this, a segmented accuracy assessment was proposed, where the accuracy was assessed for interior, intermediate and edge areas, as well as for entire forest patches. The RF algorithm was also used to estimate growing stock volume (GSV), another important forest stand parameter. The results indicate that higher accuracies at edge areas can be obtained when edge pixels are included in the training set. Moreover, the findings describe how different allocation schemes of the training and validation data affect the results. Bands covering the Red Edge, SWIR and a narrow segment of the NIR proved to be beneficial, together with the use of multi-temporal scenes. The GSV estimation yielded inferior results but was able to distinguish the S2 bands most correlated with tree volume. The present study contributes to a better characterization of forest stands and, consequently, to facilitate the generation of forest data required by environmental scientists and the forestry sector. (Less)

[1]  Abbas Alimohammadi,et al.  Land cover mapping based on random forest classification of multitemporal spectral and thermal images , 2015, Environmental Monitoring and Assessment.

[2]  Peter M. Atkinson,et al.  Fusion of Landsat 8 OLI and Sentinel-2 MSI Data , 2017, IEEE Transactions on Geoscience and Remote Sensing.

[3]  Elfatih M. Abdel-Rahman,et al.  Random forest regression and spectral band selection for estimating sugarcane leaf nitrogen concentration using EO-1 Hyperion hyperspectral data , 2013 .

[4]  N. Lu,et al.  Species-specific habitat fragmentation assessment, considering the ecological niche requirements and dispersal capability , 2012 .

[5]  Bryan A. Franz,et al.  Sentinel-2 MultiSpectral Instrument (MSI) data processing for aquatic science applications: Demonstrations and validations , 2017 .

[6]  Meriame Mohajane,et al.  Mapping Forest Species in the Central Middle Atlas of Morocco (Azrou Forest) through Remote Sensing Techniques , 2017, ISPRS Int. J. Geo Inf..

[7]  M. Nilsson,et al.  Countrywide Estimates of Forest Variables Using Satellite Data and Field Data from the National Forest Inventory , 2003, Ambio.

[8]  David Paull,et al.  Machine learning of poorly predictable ecological data , 2006 .

[9]  Qi Chen,et al.  A comparison of classification techniques to support land cover and land use analysis in tropical coastal zones , 2011 .

[10]  Tian Gao,et al.  The role of forest stand structure as biodiversity indicator , 2014 .

[11]  Giorgos Mountrakis,et al.  A meta-analysis of remote sensing research on supervised pixel-based land-cover image classification processes: General guidelines for practitioners and future research , 2016 .

[12]  Carl Kingsford,et al.  What are decision trees? , 2008, Nature Biotechnology.

[13]  F. Chianucci,et al.  Use of Sentinel-2 for forest classification in Mediterranean environments , 2018 .

[14]  J. Peñuelas,et al.  The red edge position and shape as indicators of plant chlorophyll content, biomass and hydric status. , 1994 .

[15]  Uwe Stilla,et al.  Machine Learning Comparison between WorldView-2 and QuickBird-2-Simulated Imagery Regarding Object-Based Urban Land Cover Classification , 2011, Remote. Sens..

[16]  Rok Blagus,et al.  Class prediction for high-dimensional class-imbalanced data , 2010, BMC Bioinformatics.

[17]  Xiao Xiang Zhu,et al.  Deep Learning in Remote Sensing: A Comprehensive Review and List of Resources , 2017, IEEE Geoscience and Remote Sensing Magazine.

[18]  Donald A. Walker,et al.  Accuracy Assessment of a Land-Cover Map of the Kuparu k River Basin, Alaska: Considerations for Remote Regions , 1998 .

[19]  Lorenzo Bruzzone,et al.  A Review of Modern Approaches to Classification of Remote Sensing Data , 2014 .

[20]  Özlem Akar,et al.  Mapping land use with using Rotation Forest algorithm from UAV images , 2017 .

[21]  Giles M. Foody,et al.  Status of land cover classification accuracy assessment , 2002 .

[22]  Ramón Díaz-Uriarte,et al.  Gene selection and classification of microarray data using random forest , 2006, BMC Bioinformatics.

[23]  Jon Atli Benediktsson,et al.  Recent Advances in Techniques for Hyperspectral Image Processing , 2009 .

[24]  J. Parkkinen,et al.  Classification of the reflectance spectra of pine, spruce, and birch. , 1994, Applied optics.

[25]  René Roland Colditz,et al.  An Evaluation of Different Training Sample Allocation Schemes for Discrete and Continuous Land Cover Classification Using Decision Tree-Based Algorithms , 2015, Remote. Sens..

[26]  Sven Wunder,et al.  Incentives to Sustain Forest Ecosystem Services: A Review and Lessons for REDD , 2009 .

[27]  H. Pretzsch Forest Dynamics, Growth, and Yield , 2010 .

[28]  Xiufang Zhu,et al.  Edge-pixels-based support vector data description for specific land-cover distribution mapping , 2015 .

[29]  Andy Liaw,et al.  Classification and Regression by randomForest , 2007 .

[30]  D. Roberts,et al.  Deriving Water Content of Chaparral Vegetation from AVIRIS Data , 2000 .

[31]  E. Næsset,et al.  Combining UAV and Sentinel-2 auxiliary data for forest growing stock volume estimation through hierarchical model-based inference , 2018 .

[32]  Jan O. Mattsson,et al.  Wind erosion on arable land in Scania, Sweden and the relation to the wind climate - a review , 2003 .

[33]  Limin Yang,et al.  Accuracy assessment for the U.S. Geological Survey Regional Land-Cover Mapping Program: New York and New Jersey Region , 2000 .

[34]  Weiqi Zhou,et al.  Comparing Machine Learning Classifiers for Object-Based Land Cover Classification Using Very High Resolution Imagery , 2014, Remote Sensing.

[35]  M. Pal,et al.  Random forests for land cover classification , 2003, IGARSS 2003. 2003 IEEE International Geoscience and Remote Sensing Symposium. Proceedings (IEEE Cat. No.03CH37477).

[36]  Russell G. Congalton,et al.  Assessing the accuracy of remotely sensed data : principles and practices , 1998 .

[37]  Ute Beyer,et al.  Remote Sensing And Image Interpretation , 2016 .

[38]  C. Margules,et al.  Indicators of Biodiversity for Ecologically Sustainable Forest Management , 2000 .

[39]  Consuelo Gonzalo-Martin,et al.  A machine learning approach for agricultural parcel delineation through agglomerative segmentation , 2017 .

[40]  D. R. Cutler,et al.  Utah State University From the SelectedWorks of , 2017 .

[41]  G. Foody Thematic map comparison: Evaluating the statistical significance of differences in classification accuracy , 2004 .

[42]  Yady Tatiana Solano Correa,et al.  Analysis of multitemporal Sentinel-2 images in the framework of the ESA Scientific Exploitation of Operational Missions , 2017, 2017 9th International Workshop on the Analysis of Multitemporal Remote Sensing Images (MultiTemp).

[43]  G. Mallinis,et al.  Assessing the relationships between growing stock volume and Sentinel-2 imagery in a Mediterranean forest ecosystem , 2017 .

[44]  O. Mutanga,et al.  Examining the potential of Sentinel-2 MSI spectral resolution in quantifying above ground biomass across different fertilizer treatments , 2015 .

[45]  Michael Köhl,et al.  New Approaches for Multi Resource Forest Inventories , 2003 .

[46]  A. Prasad,et al.  Newer Classification and Regression Tree Techniques: Bagging and Random Forests for Ecological Prediction , 2006, Ecosystems.

[47]  Tom P. Evans,et al.  An edge-oriented approach to thematic map error assessment , 2012 .

[48]  Gretchen G. Moisen,et al.  Evaluating effectiveness of down-sampling for stratified designs and unbalanced prevalence in Random Forest models of tree species distributions in Nevada , 2012 .

[49]  Mario Chica-Olmo,et al.  An assessment of the effectiveness of a random forest classifier for land-cover classification , 2012 .

[50]  Qihao Weng,et al.  Automated individual tree-crown delineation and treetop detection with very-high-resolution aerial imagery , 2013 .

[51]  Mathieu Fauvel,et al.  Mapping tree species of forests in southwest France using Sentinel-2 image time series , 2017, 2017 9th International Workshop on the Analysis of Multitemporal Remote Sensing Images (MultiTemp).

[52]  Giles M. Foody,et al.  The use of small training sets containing mixed pixels for accurate hard image classification: Training on mixed spectral responses for classification by a SVM , 2006 .

[53]  N. Pfeifer,et al.  Treefall Gap Mapping Using Sentinel-2 Images , 2017 .

[54]  Martin Kappas,et al.  Comparison of Random Forest, k-Nearest Neighbor, and Support Vector Machine Classifiers for Land Cover Classification Using Sentinel-2 Imagery , 2017, Sensors.

[55]  Carolin Strobl,et al.  An AUC-based permutation variable importance measure for random forests , 2013, BMC Bioinformatics.

[56]  Clement Atzberger,et al.  Tree Species Classification with Random Forest Using Very High Spatial Resolution 8-Band WorldView-2 Satellite Data , 2012, Remote. Sens..

[57]  Scott J. Goetz,et al.  Mapping tree height distributions in Sub-Saharan Africa using Landsat 7 and 8 data , 2016 .

[58]  Mark A. Friedl,et al.  A Method for Robust Estimation of Vegetation Seasonality from Landsat and Sentinel-2 Time Series Data , 2018, Remote. Sens..

[59]  Philipp Probst,et al.  To tune or not to tune the number of trees in random forest? , 2017, J. Mach. Learn. Res..

[60]  John van Genderen,et al.  Fundamentals of satellite remote sensing: an environmental approach , 2016, Int. J. Digit. Earth.

[61]  Hongyu Zhao,et al.  Pathway analysis using random forests classification and regression , 2006, Bioinform..

[62]  S. Ullo,et al.  Contribution of Sentinel-2 data for applications in vegetation monitoring , 2016 .

[63]  Annemarie Schneider,et al.  Monitoring land cover change in urban and peri-urban areas using dense time stacks of Landsat satellite data and a data mining approach , 2012 .

[64]  Morten Wang Fagerland,et al.  The McNemar test for binary matched-pairs data: mid-p and asymptotic are better than exact conditional , 2013, BMC Medical Research Methodology.

[65]  Onisimo Mutanga,et al.  High density biomass estimation for wetland vegetation using WorldView-2 imagery and random forest regression algorithm , 2012, Int. J. Appl. Earth Obs. Geoinformation.

[66]  A. Agresti An introduction to categorical data analysis , 1997 .

[67]  Michael A. Lefsky,et al.  Review of studies on tree species classification from remotely sensed data , 2016 .

[68]  Cláudia Maria de Almeida,et al.  Evaluating Sentinel-2 and Landsat-8 Data to Map Sucessional Forest Stages in a Subtropical Forest in Southern Brazil , 2017, Remote. Sens..

[69]  Mariana Belgiu,et al.  Random forest in remote sensing: A review of applications and future directions , 2016 .

[70]  Miina Rautiainen,et al.  Optical properties of leaves and needles for boreal tree species in Europe , 2013 .

[71]  Doreen S. Boyd,et al.  Mapping Complex Urban Land Cover from Spaceborne Imagery: The Influence of Spatial Resolution, Spectral Band Set and Classification Approach , 2016, Remote. Sens..

[72]  M. Canty Image Analysis, Classification, and Change Detection in Remote Sensing , 2006 .

[73]  Francesco Pirotti,et al.  BENCHMARK OF MACHINE LEARNING METHODS FOR CLASSIFICATION OF A SENTINEL-2 IMAGE , 2016 .

[74]  Clement Atzberger,et al.  First Experience with Sentinel-2 Data for Crop and Tree Species Classifications in Central Europe , 2016, Remote. Sens..

[75]  R. Valentini,et al.  Discrimination of tropical forest types, dominant species, and mapping of functional guilds by hyperspectral and simulated multispectral Sentinel-2 data , 2016 .

[76]  N. Demir,et al.  POST-HURRICANE DAMAGE ASSESSMENT ON GREENHOUSE FIELDS WITH USE OF SAR DATA , 2018 .

[77]  David L. Verbyla,et al.  Optimistic bias in classification accuracy assessment , 1996 .

[78]  Mahesh Pal,et al.  Random forest classifier for remote sensing classification , 2005 .

[79]  J. Pereira,et al.  Modeling spatial patterns of fire occurrence in Mediterranean Europe using Multiple Regression and Random Forest , 2012 .

[80]  Meng Liu,et al.  Method for land cover classification accuracy assessment considering edges , 2016, Science China Earth Sciences.

[81]  Brian R. Sturtevant,et al.  Estimation of forest structural parameters using 5 and 10 meter SPOT-5 satellite data , 2009 .