PROGNOSIS OF FOREST PRODUCTION USING MACHINE LEARNING TECHNIQUES

Abstract Forest production and growth are obtained from statistical models that allow the generation of information at the tree or forest stand level. Although the use of regression models is common in forest measurement, there is a constant search for estimation procedures that provide greater accuracy. Recently, machine learning techniques have been used with satisfactory performance in measuring forests. However, methods such as Adaptive Neuro-Fuzzy Inference System (ANFIS) and Random Forest are relatively poorly studied for predicting the volume of wood in eucalyptus plantations in Brazil. Therefore, it is essential to check whether these techniques can provide gains in terms of accuracy. Thus, this study aimed to evaluate the use of Random Forest and ANFIS techniques in the prognosis of forest production. The data used come from continuous forest inventories carried out in stands of eucalyptus clones. The data were divided into 70% for training and 30% for validation. The algorithms used to generate rules in ANFIS were Subtractive Clustering and Fuzzy-C-Means. Besides, training was done with the hybrid algorithm (descending gradient and least squares) with the number of seasons ranging from 1 to 20. Several RFs were trained, varying the number of trees from 50 to 850 and the number of observations by five leaves to 35. Artificial neural networks and decision trees were also trained to compare the feasibility of the techniques. The evaluation of the estimates generated by the techniques for training and validation was calculated based on the following statistics: correlation coefficient (r), relative Bias (RB), and the relative root mean square error (RRMSE) in percentage. In general, the techniques studied in this work showed excellent performance for the training and validation data set with RRMSE values 0.98. The RF presented inferior statistics about the ANFIS for the prognosis of forest production. The Subtractive Clustering (SC) and Fuzzy-C-Means (FCM) algorithms provide accurate baseline and volume projection estimates; both techniques are good alternatives for selecting variables used in modeling forest production.

[1]  Jon Atli Benediktsson,et al.  Automatic selection of molecular descriptors using random forest: Application to drug discovery , 2017, Expert Syst. Appl..

[2]  Guoyi Zhang,et al.  Bias-corrected random forests in regression , 2012 .

[3]  Xinkai Zhu,et al.  Estimation of biomass in wheat using random forest regression algorithm and remote sensing data , 2016 .

[4]  Helio Garcia Leite,et al.  Configuração de redes neurais artificiais para estimação da altura total de árvores de eucalipto , 2016 .

[5]  Krzysztof Siminski,et al.  Interval type-2 neuro-fuzzy system with implication-based inference mechanism , 2017, Expert Syst. Appl..

[6]  Harold E. Burkhart,et al.  Modeling Forest Trees and Stands , 2012, Springer Netherlands.

[7]  Trevor Hastie,et al.  An Introduction to Statistical Learning , 2013, Springer Texts in Statistics.

[8]  Guilherme G. Oliveira,et al.  Simplifying artificial neural network models of river basin behaviour by an automated procedure for input variable selection , 2015, Eng. Appl. Artif. Intell..

[9]  Giovanni Correia Vieira,et al.  Prognoses of diameter and height of trees of eucalyptus using artificial intelligence. , 2018, The Science of the total environment.

[10]  Maria J. Diamantopoulou,et al.  Estimation of Weibull function parameters for modelling tree diameter distribution using least squares and artificial neural networks methods , 2015 .

[11]  N. Goudarzi Free variable selection QSPR study to predict (19)F chemical shifts of some fluorinated organic compounds using Random Forest and RBF-PLS methods. , 2016, Spectrochimica acta. Part A, Molecular and biomolecular spectroscopy.

[12]  A. Kamsin,et al.  Identifying the most significant input parameters for predicting global solar radiation using an ANFIS selection procedure , 2016 .

[13]  Gilciano Saraiva Nogueira,et al.  Individual Growth Model for Eucalyptus Stands in Brazil Using Artificial Neural Network , 2013 .

[14]  Qingzhong Liu,et al.  Predicting injection profiles using ANFIS , 2007, Inf. Sci..

[15]  Robert Babuska,et al.  Neuro-fuzzy methods for nonlinear system identification , 2003, Annu. Rev. Control..

[16]  Ahmet Arslan,et al.  Automatic gender determination from 3D digital maxillary tooth plaster models based on the random forest algorithm and discrete cosine transform , 2017, Comput. Methods Programs Biomed..

[17]  Chinsu Lin,et al.  A decompositional stand structure analysis for exploring stand dynamics of multiple attributes of a mixed-species forest , 2016 .

[18]  Ashfaqur Rahman,et al.  Dissolved oxygen prediction in prawn ponds from a group of one step predictors , 2020 .

[19]  De Li Liu,et al.  High resolution mapping of soil organic carbon stocks using remote sensing variables in the semi-arid rangelands of eastern Australia. , 2018, The Science of the total environment.

[20]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[21]  M. Vastaranta,et al.  Predicting individual tree attributes from airborne laser point clouds based on the random forests technique , 2011 .

[22]  M. Diamantopoulou,et al.  Estimating tree bole volume using artificial neural network models for four species in Turkey. , 2010, Journal of environmental management.

[23]  E. B. Görgens,et al.  Artificial Intelligence Procedures for Tree Taper Estimation within a Complex Vegetation Mosaic in Brazil , 2016, PloS one.

[24]  James C. Bezdek,et al.  Pattern Recognition with Fuzzy Objective Function Algorithms , 1981, Advanced Applications in Pattern Recognition.

[25]  Gerhard Tutz,et al.  Random forest for ordinal responses: Prediction and variable selection , 2016, Comput. Stat. Data Anal..

[26]  Shuguang Zhao,et al.  An optimal clustering mechanism based on Fuzzy-C means for wireless sensor networks , 2017, Sustain. Comput. Informatics Syst..

[27]  Yang Shao,et al.  Comparison of support vector machine, neural network, and CART algorithms for the land-cover classification using limited training data points , 2012 .

[28]  L. S. Davis Forest Management: To Sustain Ecological, Economic, and Social Values , 2005 .

[29]  Jeferson Pereira Martins Silva,et al.  Computational techniques applied to volume and biomass estimation of trees in Brazilian savanna. , 2019, Journal of environmental management.

[30]  Song Won Park,et al.  Variables Selection for Neural Networks Identification for Kraft Recovery Boilers , 2004 .

[31]  Gustavo Eduardo Marcatti,et al.  Modeling of eucalyptus productivity with artificial neural networks , 2020 .

[32]  Helio Garcia Leite,et al.  PROGNOSE EM NÍVEL DE POVOAMENTO DE CLONES DE EUCALIPTO EMPREGANDO REDES NEURAIS ARTIFICIAIS , 2015 .

[33]  Hans Pretzsch,et al.  Representation of species mixing in forest growth models. A review and perspective , 2015 .

[34]  Hannah M. Cooper,et al.  Quantification of sawgrass marsh aboveground biomass in the coastal Everglades using object-based ensemble analysis and Landsat data , 2018 .

[35]  Helio Garcia Leite,et al.  Redes neurais artificiais para estimação do volume de árvores , 2014 .

[36]  S. S. Matin,et al.  Modeling of free swelling index based on variable importance measurements of parent coal properties by random forest method , 2016 .

[37]  Robert Tibshirani,et al.  The Elements of Statistical Learning: Data Mining, Inference, and Prediction, 2nd Edition , 2001, Springer Series in Statistics.

[38]  J. Harris,et al.  Integration of spectral, thermal, and textural features of ASTER data using Random Forests classification for lithological mapping , 2017 .

[39]  J.-S.R. Jang,et al.  Input selection for ANFIS learning , 1996, Proceedings of IEEE 5th International Fuzzy Systems.

[40]  Stephen L. Chiu,et al.  Fuzzy Model Identification Based on Cluster Estimation , 1994, J. Intell. Fuzzy Syst..

[41]  Hans Pretzsch,et al.  Forest Dynamics, Growth and Yield: From Measurement to Model , 2009 .

[42]  Daniela M. Witten,et al.  An Introduction to Statistical Learning: with Applications in R , 2013 .

[43]  Matthew J. Cracknell,et al.  Geological mapping using remote sensing data: A comparison of five machine learning algorithms, their response to variations in the spatial distribution of training data and the use of explicit spatial information , 2014, Comput. Geosci..

[44]  Jonathan P. Resop,et al.  Random Forests for Global and Regional Crop Yield Predictions , 2016, PloS one.