Bayesian and Classical Machine Learning Methods: A Comparison for Tree Species Classification with LiDAR Waveform Signatures

A plethora of information contained in full-waveform (FW) Light Detection and Ranging (LiDAR) data offers prospects for characterizing vegetation structures. This study aims to investigate the capacity of FW LiDAR data alone for tree species identification through the integration of waveform metrics with machine learning methods and Bayesian inference. Specifically, we first conducted automatic tree segmentation based on the waveform-based canopy height model (CHM) using three approaches including TreeVaW, watershed algorithms and the combination of TreeVaW and watershed (TW) algorithms. Subsequently, the Random forests (RF) and Conditional inference forests (CF) models were employed to identify important tree-level waveform metrics derived from three distinct sources, such as raw waveforms, composite waveforms, the waveform-based point cloud and the combined variables from these three sources. Further, we discriminated tree (gray pine, blue oak, interior live oak) and shrub species through the RF, CF and Bayesian multinomial logistic regression (BMLR) using important waveform metrics identified in this study. Results of the tree segmentation demonstrated that the TW algorithms outperformed other algorithms for delineating individual tree crowns. The CF model overcomes waveform metrics selection bias caused by the RF model which favors correlated metrics and enhances the accuracy of subsequent classification. We also found that composite waveforms are more informative than raw waveforms and waveform-based point cloud for characterizing tree species in our study area. Both classical machine learning methods (the RF and CF) and the BMLR generated satisfactory average overall accuracy (74% for the RF, 77% for the CF and 81% for the BMLR) and the BMLR slightly outperformed the other two methods. However, these three methods suffered from low individual classification accuracy for the blue oak which is prone to being misclassified as the interior live oak due to the similar characteristics of blue oak and interior live oak. Uncertainty estimates from the BMLR method compensate for this downside by providing classification results in a probabilistic sense and rendering users with more confidence in interpreting and applying classification results to real-world tasks such as forest inventory. Overall, this study recommends the CF method for feature selection and suggests that BMLR could be a superior alternative to classical machining learning methods.

[1]  Barbara Koch,et al.  Exploring full-waveform LiDAR parameters for tree species classification , 2011, Int. J. Appl. Earth Obs. Geoinformation.

[2]  Eric C. Turnblom,et al.  Tree Species Detection Accuracies Using Discrete Point Lidar and Airborne Waveform Lidar , 2012, Remote. Sens..

[3]  Sylvia Frühwirth-Schnatter,et al.  Bayesian Inference in the Multinomial Logit Model , 2016 .

[4]  Heather Reese,et al.  Mapping Tree Canopy Cover and Aboveground Biomass in Sudano-Sahelian Woodlands Using Landsat 8 and Random Forest , 2015, Remote. Sens..

[5]  Felix Morsdorf,et al.  CLUSTERING IN AIRBORNE LASER SCANNING RAW DATA FOR SEGMENTATION OF SINGLE TREES , 2003 .

[6]  Nicholas C. Coops,et al.  Tree species classification in subtropical forests using small-footprint full-waveform LiDAR data , 2016, Int. J. Appl. Earth Obs. Geoinformation.

[7]  Mariana Belgiu,et al.  Random forest in remote sensing: A review of applications and future directions , 2016 .

[8]  Andrew O. Finley,et al.  Modeling Forest Biomass and Growth: Coupling Long-Term Inventory and Lidar Data , 2016 .

[9]  Maggi Kelly,et al.  A New Method for Segmenting Individual Trees from the Lidar Point Cloud , 2012 .

[10]  Héctor Corrada Bravo,et al.  Automated classification of bird and amphibian calls using machine learning: A comparison of methods , 2009, Ecol. Informatics.

[11]  P. Gong,et al.  Isolating individual trees in a savanna woodland using small footprint lidar data , 2006 .

[12]  Philip J. Howarth,et al.  High Spatial Resolution Remote Sensing Data for Forest Ecosystem Classification: An Examination of Spatial Scale , 2000 .

[13]  Y. Wanga,et al.  LIDAR POINT CLOUD BASED FULLY AUTOMATIC 3 D SINGL TREE MODELLING IN FOREST AND EVALUATIONS OF THE PROCEDURE , 2008 .

[14]  Yong Pang,et al.  Characterizing forest canopy structure with lidar composite metrics and machine learning , 2011 .

[15]  N. Coops,et al.  Estimation of forest structure and canopy fuel parameters from small-footprint full-waveform LiDAR data , 2014 .

[16]  Serge Beucher,et al.  The Morphological Approach to Segmentation: The Watershed Transformation , 2018, Mathematical Morphology in Image Processing.

[17]  Randolph H. Wynne,et al.  Estimating plot-level tree heights with lidar : local filtering with a canopy-height based variable window size , 2002 .

[18]  Liviu Theodor Ene,et al.  Comparative testing of single-tree detection algorithms under different types of forest , 2011 .

[19]  Juha Hyyppä,et al.  An International Comparison of Individual Tree Detection and Extraction Using Airborne Laser Scanning , 2012, Remote. Sens..

[20]  Aniruddha Ghosh,et al.  A framework for mapping tree species combining hyperspectral and LiDAR data: Role of selected classifiers and sensor across three spatial scales , 2014, Int. J. Appl. Earth Obs. Geoinformation.

[21]  P. Krzystek,et al.  Tree species classification and estimation of stem volume and DBH based on single tree extraction by exploiting airborne full-waveform LiDAR data , 2012 .

[22]  R. Dubayah,et al.  Estimation of tropical forest structural characteristics using large-footprint lidar , 2002 .

[23]  M. Schlerf,et al.  Remote sensing of forest biophysical variables using HyMap imaging spectrometer data , 2005 .

[24]  John B. Bradford,et al.  Hierarchical Bayesian spatial models for predicting multiple forest variables using waveform LiDAR, hyperspectral imagery, and large inventory datasets , 2013, Int. J. Appl. Earth Obs. Geoinformation.

[25]  Åsa Persson,et al.  Identifying species of individual trees using airborne laser scanner , 2004 .

[26]  Paul-Christian Bürkner,et al.  brms: An R Package for Bayesian Multilevel Models Using Stan , 2017 .

[27]  S. Popescu,et al.  Measuring individual tree crown diameter with lidar and assessing its influence on estimating forest volume and biomass , 2003 .

[28]  John K. Kruschke,et al.  Doing Bayesian Data Analysis: A Tutorial with R, JAGS, and Stan , 2014 .

[29]  B. Koch,et al.  Detection of individual tree crowns in airborne lidar data , 2006 .

[30]  D. Harding,et al.  ICESat waveform measurements of within‐footprint topographic relief and vegetation vertical structure , 2005 .

[31]  R. Milne,et al.  Integrating remote sensing datasets into ecological modelling: a Bayesian approach , 2008 .

[32]  Achim Zeileis,et al.  Bias in random forest variable importance measures: Illustrations, sources and a solution , 2007, BMC Bioinformatics.

[33]  S. Popescu,et al.  Seeing the Trees in the Forest: Using Lidar and Multispectral Data Fusion with Local Filtering and Variable Window Size for Estimating Tree Height , 2004 .

[34]  Juha Hyyppä,et al.  Assessment of Low Density Full-Waveform Airborne Laser Scanning for Individual Tree Detection and Tree Species Classification , 2014 .

[35]  Sorin C. Popescu,et al.  Gold – A novel deconvolution algorithm with optimization for waveform LiDAR processing , 2017 .

[36]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[37]  J. Reitberger,et al.  Analysis of full waveform LIDAR data for the classification of deciduous and coniferous trees , 2008 .

[38]  L. Joseph 4. Bayesian data analysis (2nd edn). Andrew Gelman, John B. Carlin, Hal S. Stern and Donald B. Rubin (eds), Chapman & Hall/CRC, Boca Raton, 2003. No. of pages: xxv + 668. Price: $59.95. ISBN 1‐58488‐388‐X , 2004 .

[39]  Sorin C. Popescu,et al.  Bayesian decomposition of full waveform LiDAR data with uncertainty analysis , 2017 .

[40]  Mohamed Abdel-Aty,et al.  Using conditional inference forests to identify the factors affecting crash severity on arterial corridors. , 2009, Journal of safety research.

[41]  Markus Hollaus,et al.  Tree species classification based on full-waveform airborne laser scanning data , 2009 .

[42]  Sylvie Durrieu,et al.  Stem Volume and Above-Ground Biomass Estimation of Individual Pine Trees From LiDAR Data: Contribution of Full-Waveform Signals , 2013, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing.