Aggregating pixel-level basal area predictions derived from LiDAR data to industrial forest stands in North-Central Idaho

Stand exams are the principal means by which timber companies monitor and manage their forested lands. Airborne LiDAR surveys sample forest stands at much finer spatial resolution and broader spatial extent than is practical on the ground. In this paper, we developed models that leverage spatially intensive and extensive LiDAR data and a stratified random sample of field plots across two mixed conifer forest landscapes in north-central Idaho. Our objective was to compare alternative models for producing unbiased maps of basal area per acre (BAA; ft2/acre), towards the greater goal of developing more accurate and efficient inventory techniques. We generated 60 topographic or stand structure metrics from LiDAR that were used as candidate predictor variables for modeling and mapping BAA at the scale of 30m pixels. Tree diameters were tallied in 1/10 and 1/5 acre fixed-radius plots (N = 165). Four models are presented, all based on 12 predictor variables. The first imputes BAA as an auxiliary variable from an imputation model that uses the machine learning algorithm randomForest in classification mode, and was developed in a prior study to map species-level basal areas of 11 conifer species; the second uses randomForest in regression mode to predict BAA as a single response variable from these same 12 predictor variables. The third is a linear regression model that predicts ln-transformed BAA using a best subset of 12 different predictor variables; the fourth again uses randomForest in regression mode, based on the same best subset of 12 variables selected for the linear regression model. We aggregated the pixel-level predictions within industrial forest stand boundaries, and then used equivalence plots to evaluate how well the aggregated predictions matched independent stand exams (having projected the tree growth in FVS and updated the stand tables to July 2003, the time of the LiDAR acquisition). All four models overpredicted BAA, but the bias was significant only in the case of the regression model. Predictions from the two randomForest models run in regression mode were very similar, despite using different predictor variables. We conclude that randomForest can be used to impute or predict canopy structure information from LiDAR-derived topographic and structural metrics with sufficient accuracy for operational management of conifer forests. In the future, tree lists could be imputed from LiDAR-derived canopy structure metrics empirically related to plot-level tree measurements. This will allow projections of tree growth at the pixel level across forested landscapes, instead of at the stand level as is the current norm.

[1]  Andrew P. Robinson,et al.  Model validation using equivalence tests , 2004 .

[2]  A. Hudak,et al.  Nearest neighbor imputation of species-level, plot-scale forest structure attributes from LiDAR data , 2008 .

[3]  Nicholas L. Crookston,et al.  yaImpute: An R Package for kNN Imputation , 2008 .

[4]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[5]  Albert R. Stage,et al.  An Expression for the Effect of Aspect, Slope, and Habitat Type on Tree Growth , 1976 .

[6]  Janet L. Ohmann,et al.  Predictive mapping of forest composition and structure with direct gradient analysis and nearest- neighbor imputation in coastal Oregon, U.S.A. , 2002 .

[7]  G. E. Dixon Essential FVS: A User's Guide to the Forest Vegetation Simulator , 2007 .

[8]  W. Cohen,et al.  Lidar Remote Sensing for Ecosystem Studies , 2002 .

[9]  Albert R. Stage,et al.  Most Similar Neighbor: An Improved Sampling Inference Procedure for Natural Resource Planning , 1995, Forest Science.

[10]  N. Duan Smearing Estimate: A Nonparametric Retransformation Method , 1983 .

[11]  W. Cohen,et al.  An evaluation of alternate remote sensing products for forest inventory, monitoring, and mapping of Douglas-fir forests in western Oregon , 2001 .

[12]  W. Cohen,et al.  Geographic variability in lidar predictions of forest stand structure in the Pacific Northwest , 2005 .

[13]  J. Marshall,et al.  A regression-based equivalence test for model validation: shifting the burden of proof. , 2005, Tree physiology.

[14]  Andrew Thomas Hudak,et al.  A Multiscale Curvature Algorithm for Classifying Discrete Return LiDAR in Forested Environments , 2007, IEEE Transactions on Geoscience and Remote Sensing.

[15]  R Daubenmire,et al.  Vegetation: identification of typal communities. , 1966, Science.

[16]  P. Gessler,et al.  Regression modeling and mapping of coniferous forest basal area and tree density from discrete-return lidar and multispectral satellite data , 2006 .

[17]  R. J. Pike,et al.  Elevation-Relief Ratio, Hypsometric Integral, and Geomorphic Area-Altitude Analysis , 1971 .

[18]  W. Cohen,et al.  Patterns of covariance between forest stand and canopy structure in the Pacific Northwest , 2005 .

[19]  Andy Liaw,et al.  Classification and Regression by randomForest , 2007 .