Modeling in Forestry Using Mixture Models Fitted to Grouped and Ungrouped Data

The creation and maintenance of complex forest structures has become an important forestry objective. Complex forest structures, often expressed in multimodal shapes of tree size/diameter (DBH) distributions, are challenging to model. Mixture probability density functions of two- or three-component gamma, log-normal, and Weibull mixture models offer a solution and can additionally provide insights into forest dynamics. Model parameters can be efficiently estimated with the maximum likelihood (ML) approach using iterative methods such as the Newton-Raphson (NR) algorithm. However, the NR algorithm is sensitive to the choice of initial values and does not always converge. As an alternative, we explored the use of the iterative expectation-maximization (EM) algorithm for estimating parameters of the aforementioned mixture models because it always converges to ML estimators. Since forestry data frequently occur both in grouped (classified) and ungrouped (raw) forms, the EM algorithm was applied to explore the goodness-of-fit of the gamma, log-normal, and Weibull mixture distributions in three sample plots that exhibited irregular, multimodal, highly skewed, and heavy-tailed DBH distributions where some size classes were empty. The EM-based goodness-of-fit was further compared against a nonparametric kernel-based density estimation (NK) model and the recently popularized gamma-shaped mixture (GSM) models using the ungrouped data. In this example application, the EM algorithm provided well-fitting two- or three-component mixture models for all three model families. The number of components of the best-fitting models differed among the three sample plots (but not among model families) and the mixture models of the log-normal and gamma families provided a better fit than the Weibull distribution for grouped and ungrouped data. For ungrouped data, both log-normal and gamma mixture distributions outperformed the GSM model and, with the exception of the multimodal diameter distribution, also the NK model. The EM algorithm appears to be a promising tool for modeling complex forest structures.

[1]  J. Diaci,et al.  Gap disturbance patterns of a Fagus sylvatica virgin forest remnant in the mountain vegetation belt of Slovenia , 2005 .

[2]  R. Podlaski Highly skewed and heavy-tailed tree diameter distributions: approximation using the gamma shape mixture model , 2016 .

[3]  Adel Mohammadpour,et al.  EM algorithm for symmetric stable mixture model , 2018, Commun. Stat. Simul. Comput..

[4]  F. Meloni,et al.  Toward a definition of the range of variability of central European mixed Fagus-Abies-Picea forests: the nearly steady-state forest of Lom (Bosnia and Herzegovina) , 2011 .

[5]  Naomi S. Altman,et al.  Bandwidth selection for kernel distribution function estimation , 1995 .

[6]  Kernel density estimation for heavy-tailed distributions using the champernowne transformation , 2005 .

[7]  Fuxiang Liu,et al.  Modeling diameter distributions of mixed-species forest stands , 2014 .

[8]  R. Podlaski Two-Component Mixture Models for Diameter Distributions in Mixed-Species, Two-Age Cohort Stands , 2010, Forest Science.

[9]  B. Commarmot,et al.  Age structure and disturbance dynamics of the relic virgin beech forest Uholka (Ukrainian Carpathians) , 2012 .

[10]  J. Paluch The spatial pattern of a natural European beech (Fagus sylvatica L.)–silver fir (Abies alba Mill.) forest: A patch-mosaic perspective , 2007 .

[11]  J. Diaci,et al.  Intermediate wind disturbance in an old-growth beech-fir forest in southeastern Slovenia , 2006 .

[12]  M. Zasada,et al.  A finite mixture distribution approach for characterizing tree diameter distributions by natural social class in pure even-aged Scots pine stands in Poland , 2005 .

[13]  M. Teimouri,et al.  Statistical inference for Birnbaum-Saunders and Weibull distributions fitted to grouped and ungrouped data , 2020 .

[14]  G. McLachlan,et al.  The EM Algorithm and Extensions: Second Edition , 2008 .

[15]  J. Gove,et al.  A finite mixture of two Weibull distributions for modeling the diameter distributions of rotated-sigmoid, uneven-aged stands , 2001 .

[16]  Alan M. Polansky,et al.  Multistage plug—in bandwidth selection for kernel distribution function estimates , 2000 .

[17]  Joseph Buongiorno,et al.  Tree Size Diversity and Economic Returns in Uneven-Aged Forest Stands , 1994 .

[18]  J. Diaci,et al.  Regeneration patterns after intermediate wind disturbance in an old-growth Fagus-Abies forest in southeastern Slovenia , 2006 .

[19]  L. Zhang,et al.  Fitting irregular diameter distributions of forest stands by Weibull, modified Weibull, and mixture Weibull models , 2006, Journal of Forest Research.

[20]  M. Teimouri EM algorithm for mixture of skew-normal distributions fitted to grouped data , 2020, Journal of applied statistics.

[21]  Min Soo Kang,et al.  Clustering performance comparison using K-means and expectation maximization algorithms , 2014, Biotechnology, biotechnological equipment.

[22]  J. Merganic,et al.  Characterisation of diameter distribution using the Weibull function: method of moments , 2006, European Journal of Forest Research.

[23]  Andrew O. Finley,et al.  ForestFit: An R package for modeling plant size distributions , 2020, Environ. Model. Softw..

[24]  Jeffrey H. Gove,et al.  Rotated sigmoid structures in managed uneven-aged northern hardwood stands: a look at the Burr Type III distribution , 2008 .

[25]  Kevin L. O'Hara,et al.  What is close-to-nature silviculture in a changing world? , 2016 .

[26]  D. Rubin,et al.  Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .

[27]  E. Zenner,et al.  Patchiness in old-growth oriental beech forests across development stages at multiple neighborhood scales , 2019, European Journal of Forest Research.

[28]  E. Zenner,et al.  Integration of small-scale canopy dynamics smoothes live-tree structural complexity across development stages in old-growth Oriental beech (Fagus orientalis Lipsky) forests at the multi-gap scale , 2015 .

[29]  Hans Pretzsch,et al.  Forest Dynamics, Growth and Yield: From Measurement to Model , 2009 .

[30]  Giovanni Parmigiani,et al.  GAMMA SHAPE MIXTURES FOR HEAVY-TAILED DISTRIBUTIONS , 2008, 0807.4663.

[31]  R. Podlaski Forest modelling: the gamma shape mixture model and simulation of tree diameter distributions , 2017, Annals of Forest Science.

[32]  K. Nordhausen,et al.  Estimation of the diameter distribution of a stand marked for cutting using finite mixtures , 2007 .

[33]  Francis A. Roesch,et al.  Modelling diameter distributions of two-cohort forest stands with various proportions of dominant species: a two-component mixture model approach. , 2014, Mathematical biosciences.

[34]  Graciela Estévez-Pérez,et al.  Nonparametric Kernel Distribution Function Estimation with kerdiest: An R Package for Bandwidth Choice and Applications , 2012 .

[35]  Suzilah Ismail,et al.  A Simulation Study of a Parametric Mixture Model of Three Different Distributions to Analyze Heterogeneous Survival Data , 2013 .

[36]  J. Gove,et al.  A Finite Mixture Model for Characterizing the Diameter Distributions of Mixed-Species Forest Stands , 2002, Forest Science.

[37]  Ignacio López-de-Ullibarri Bandwidth Selection in Kernel Distribution Function Estimation , 2015 .

[38]  R. Podlaski,et al.  Modelling irregular and multimodal tree diameter distributions by finite mixture models: an approach to stand structure characterisation , 2012, Journal of Forest Research.

[39]  Mahdi Teimouri,et al.  Modeling tree diameters using mixtures of skewed Student’s t and related distributions , 2020 .

[40]  M. Maltamo,et al.  Comparison of percentile based prediction methods and the Weibull distribution in describing the diameter distribution of heterogeneous Scots pine stands , 2000 .