Comparison of machine learning algorithms for classification of LiDAR points for characterization of canola canopy structure

ABSTRACT Recent changes to plant architectural traits that influence the canopy have produced high yielding cultivars in rice, wheat and maize. In breeding programs, rapid assessments of the crop canopy and other structural traits are needed to facilitate the development of advanced cultivars in other crops such as Canola. LiDAR has the potential to provide insights into plant structural traits such as canopy height, aboveground biomass, and light penetration. These parameters all rely heavily on classifying LiDAR returns as ground or vegetation as they rely on the number of ground returns and the number of vegetation returns. The aim of this study is to propose a point classification method for canola using machine learning approach. The training and testing datasets were clusters sampled from field plots for flower, plant and ground. The supervised learning algorithms chosen are Decision Tree, Random Forest, Support Vector Machine, and Naïve Bayes. K-means Clustering was also used as an unsupervised learning algorithm. The results show that Random Forest models (error rate = 0.006%) are the most accurate to use for canola point classification, followed by Support Vector Machine (0.028%) and Decision Tree (0.169%). Naïve Bayes (2.079%) and K-means Clustering (48.806%) are not suitable for this purpose. This method provides the true ground and canopy in point clouds rather than determining ground points via a fixed height rely on the accuracy of the point clouds, subsequently gives more representative measurements of the crop canopy.

[1]  Erik Næsset,et al.  Mapping defoliation during a severe insect attack on Scots pine using airborne laser scanning , 2006 .

[2]  Johan Holmgren,et al.  Estimation of Tree Lists from Airborne Laser Scanning Using Tree Model Clustering and k-MSN Imputation , 2013, Remote. Sens..

[3]  W. Cohen,et al.  Lidar remote sensing of above‐ground biomass in three biomes , 2002 .

[4]  Xiaohuan Xi,et al.  Estimating Leaf Area Index of Maize Using Airborne Discrete-Return LiDAR Data , 2016, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing.

[5]  Ester van der Voet,et al.  Dynamic stock modelling: A method for the identification and estimation of future waste streams and emissions based on past production and product stock characteristics , 2005 .

[6]  K. Lim,et al.  Lidar remote sensing of biophysical properties of tolerant northern hardwood forests , 2003 .

[7]  F. Baret,et al.  Estimating wheat green area index from ground-based LiDAR measurement using a 3D canopy structure model , 2017 .

[8]  Paul M. Mather,et al.  An assessment of the effectiveness of decision tree methods for land cover classification , 2003 .

[9]  Kristina Koenig,et al.  Radiometric Correction of Terrestrial LiDAR Data for Mapping of Harvest Residues Density , 2013 .

[10]  J. Kirkegaard,et al.  Canola , 2021, Crop Physiology Case Histories for Major Crops.

[11]  En Zhu,et al.  Large-scale k-means clustering via variance reduction , 2018, Neurocomputing.

[12]  Lav R. Khot,et al.  Field-based crop phenotyping: Multispectral aerial imaging for evaluation of winter wheat emergence and spring stand , 2015, Comput. Electron. Agric..

[13]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[14]  Jose A. Jiménez-Berni,et al.  High Throughput Determination of Plant Height, Ground Cover, and Above-Ground Biomass in Wheat with LiDAR , 2018, Front. Plant Sci..

[15]  A. Walter,et al.  Terrestrial 3D laser scanning to track the increase in canopy height of both monocot and dicot crop species under field conditions , 2016, Plant Methods.

[16]  Dong Li,et al.  Combined Use of Airborne LiDAR and Satellite GF-1 Data to Estimate Leaf Area Index, Height, and Aboveground Biomass of Maize During Peak Growing Season , 2015, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing.

[17]  H. Aminpanah Effect of nitrogen rate on seed yield, protein and oil content of two canola (Brassica napus L.) cultivars , 2013, Acta agriculturae Slovenica.

[18]  Philip Lewis,et al.  Fast Automatic Precision Tree Models from Terrestrial Laser Scanner Data , 2013, Remote. Sens..

[19]  Xiaohuan Xi,et al.  Estimating FPAR of maize canopy using airborne discrete-return LiDAR data. , 2014, Optics express.

[20]  Yuyu Zhou,et al.  Terrestrial lidar remote sensing of forests: Maximum likelihood estimates of canopy profile, leaf area index, and leaf angle distribution , 2015 .

[21]  L. Monika Moskal,et al.  Modeling approaches to estimate effective leaf area index from aerial discrete-return LIDAR , 2009 .

[22]  M. Otegui,et al.  Plant population density, row spacing and hybrid effects on maize canopy architecture and light attenuation , 2001 .

[23]  Fan Wenjie,et al.  Retrieving crop fractional cover and LAI based on airborne Lidar data , 2011, National Remote Sensing Bulletin.

[24]  K. Tansey,et al.  Backscatter coefficient as an attribute for the classification of full-waveform airborne laser scanning data in urban areas , 2010 .

[25]  Jose A. Jiménez-Berni,et al.  Proximal Remote Sensing Buggies and Potential Applications for Field-Based Phenotyping , 2014 .

[26]  Matti Maltamo,et al.  Airborne discrete-return LIDAR data in the estimation of vertical canopy cover, angular canopy closure and leaf area index , 2011 .

[27]  Xiaohuan Xi,et al.  Estimating leaf area index of maize using airborne full-waveform lidar data , 2016 .

[28]  Liviu Theodor Ene,et al.  Comparative testing of single-tree detection algorithms under different types of forest , 2011 .

[29]  Jing Xiao,et al.  Fusion of airborne laserscanning point clouds and images for supervised and unsupervised scene classification , 2014 .

[30]  Abhijit Ghatak,et al.  Machine Learning with R , 2017, Springer Singapore.

[31]  S. Chakraborty,et al.  Climate change impacts on plant canopy architecture: implications for pest and pathogen management , 2012, European Journal of Plant Pathology.

[32]  Brett Lantz Machine learning with R : discover how to build machine learning algorithms, prepare data, and dig deep into data prediction techniques with R , 2015 .

[33]  Wei Wu,et al.  Influence of canopy structure on sheath blight epidemics in rice , 2014 .

[34]  Sean M. McMahon,et al.  Ground based LiDAR demonstrates the legacy of management history to canopy structure and composition across a fragmented temperate woodland , 2015 .

[35]  Markus Hollaus,et al.  Object-Based Point Cloud Analysis of Full-Waveform Airborne Laser Scanning Data for Urban Vegetation Classification , 2008, Sensors.

[36]  M. Herold,et al.  Nondestructive estimates of above‐ground biomass using terrestrial laser scanning , 2015 .

[37]  Bernhard Höfle,et al.  Comparative classification analysis of post-harvest growth detection from terrestrial LiDAR point clouds in precision agriculture , 2015 .

[38]  Norbert Pfeifer,et al.  Categorizing Wetland Vegetation by Airborne Laser Scanning on Lake Balaton and Kis-Balaton, Hungary , 2012, Remote. Sens..

[39]  K. Omasa,et al.  3D lidar imaging for detecting and understanding plant responses and canopy structure. , 2006, Journal of experimental botany.

[40]  Yong Pang,et al.  Characterizing forest canopy structure with lidar composite metrics and machine learning , 2011 .

[41]  R. DeFries,et al.  Classification trees: an alternative to traditional land cover classifiers , 1996 .

[42]  E. Costes,et al.  Plant architecture, its diversity and manipulation in agronomic conditions, in relation with pest and pathogen attacks , 2012, European Journal of Plant Pathology.

[43]  B. Li,et al.  Quantitative evaluation of influence of PROSTRATE GROWTH 1 gene on rice canopy structure based on three-dimensional structure model , 2016 .