Improving Predictability of Multisensor Data with Nonlinear Statistical Methodologies

The evaluation of the forage quality nutritive value and biomass usually takes multiple harvests and is considered time consuming, labor intensive, and expensive. The use of sensors to evaluate different forage traits has been proposed as a method to alleviate this problem. However, most analytical techniques involve the use of traditional linear methods for prediction, but prediction models can still be improved with the use of nonlinear methods. Thus, the objectives of this study are twofold: (i) to evaluate the performance of different prediction methodologies in important forage agronomic traits; and (ii) to evaluate the impact of sensor variables on predictive model performance in the different methods. The nondestructive multisensor system that accommodates spectral, ultrasonic, and laser data was tested on a bermudagrass [Cynodon dactylon (L.) Pers] experiment. Partial least square regression, ridge regression, support vector machine with radial kernel (SVM), and random forest were tested for the prediction performance. The random forest had the best performance, with 0.89 correlation for dry matter yield trait prediction, whereas SVM performed best in the remaining 15 traits, with correlation ranging from 0.72 to 0.95. Besides the prediction performance of statistical models, this study also provided some insight about the importance of variables by removing variables and reevaluating the model performance in dry matter yield. Overall, most of the traits in this study could be reliably predicted by the SVM. The multisensor system can facilitate the measurement of agriculturally important traits in a time-, laborand cost-efficient way. L. Xing, Dep. of Agronomy, Univ. of Florida, Gainesville, FL 326110500; J.J. Pittman and T.J. Butler, Noble Research Institute, Ardmore, OK 73401; L. Inostroza, Instituto de Investigaciones Agropecuarias, Centro Regional de Investigacioìn Quilamapu, Av. Vicente Meìndez, 515, Chillaìn, Chile; P. Munoz, Horticultural Science Dep., Univ. of Florida, 32611. Received 7 Sept. 2017. Accepted 14 Dec. 2017. *Corresponding author (p.munoz@ufl.edu). Assigned to Associate Editor Michael Casler. Abbreviations: ADF, acid detergent fiber; CCCVI, chlorophyll content index; CP, crude protein; CTD, canopy temperature depression; DMY, dry matter yield; dNDF48, 48-h-digestible neutral detergent fiber; FIPAR, fraction intercepted photosynthetically active radiation; HSNDVI, normalized difference vegetative index from a Crop Circle ACS-430 active canopy sensor; IRVI, infrared reflectance vegetative index; iPAR, incident photosynthetically active radiation; IVTDMD, in vitro true dry matter digestibility; LAIVI, leaf area index proxy index; NDF, neutral detergent fiber; NDRE, normalized difference red edge index; NDVI, normalized difference vegetation index; NIR, nearinfrared reflectance; NIRS, near-infrared reflectance spectroscopy; NRMSE, normalized root mean square error; PLS, partial least square; RE, red-edge reflectance; RF, random forest; rPAR, reflected photosynthetic active radiation; SVM, support vector machine; TDN, total digestible nutrients; WSC, water-soluble carbohydrates. Published in Crop Sci. 58:972–981 (2018). doi: 10.2135/cropsci2017.09.0537 © Crop Science Society of America | 5585 Guilford Rd., Madison, WI 53711 USA This is an open access article distributed under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/). Published February 15, 2018

[1]  Luis Inostroza,et al.  Using Aerial Images and Canopy Spectral Reflectance for High‐Throughput Phenotyping of White Clover , 2016 .

[2]  R Core Team,et al.  R: A language and environment for statistical computing. , 2014 .

[3]  Anthony J. Conner,et al.  Breaking through the feed barrier: options for improving forage genetics , 2015 .

[4]  R. Tobias An Introduction to Partial Least Squares Regression , 1996 .

[5]  J. Schjoerring,et al.  Reflectance measurement of canopy biomass and nitrogen status in wheat crops using normalized difference vegetation indices and partial least squares regression , 2003 .

[6]  Andrew K. Skidmore,et al.  Estimation of green grass/herb biomass from airborne hyperspectral imagery using spectral indices and partial least squares regression , 2007, Int. J. Appl. Earth Obs. Geoinformation.

[7]  J. Shenk,et al.  Predicting Forage Quality by Infrared Replectance Spectroscopy , 1976 .

[8]  Ljubomir J. Buturovic,et al.  Cross-validation pitfalls when selecting and assessing regression and classification models , 2014, Journal of Cheminformatics.

[9]  Andy Liaw,et al.  Classification and Regression by randomForest , 2007 .

[10]  Gavin C. Cawley,et al.  On Over-fitting in Model Selection and Subsequent Selection Bias in Performance Evaluation , 2010, J. Mach. Learn. Res..

[11]  Roman Rosipal,et al.  Overview and Recent Advances in Partial Least Squares , 2005, SLSFS.

[12]  Ajit K. Srivastava,et al.  Engineering Principles of Agricultural Machines , 1993 .

[13]  S. Tarantola,et al.  Designing a spectral index to estimate vegetation water content from remote sensing data: Part 1 - Theoretical approach , 2002 .

[14]  Jordi Llorens,et al.  Performance of an Ultrasonic Ranging Sensor in Apple Tree Canopies , 2011, Sensors.

[15]  R. Belyea,et al.  The relationship of temperature-humidity index with milk production of dairy cows in a Mediterranean climate , 2002 .

[16]  Elfatih M. Abdel-Rahman,et al.  Random forest regression and spectral band selection for estimating sugarcane leaf nitrogen concentration using EO-1 Hyperion hyperspectral data , 2013 .

[17]  Javier Hernandez,et al.  Using Ridge Regression Models to Estimate Grain Yield from Field Spectral Data in Bread Wheat (Triticum Aestivum L.) Grown under Three Water Regimes , 2015, Remote. Sens..

[18]  Andrew K. Skidmore,et al.  Dry season mapping of savanna forage quality, using the hyperspectral Carnegie Airborne Observatory sensor , 2011 .

[19]  Wei‐Chiang Hong A hybrid support vector machine regression for exchange rate prediction , 2006 .

[20]  Board on Agriculture Nutrient Requirements of Dairy Cattle , 2016 .

[21]  Philip A. Townsend,et al.  Leaf optical properties reflect variation in photosynthetic metabolism and its sensitivity to temperature , 2011, Journal of experimental botany.

[22]  M. Wachendorf,et al.  Combining ultrasonic sward height and spectral signatures to assess the biomass of legume-grass swards , 2013 .

[23]  W. Raun,et al.  Bermudagrass, Wheat, and Tall Fescue Crude Protein Forage Estimation using Mobile‐Platform, Active‐Spectral and Canopy‐Height Data , 2016 .

[24]  L. Carrascal,et al.  Partial least squares regression as an alternative to current regression methods used in ecology , 2009 .

[25]  K. H. Asay,et al.  Contributions from Breeding Forage and Turf Grasses—An Overview , 1989 .

[26]  Jorge Alberto Elizondo-Salazar,et al.  Estimación lineal de los requerimientos nutricionales del NRC para ganado de leche. , 2014 .

[27]  Josep Peñuelas,et al.  Visible and near-infrared reflectance techniques for diagnosing plant physiological status , 1998 .

[28]  Corey A. Moffet,et al.  Estimation of Biomass and Canopy Height in Bermudagrass, Alfalfa, and Wheat Using Ultrasonic, Laser, and Spectral Sensors , 2015, Sensors.

[29]  A. Greenberg,et al.  Next-generation phenotyping: requirements and strategies for enhancing our understanding of genotype–phenotype relationships and its relevance to crop improvement , 2013, Theoretical and Applied Genetics.

[30]  J. Araus,et al.  Field high-throughput phenotyping: the new crop breeding frontier. , 2014, Trends in plant science.

[31]  Lucia Nemčíková,et al.  Classification and Regression Trees in R , 2014 .

[32]  John R. Miller,et al.  Comparative study between a new nonlinear model and common linear model for analysing laboratory simulated‐forest hyperspectral data , 2009 .

[33]  Richard Simon,et al.  Bias in error estimation when using cross-validation for model selection , 2006, BMC Bioinformatics.

[34]  Ajit K. Srivastava,et al.  Engineering Principles of Agricultural Machines, Second Edition , 2006 .

[35]  V. Mahajan,et al.  Parameter Estimation in Marketing Models in the Presence of Multicollinearity: An Application of Ridge Regression , 1977 .

[36]  A. Barnston Correspondence among the correlation, RMSE, and Heidke forecast verification measures; refinement of the Heidke score , 1992 .

[37]  Jungho Im,et al.  Support vector machines in remote sensing: A review , 2011 .

[38]  A. E. Hoerl,et al.  Ridge regression: biased estimation for nonorthogonal problems , 2000 .

[39]  Albert Stoll,et al.  Guidance of a Forage Harvester with GPS , 2000, Precision Agriculture.

[40]  Radford M. Neal Pattern Recognition and Machine Learning , 2007, Technometrics.

[41]  D. Mertens Regulation of Forage Intake , 2015 .

[42]  I. Lobos,et al.  NIR-Prediction of water-soluble carbohydrate in white clover and its genetic relationship with cold tolerance , 2017 .

[43]  Harald Martens,et al.  A multivariate calibration problem in analytical chemistry solved by partial least-squares models in latent variables , 1983 .

[44]  Mohamed Mohandes,et al.  Support vector machines for wind speed prediction , 2004 .

[45]  Samsuzana Abd Aziz,et al.  Ultrasonic Sensing for Corn Plant Canopy Characterization , 2004 .

[46]  Patrick J. Starks,et al.  Assessment of forage biomass and quality parameters of bermudagrass using proximal sensing of pasture canopy reflectance , 2007 .

[47]  M. Mausbach,et al.  Soil Quality: A Concept, Definition, and Framework for Evaluation (A Guest Editorial) , 1997 .

[48]  A. J. Rook,et al.  Prediction of the voluntary intake of grass silages by beef cattle. 1. Linear regression analyses , 1990 .

[49]  I. E. Frank Intermediate least squares regression method , 1987 .

[50]  Danh V. Nguyen,et al.  Tumor classification by partial least squares using microarray gene expression data , 2002, Bioinform..

[51]  Zhongsheng Hua,et al.  Predicting corporate financial distress based on integration of support vector machine and logistic regression , 2007, Expert Syst. Appl..

[52]  D. Schingoethe Livestock Feeds and Feeding, 5th edition. , 2002 .

[53]  M. Dhanoa,et al.  Prediction of the voluntary intake of grass silages by beef cattle. 2. Principal component and ridge regression analyses , 1990 .

[54]  RAKESH K. Sharma,et al.  Soil Quality , 2019, Encyclopedia of the UN Sustainable Development Goals.

[55]  G. Bélanger,et al.  Near-infrared reflectance spectroscopy prediction of neutral detergent-soluble carbohydrates in timothy and alfalfa. , 2009, Journal of dairy science.

[56]  Mary E. Martin,et al.  HIGH SPECTRAL RESOLUTION REMOTE SENSING OF FOREST CANOPY LIGNIN, NITROGEN, AND ECOSYSTEM PROCESSES , 1997 .

[57]  B. Kowalski,et al.  Partial least-squares regression: a tutorial , 1986 .

[58]  G. W. Burton Progress and Benefits to Humanity from Breeding Warm-Season Forage Grasses , 1989 .

[59]  Paul Geladi,et al.  An example of 2-block predictive partial least-squares regression with simulated data , 1986 .

[60]  S. Wold,et al.  The Collinearity Problem in Linear Regression. The Partial Least Squares (PLS) Approach to Generalized Inverses , 1984 .

[61]  Multispectral Image Analysis for Phosphorus Measurement in Bahia Grass , 2005 .

[62]  S. R. Wilkinson,et al.  Bermudagrass Management in the Southern Piedmont USA: I. Soil and Surface Residue Carbon and Sulfur , 2001 .