Application of machine-learning methods to milk mid-infrared spectra for discrimination of cow milk from pasture or total mixed ration diets.

The prevalence of "grass-fed" labeled food products on the market has increased in recent years, often commanding a premium price. To date, the majority of methods used for the authentication of grass-fed source products are driven by auditing and inspection of farm records. As such, the ability to verify grass-fed source claims to ensure consumer confidence will be important in the future. Mid-infrared (MIR) spectroscopy is widely used in the dairy industry as a rapid method for the routine monitoring of individual herd milk composition and quality. Further harnessing the data from individual spectra offers a promising and readily implementable strategy to authenticate the milk source at both farm and processor levels. Herein, a comprehensive comparison of the robustness, specificity, and accuracy of 11 machine-learning statistical analysis methods were tested for the discrimination of grass-fed versus non-grass-fed milks based on the MIR spectra of 4,320 milk samples collected from cows on pasture or indoor total mixed ration-based feeding systems over a 3-yr period. Linear discriminant analysis and partial least squares discriminant analysis (PLS-DA) were demonstrated to offer the greatest level of accuracy for the prediction of cow diet from MIR spectra. Parsimonious strategies for the selection of the most discriminating wavelengths within the spectra are also highlighted.

[1]  I. C. Gormley,et al.  Predicting cow milk quality traits from routinely available milk spectra using statistical machine learning methods. , 2021, Journal of dairy science.

[2]  K. Pierce,et al.  Invited review: A 2020 perspective on pasture-based dairy systems and products. , 2021, Journal of dairy science.

[3]  J. Tobin,et al.  Raman spectroscopy: A rapid method to assess the effects of pasture feeding on the nutritional quality of butter. , 2020, Journal of dairy science.

[4]  Mohammad Alothman,et al.  The “Grass-Fed” Milk Story: Understanding the Impact of Pasture Feeding on the Composition and Quality of Bovine Milk , 2019, Foods.

[5]  D. Boichard,et al.  Comparison of Bayesian and partial least squares regression methods for mid-infrared prediction of cheese-making properties in Montbéliarde cows. , 2019, Journal of dairy science.

[6]  B. Moran,et al.  A national methodology to quantify the diet of grazing dairy cows. , 2018, Journal of dairy science.

[7]  F. Miglior,et al.  Comparison of Bayesian regression models and partial least squares regression for the development of infrared prediction equations. , 2017, Journal of dairy science.

[8]  Pat Dillon,et al.  Effect of pasture versus indoor feeding systems on quality characteristics, nutritional composition, and sensory and volatile properties of full-fat Cheddar cheese. , 2017, Journal of dairy science.

[9]  L. Gruber,et al.  The use of mid-infrared spectrometry to estimate the ration composition of lactating dairy cows. , 2017, Journal of dairy science.

[10]  C. Stanton,et al.  Effect of pasture versus indoor feeding systems on raw milk composition and quality over an entire lactation. , 2016, Journal of dairy science.

[11]  A. Raftery,et al.  mclust 5: Clustering, Classification and Density Estimation Using Gaussian Finite Mixture Models , 2016, R J..

[12]  D. Berry,et al.  The potential of Fourier transform infrared spectroscopy of milk samples to predict energy intake and efficiency in dairy cows. , 2016, Journal of dairy science.

[13]  A. Brodkorb,et al.  Prediction of bovine milk technological traits from mid-infrared spectroscopy analysis in dairy cows. , 2015, Journal of dairy science.

[14]  Max Kuhn,et al.  caret: Classification and Regression Training , 2015 .

[15]  J. Pryce,et al.  Mid-infrared spectrometry of milk as a predictor of energy intake and efficiency in lactating dairy cows. , 2014, Journal of dairy science.

[16]  Jan LW Rademaker,et al.  Verification of fresh grass feeding, pasture grazing and organic farming by FTIR spectroscopy analysis of bovine milk , 2014 .

[17]  G. Bittante,et al.  Genetic analysis of the Fourier-transform infrared spectra of bovine milk with emphasis on individual wavelengths related to specific chemical bonds. , 2013, Journal of dairy science.

[18]  P. Dardenne,et al.  Potential use of milk mid-infrared spectra to predict individual methane emission of dairy cows. , 2012, Animal : an international journal of animal bioscience.

[19]  N. Speybroeck Classification and regression trees , 2012, International Journal of Public Health.

[20]  C. Bouveyron,et al.  Gaussian mixture models for the classification of high‐dimensional vibrational spectroscopy data , 2010 .

[21]  Trevor Hastie,et al.  Regularization Paths for Generalized Linear Models via Coordinate Descent. , 2010, Journal of statistical software.

[22]  A. Raftery,et al.  Variable Selection and Updating In Model-Based Discriminant Analysis for High Dimensional Data with Food Authenticity Applications. , 2009, The annals of applied statistics.

[23]  P. Dardenne,et al.  Estimating fatty acid content in cow milk using mid-infrared spectrometry. , 2006, Journal of dairy science.

[24]  H. Zou,et al.  Addendum: Regularization and variable selection via the elastic net , 2005 .

[25]  Eric R. Ziegel,et al.  The Elements of Statistical Learning , 2003, Technometrics.

[26]  Christina Gloeckner,et al.  Modern Applied Statistics With S , 2003 .

[27]  Tong Zhang An Introduction to Support Vector Machines and Other Kernel-Based Learning Methods , 2001, AI Mag..

[28]  Arthur E. Hoerl,et al.  Ridge Regression: Biased Estimation for Nonorthogonal Problems , 2000, Technometrics.

[29]  G. Celeux,et al.  Regularized Gaussian Discriminant Analysis through Eigenvalue Decomposition , 1996 .

[30]  Yoav Freund,et al.  Experiments with a New Boosting Algorithm , 1996, ICML.

[31]  P. Garthwaite An Interpretation of Partial Least Squares , 1994 .

[32]  Wei-Chien Chang On using Principal Components before Separating a Mixture of Two Multivariate Normal Distributions , 1983 .

[33]  A. P. Dawid,et al.  Generative or Discriminative? Getting the Best of Both Worlds , 2007 .

[34]  Andy Liaw,et al.  Classification and Regression by randomForest , 2007 .

[35]  Brian D. Ripley,et al.  Modern applied statistics with S, 4th Edition , 2002, Statistics and computing.

[36]  L. Breiman Random Forests , 2001, Machine Learning.

[37]  R. Tibshirani Regression Shrinkage and Selection via the Lasso , 1996 .