Predicting daily milk yield for primiparous cows using data of within-herd relatives to capture genotype-by-environment interactions.

This study develops and illustrates a hybrid k-medoids, random forest, and support vector regression (K-R-S) approach for predicting the lactation curves of individual primiparous cows within a targeted environment using monthly milk production data from their dams and paternal siblings. The model simulation and evaluation were based on historical test-day (TD) milk production data from 2010 to 2016 for 260 Wisconsin dairy farms. Data from older paternal siblings and dams were used to create family units (n = 6,400) of individual calves, from which their future performance was predicted. Test-day milk yield (MY) records from 2010 to 2014 were used for model training, whereas monthly milk production records of Holstein calves born in 2014 were used for model evaluation. The K-R-S hybrid approach was used to generate MY predictions for 5 randomly selected batches of 320 primiparous cows, which were used to evaluate model performance at the individual cow level by cross-validation. Across all 5 batches, the mean absolute error and the root mean square error of the K-R-S predictions were lower (by 24.2 and 23.4%, respectively) than that of the mean daily MY of paternal siblings. The K-R-S predictions of TD MY were closer to actual values 74.2 ± 2.0% of the time, as compared with means of paternal siblings'. The correlation between actual TD MY and K-R-S predictions was greater (0.34 ± 0.01) than the correlation between the actual yield and the mean of paternal siblings (0.08 ± 0.01). The results of this study demonstrate the effectiveness of the K-R-S hybrid approach for predicting future first-lactation MY of dairy calves in management applications, such as milk production forecasting or decision-support simulation, using only monthly TD yields of within-herd relatives and in the absence of detailed genomic data.

[1]  Emmanuel Frénod,et al.  Comparison of forecast models of production of dairy cows combining animal and diet parameters , 2020, Comput. Electron. Agric..

[2]  S. Oosting,et al.  Accuracy of estimates of milk production per lactation from limited test-day and recall data collected at smallholder dairy farms , 2020 .

[3]  K. Weigel,et al.  Inclusion of herdmate data improves genomic prediction for milk-production and feed-efficiency traits within North American dairy herds. , 2019, Journal of dairy science.

[4]  R. Veerkamp,et al.  Predicting survival in dairy cattle by combining genomic breeding values and phenotypic information. , 2019, Journal of dairy science.

[5]  Paulo César de Resende Andrade,et al.  Predicting first test day milk yield of dairy heifers , 2019, Comput. Electron. Agric..

[6]  M. Endres,et al.  Milk yield and milking station visits of primiparous versus multiparous cows on automatic milking system farms in the Upper Midwest United States. , 2019, Journal of dairy science.

[7]  L. Telo da Gama,et al.  Heterosis in the lactation curves of Girolando cows with emphasis on variations of the individual curves , 2019, Journal of Applied Animal Research.

[8]  D. Jensen,et al.  Dynamic forecasting of individual cow milk yield in automatic milking systems. , 2018, Journal of dairy science.

[9]  K A Weigel,et al.  A 100-Year Review: Methods and impact of genetic selection in dairy cattle-From daughter-dam comparisons to deep learning algorithms. , 2017, Journal of dairy science.

[10]  K. Weigel,et al.  Use of genotype × environment interaction model to accommodate genetic heterogeneity for residual feed intake, dry matter intake, net energy in milk, and metabolic body weight in dairy cattle. , 2017, Journal of dairy science.

[11]  G. de los Campos,et al.  Genotype by environment (climate) interaction improves genomic prediction for production traits in US Holstein cattle. , 2017, Journal of dairy science.

[12]  Fan Zhang,et al.  An automatic model configuration and optimization system for milk production forecasting , 2016, Comput. Electron. Agric..

[13]  D. Döpfer,et al.  Exploring relationships between Dairy Herd Improvement monitors of performance and the Transition Cow Index in Wisconsin dairy herds. , 2016, Journal of dairy science.

[14]  H. Mulder,et al.  Genotype by environment interaction for production, somatic cell score, workability, and conformation traits in Dutch Holstein-Friesian cows between farms with or without grazing. , 2016, Journal of dairy science.

[15]  M. J. O'Mahony,et al.  Comparison of modelling techniques for milk-production forecasting. , 2014, Journal of dairy science.

[16]  M. Mourits,et al.  First-calving age and first-lactation milk production on Dutch dairy farms , 2012, Journal of Dairy Science.

[17]  Daniel Gianola,et al.  Application of support vector regression to genome-assisted prediction of quantitative traits , 2011, Theoretical and Applied Genetics.

[18]  Gaël Varoquaux,et al.  Scikit-learn: Machine Learning in Python , 2011, J. Mach. Learn. Res..

[19]  T. Velmurugan,et al.  Computational Complexity between K-Means and K-Medoids Clustering Algorithms for Normal and Uniform Distributions of Data Points , 2010 .

[20]  Gavin C. Cawley,et al.  On Over-fitting in Model Selection and Subsequent Selection Bias in Performance Evaluation , 2010, J. Mach. Learn. Res..

[21]  S. Brotherstone,et al.  Genotype by environment interaction for first-lactation female fertility traits in UK dairy cattle. , 2009, Journal of dairy science.

[22]  S. Brotherstone,et al.  Prenatal maternal effects on body condition score, female fertility, and milk yield of dairy cows. , 2007, Journal of dairy science.

[23]  R. K. Sharma,et al.  Prediction of first lactation 305-day milk yield in Karan Fries dairy cattle using ANN modeling , 2007, Appl. Soft Comput..

[24]  T. Strabel,et al.  Genetic analysis of milk production traits of polish black and white cattle using large-scale random regression test-day models. , 2006, Journal of dairy science.

[25]  P. VanRaden,et al.  Effectiveness of national and regional sire evaluations in predicting future-daughter milk yield. , 2005, Journal of dairy science.

[26]  Bernhard Schölkopf,et al.  A tutorial on support vector regression , 2004, Stat. Comput..

[27]  N. Gengler,et al.  Prediction of daily milk, fat, and protein production by a random regression test-day model. , 2004, Journal of dairy science.

[28]  D. Vicario,et al.  A multivariate approach to modeling shapes of individual lactation curves in cattle. , 2004, Journal of dairy science.

[29]  M. Goddard,et al.  Genotype x environment interaction for milk production of daughters of Australian dairy sires from test-day records. , 2003, Journal of dairy science.

[30]  D. Boichard,et al.  Modeling lactation curves and estimation of genetic parameters for first lactation test-day records of French Holstein cows. , 2003, Journal of dairy science.

[31]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[32]  William G. Hill,et al.  ESTIMATING VARIANCE COMPONENTS FOR TEST DAY MILK RECORDS BY RESTRICTED MAXIMUM LIKELIHOOD WITH A RANDOM REGRESSION ANIMAL MODEL , 1999 .

[33]  J. Jamrozik,et al.  Estimates of genetic parameters for a test day model with random regressions for yield traits of first lactation Holsteins. , 1997, Journal of dairy science.

[34]  K. Beauchemin,et al.  Compressed baled alfalfa hay for primiparous and multiparous dairy cows. , 1994, Journal of dairy science.

[35]  Peter J. Rousseeuw,et al.  Finding Groups in Data: An Introduction to Cluster Analysis , 1990 .

[36]  W. E. Vinson,et al.  Prediction of Daughter's Performance from Dam's Cow Index Adjusted for Within-Herd Variance , 1988 .

[37]  J.B.M. Wilmink,et al.  Adjustment of lactation yield for age at calving in relation to level of production , 1987 .

[38]  P. D. P. WOOD,et al.  Algebraic Model of the Lactation Curve in Cattle , 1967, Nature.

[39]  P. Boettcher,et al.  Genotype x environment interactions in conventional versus pasture-based dairies in Canada. , 2003, Journal of dairy science.