BIG DATA ANALYTICS AND PRECISION ANIMAL AGRICULTURE SYMPOSIUM: Machine learning and data mining advance predictive big data analysis in precision animal agriculture1

Abstract Precision animal agriculture is poised to rise to prominence in the livestock enterprise in the domains of management, production, welfare, sustainability, health surveillance, and environmental footprint. Considerable progress has been made in the use of tools to routinely monitor and collect information from animals and farms in a less laborious manner than before. These efforts have enabled the animal sciences to embark on information technology-driven discoveries to improve animal agriculture. However, the growing amount and complexity of data generated by fully automated, high-throughput data recording or phenotyping platforms, including digital images, sensor and sound data, unmanned systems, and information obtained from real-time noninvasive computer vision, pose challenges to the successful implementation of precision animal agriculture. The emerging fields of machine learning and data mining are expected to be instrumental in helping meet the daunting challenges facing global agriculture. Yet, their impact and potential in “big data” analysis have not been adequately appreciated in the animal science community, where this recognition has remained only fragmentary. To address such knowledge gaps, this article outlines a framework for machine learning and data mining and offers a glimpse into how they can be applied to solve pressing problems in animal sciences.

[1]  Karl Pearson F.R.S. LIII. On lines and planes of closest fit to systems of points in space , 1901 .

[2]  R. A. Leibler,et al.  On Information and Sufficiency , 1951 .

[3]  J. MacQueen Some methods for classification and analysis of multivariate observations , 1967 .

[4]  G. Wahba,et al.  A Correspondence Between Bayesian Estimation on Stochastic Processes and Smoothing by Splines , 1970 .

[5]  R. Tibshirani Regression Shrinkage and Selection via the Lasso , 1996 .

[6]  Huaiyu Zhu On Information and Sufficiency , 1997 .

[7]  Arthur E. Hoerl,et al.  Ridge Regression: Biased Estimation for Nonorthogonal Problems , 2000, Technometrics.

[8]  Trevor Hastie,et al.  The Elements of Statistical Learning , 2001 .

[9]  Leo Breiman,et al.  Statistical Modeling: The Two Cultures (with comments and a rejoinder by the author) , 2001 .

[10]  Leo Breiman,et al.  Statistical Modeling: The Two Cultures (with comments and a rejoinder by the author) , 2001, Statistical Science.

[11]  Eric R. Ziegel,et al.  The Elements of Statistical Learning , 2003, Technometrics.

[12]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[13]  S. Samarasinghe,et al.  On-line detection of mastitis in dairy herds using artificial neural networks , 2005 .

[14]  D. Kamra Rumen microbial ecosystem , 2005 .

[15]  H. Zou,et al.  Regularization and variable selection via the elastic net , 2005 .

[16]  Hugo Jair Escalante,et al.  A Comparison of Outlier Detection Algorithms for Machine Learning , 2005 .

[17]  Torsten P. Bohlin,et al.  Practical Grey-box Process Identification: Theory and Applications , 2006 .

[18]  H. Zou The Adaptive Lasso and Its Oracle Properties , 2006 .

[19]  Yoshua Bengio,et al.  Extracting and composing robust features with denoising autoencoders , 2008, ICML '08.

[20]  J. Hauth Grey-Box Modelling for Nonlinear Systems , 2008 .

[21]  K. Weigel,et al.  Machine learning classification procedure for selecting SNPs in genomic selection: application to early mortality in broilers. , 2007, Developments in biologicals.

[22]  Jean-Philippe Vert,et al.  Group lasso with overlap and graph lasso , 2009, ICML '09.

[23]  J. Schmidhuber,et al.  A Novel Connectionist System for Unconstrained Handwriting Recognition , 2009, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[24]  K. Nelson,et al.  Gene-centric metagenomics of the fiber-adherent bovine rumen microbiome reveals forage specific glycoside hydrolases , 2009, Proceedings of the National Academy of Sciences.

[25]  Eunseog Youn,et al.  Rumen Bacterial Diversity Dynamics Associated with Changing from Bermudagrass Hay to Grazed Winter Wheat Diets , 2010, Microbial Ecology.

[26]  How to Feed the World in 2050 , 2009 .

[27]  S. Samarasinghe,et al.  Detection of mastitis and its stage of progression by automatic milking systems using artificial neural networks , 2009, Journal of Dairy Research.

[28]  渡邊 澄夫 Algebraic geometry and statistical learning theory , 2009 .

[29]  B. Roe,et al.  Rumen Microbial Population Dynamics during Adaptation to a High-Grain Diet , 2010, Applied and Environmental Microbiology.

[30]  John K Kruschke,et al.  Bayesian data analysis. , 2010, Wiley interdisciplinary reviews. Cognitive science.

[31]  S. Tringe,et al.  Metagenomic Discovery of Biomass-Degrading Genes and Genomes from Cow Rumen , 2011, Science.

[32]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[33]  S. De Vliegher,et al.  Invited review: Mastitis in dairy heifers: nature of the disease, potential impact, prevention, and control. , 2012, Journal of dairy science.

[34]  Florent E. Angly,et al.  Phage-bacteria relationships and CRISPR elements revealed by a metagenomic survey of the rumen microbiome. , 2012, Environmental microbiology.

[35]  J. Jensen,et al.  Screening for outliers in multiple trait genetic evaluation , 2012 .

[36]  B. Hayes,et al.  Metagenomic Predictions: From Microbiome to Complex Health and Environmental Phenotypes in Humans and Cattle , 2013, PloS one.

[37]  Daniel Gianola,et al.  Kernel-based whole-genome prediction of complex traits: a review , 2014, Front. Genet..

[38]  Jørgen Kongsro,et al.  Estimation of pig weight using a Microsoft Kinect prototype imaging system , 2014 .

[39]  Nitish Srivastava,et al.  Dropout: a simple way to prevent neural networks from overfitting , 2014, J. Mach. Learn. Res..

[40]  Yoshua Bengio,et al.  Generative Adversarial Nets , 2014, NIPS.

[41]  Daniel Gianola,et al.  Machine learning methods and predictive ability metrics for genome-wide prediction of complex traits $ , 2014 .

[42]  Jürgen Schmidhuber,et al.  Deep learning in neural networks: An overview , 2014, Neural Networks.

[43]  Kenta Oono,et al.  Chainer : a Next-Generation Open Source Framework for Deep Learning , 2015 .

[44]  Jonathon Shlens,et al.  Explaining and Harnessing Adversarial Examples , 2014, ICLR.

[45]  Shin Ishii,et al.  Distributional Smoothing with Virtual Adversarial Training , 2015, ICLR 2016.

[46]  R. A. Gomes,et al.  Technical note: Estimating body weight and body composition of beef cattle trough digital image analysis. , 2016, Journal of animal science.

[47]  G. Erickson,et al.  Rumen bacterial communities can be acclimated faster to high concentrate diets than currently implemented feedlot programs , 2016, Journal of applied microbiology.

[48]  Martín Abadi,et al.  TensorFlow: Large-Scale Machine Learning on Heterogeneous Distributed Systems , 2016, ArXiv.

[49]  B. White,et al.  Specific microbiome-dependent mechanisms underlie the energy harvest efficiency of ruminants , 2016, The ISME Journal.

[50]  Stephen P. Miller,et al.  Assessing accuracy of imputation using different SNP panel densities in a multi-breed sheep population , 2016, Genetics Selection Evolution.

[51]  Eoin L. Brodie,et al.  Toward a Predictive Understanding of Earth’s Microbiomes to Address 21st Century Challenges , 2016, mBio.

[52]  M. Pérez-Enciso Animal Breeding learning from machine learning. , 2017, Journal of animal breeding and genetics = Zeitschrift fur Tierzuchtung und Zuchtungsbiologie.

[53]  Jose A Navas-Molina,et al.  The Microbiome and Big Data. , 2017, Current opinion in systems biology.

[54]  B. Sturm,et al.  Implementation of machine vision for detecting behaviour of cattle and pigs , 2017 .

[55]  Yinglin Xia,et al.  Hypothesis testing and statistical analysis of microbiome , 2017, Genes & diseases.

[56]  John Joseph Valletta,et al.  Applications of machine learning in animal behaviour studies , 2017, Animal Behaviour.

[57]  D. Berckmans,et al.  Precision livestock farming for the global livestock sector , 2017 .

[58]  D. Donoho 50 Years of Data Science , 2017 .

[59]  Jason Williams,et al.  Unmet needs for analyzing biological big data: A survey of 704 NSF principal investigators , 2017, bioRxiv.