Predicting Growth and Carcass Traits in Swine Using Microbiome Data and Machine Learning Algorithms

In this paper, we evaluated the power of microbiome measures taken at three time points over the growth test period (weaning, 15 and 22 weeks) to foretell growth and carcass traits in 1039 individuals of a line of crossbred pigs. We measured prediction accuracy as the correlation between actual and predicted phenotypes in a five-fold cross-validation setting. Phenotypic traits measured included live weight measures and carcass composition obtained during the trial as well as at slaughter. We employed a null model excluding microbiome information as a baseline to assess the increase in prediction accuracy stemming from the inclusion of operational taxonomic units (OTU) as predictors. We further contrasted performance of models from the Bayesian alphabet (Bayesian Lasso) as well machine learning approaches (Random Forest and Gradient Boosting) and semi-parametric kernel models (Reproducing Kernel Hilbert space). In most cases, prediction accuracy increased significantly with the inclusion of microbiome data. Accuracy was more substantial with the inclusion of microbiome information taken at weeks 15 and 22, with values ranging from approximately 0.30 for loin traits to more than 0.50 for back fat. Conversely, microbiome composition at weaning resulted in most cases in marginal gains of prediction accuracy, suggesting that later measures might be more useful to include in predictive models. Model choice affected predictions marginally with no clear winner for any model/trait/time point. We, therefore, suggest average prediction across models as a robust strategy in fitting microbiome information. In conclusion, microbiome composition can effectively be used as a predictor of growth and composition traits, particularly for fatness traits. The inclusion of OTU predictors could potentially be used to promote fast growth of individuals while limiting fat accumulation. Early microbiome measures might not be good predictors of growth and OTU information might be best collected at later life stages. Future research should focus on the inclusion of both microbiome as well as host genome information in predictions, as well as the interaction between the two. Furthermore, the influence of the microbiome on feed efficiency as well as carcass and meat quality should be investigated.

[1]  O. González-Recio,et al.  Genome-wide prediction of discrete traits using bayesian regressions and machine learning , 2011, Genetics Selection Evolution.

[2]  Lusheng Huang,et al.  Evaluating the Contribution of Gut Microbiota to the Variation of Porcine Fatness with the Cecum and Fecal Samples , 2016, Front. Microbiol..

[3]  C. Maltecca,et al.  Feed intake, average daily gain, feed efficiency, and real-time ultrasound traits in Duroc pigs: II. Genomewide association. , 2014, Journal of animal science.

[4]  D. Bessesen,et al.  Human gut microbes associated with obesity , 2007 .

[5]  Leo Breiman,et al.  Big Random Forests: Classification and Regression Forests forLarge Data Sets , 2014 .

[6]  G. de los Campos,et al.  Genotype by environment (climate) interaction improves genomic prediction for production traits in US Holstein cattle. , 2017, Journal of dairy science.

[7]  T. Tribout,et al.  Optimized management of genetic variability in selected pig populations. , 2008, Journal of animal breeding and genetics = Zeitschrift fur Tierzuchtung und Zuchtungsbiologie.

[8]  William A. Walters,et al.  QIIME allows analysis of high-throughput community sequencing data , 2010, Nature Methods.

[9]  Q. Zebeli,et al.  Adaptation of the Cecal Bacterial Microbiome of Growing Pigs in Response to Resistant Starch Type 4 , 2015, Applied and Environmental Microbiology.

[10]  Dominik Endres,et al.  A new metric for probability distributions , 2003, IEEE Transactions on Information Theory.

[11]  Lusheng Huang,et al.  16S rRNA gene-based association study identified microbial taxa associated with pork intramuscular fat content in feces and cecum lumen , 2017, BMC Microbiology.

[12]  J Crossa,et al.  Genomic prediction in CIMMYT maize and wheat breeding programs , 2013, Heredity.

[13]  C. Baes,et al.  Invited review: Inbreeding in the genomics era: Inbreeding, inbreeding depression, and management of genomic variability. , 2017, Journal of dairy science.

[14]  Daniel Gianola,et al.  Machine learning methods and predictive ability metrics for genome-wide prediction of complex traits $ , 2014 .

[15]  C. Maltecca,et al.  Predicting Growth and Carcass Traits in Swine Using Metagenomic Data and Machine Learning Algorithms , 2018, bioRxiv.

[16]  D. Gianola,et al.  Reproducing Kernel Hilbert Spaces Regression Methods for Genomic Assisted Prediction of Quantitative Traits , 2008, Genetics.

[17]  G. Hartman,et al.  Metagenome-Wide Association Study and Machine Learning Prediction of Bulk Soil Microbiome and Crop Productivity , 2017, Front. Microbiol..

[18]  Tad S Sonstegard,et al.  Genome-wide association analysis of thirty one production, health, reproduction and body conformation traits in contemporary U.S. Holstein cows , 2011, BMC Genomics.

[19]  V Kishore Ayyadevara,et al.  Gradient Boosting Machine , 2018 .

[20]  M. Pop,et al.  Metagenomic Analysis of the Human Distal Gut Microbiome , 2006, Science.

[21]  Jo Handelsman,et al.  Toward a Census of Bacteria in Soil , 2006, PLoS Comput. Biol..

[22]  Marco Lopez-Cruz,et al.  Increased Prediction Accuracy in Wheat Breeding Trials Using a Marker × Environment Interaction Genomic Selection Model , 2015, G3: Genes, Genomes, Genetics.

[23]  E. Zoetendal,et al.  Age, introduction of solid feed and weaning are more important determinants of gut bacterial succession in piglets than breed and nursing mother as revealed by a reciprocal cross-fostering model. , 2016, Environmental microbiology.

[24]  Edoardo Pasolli,et al.  Machine Learning Meta-analysis of Large Metagenomic Datasets: Tools and Biological Insights , 2016, PLoS Comput. Biol..

[25]  D. Bates,et al.  Fitting Linear Mixed-Effects Models Using lme4 , 2014, 1406.5823.

[26]  Greg Ridgeway,et al.  Generalized Boosted Models: A guide to the gbm package , 2006 .

[27]  José Crossa,et al.  Predicting Quantitative Traits With Regression Models for Dense Molecular Markers and Pedigree , 2009, Genetics.

[28]  José Crossa,et al.  Semi-parametric genomic-enabled prediction of genetic values using reproducing kernel Hilbert spaces methods. , 2010, Genetics research.

[29]  G. de los Campos,et al.  Genome-Wide Regression and Prediction with the BGLR Statistical Package , 2014, Genetics.

[30]  Donagh Berry,et al.  Exploring a Possible Link between the Intestinal Microbiota and Feed Efficiency in Pigs , 2017, Applied and Environmental Microbiology.

[31]  Daniel Gianola,et al.  Kernel-based whole-genome prediction of complex traits: a review , 2014, Front. Genet..

[32]  Jonathan R. Brestoff,et al.  Commensal bacteria at the interface of host metabolism and the immune system , 2013, Nature Immunology.

[33]  C. Maltecca,et al.  The relationship between different measures of feed efficiency and feeding behavior traits in Duroc pigs. , 2017, Journal of animal science.

[34]  Lusheng Huang,et al.  Unraveling the Fecal Microbiota and Metagenomic Functional Capacity Associated with Feed Efficiency in Pigs , 2017, Front. Microbiol..

[35]  R Core Team,et al.  R: A language and environment for statistical computing. , 2014 .

[36]  James Versalovic,et al.  Human microbiome in health and disease. , 2012, Annual review of pathology.

[37]  Steven Salzberg,et al.  BIOINFORMATICS ORIGINAL PAPER , 2004 .

[38]  Andy Liaw,et al.  Classification and Regression by randomForest , 2007 .

[39]  J. Clemente,et al.  Gut Microbiota from Twins Discordant for Obesity Modulate Metabolism in Mice , 2013, Science.

[40]  J. Clemente,et al.  The Long-Term Stability of the Human Gut Microbiota , 2013 .

[41]  R. Wellmann,et al.  Host Genome Influence on Gut Microbial Composition and Microbial Prediction of Complex Traits in Pigs , 2017, Genetics.

[42]  Keiichi Suzuki,et al.  Genetic parameters for measures of residual feed intake and growth traits in seven generations of Duroc pigs , 2009 .

[43]  Gota Morota,et al.  BIG DATA ANALYTICS AND PRECISION ANIMAL AGRICULTURE SYMPOSIUM: Machine learning and data mining advance predictive big data analysis in precision animal agriculture1 , 2018, Journal of animal science.

[44]  C. Maltecca,et al.  Feed intake, average daily gain, feed efficiency, and real-time ultrasound traits in Duroc pigs: I. Genetic parameter estimation and accuracy of genomic prediction. , 2014, Journal of animal science.

[45]  C. Maltecca,et al.  Contribution of Microbiome to Variation in Fat and Growth Traits in Crossbred Pigs , 2018 .

[46]  Xun Xu,et al.  A reference gene catalogue of the pig gut microbiome , 2016, Nature Microbiology.

[47]  Christian Maltecca,et al.  Effectiveness of genomic prediction on milk flow traits in dairy cattle , 2012, Genetics Selection Evolution.

[48]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[49]  Daniel Gianola,et al.  Additive Genetic Variability and the Bayesian Alphabet , 2009, Genetics.

[50]  Christian Maltecca,et al.  Accounting for trait architecture in genomic predictions of US Holstein cattle using a weighted realized relationship matrix , 2015, Genetics Selection Evolution.

[51]  C. F. Hansen,et al.  Nutrition and pathology of weaner pigs: Nutritional strategies to support barrier function in the gastrointestinal tract , 2012 .

[52]  Genome-wide association study on legendre random regression coefficients for the growth and feed intake trajectory on Duroc Boars , 2015, BMC Genetics.

[53]  C. Maltecca,et al.  The use of multiple imputation for the accurate measurements of individual feed intake by electronic feeders. , 2016, Journal of animal science.

[54]  D Gianola,et al.  Reproducing kernel Hilbert spaces regression: a general framework for genetic evaluation. , 2009, Journal of animal science.

[55]  Wangjun Wu,et al.  Dynamic Distribution of the Gut Microbiota and the Relationship with Apparent Crude Fiber Digestibility and Growth Stages in Pigs , 2015, Scientific Reports.

[56]  J. Friedman Greedy function approximation: A gradient boosting machine. , 2001 .

[57]  M. Rajput,et al.  Weaning stress and gastrointestinal barrier development: Implications for lifelong gut health in pigs , 2017, Animal nutrition.

[58]  G. Fitzgerald,et al.  'I. , 2019, Australian journal of primary health.

[59]  G. Morota,et al.  BIG DATA ANALYTICS AND PRECISION ANIMAL AGRICULTURE SYMPOSIUM: Machine learning and data mining advance predictive big data analysis in precision animal agriculture , 2018, Journal of animal science.

[60]  Robert C. Edgar,et al.  BIOINFORMATICS APPLICATIONS NOTE , 2001 .

[61]  이기수,et al.  II. , 1992 .

[62]  Keiichi Suzuki,et al.  Genetic parameters for production traits and measures of residual feed intake in Duroc and Landrace pigs , 2008 .

[63]  J. Estellé,et al.  Phylogenetic network analysis applied to pig gut microbiota identifies an ecosystem structure linked with growth traits , 2016, The ISME Journal.

[64]  Robert A. Edwards,et al.  Quality control and preprocessing of metagenomic datasets , 2011, Bioinform..

[65]  C. Maltecca,et al.  Host contributes to longitudinal diversity of fecal microbiota in swine selected for lean growth , 2018, Microbiome.

[66]  R. C. Macridis A review , 1963 .

[67]  P. Turnbaugh,et al.  Microbial ecology: Human gut microbes associated with obesity , 2006, Nature.

[68]  C. Nyachoti,et al.  Husbandry practices and gut health outcomes in weaned piglets: A review , 2017, Animal nutrition.

[69]  C. Maltecca,et al.  Metagenomic Predictions of Growth and Carcass Traits in Pigs with the Use of Bayesian Alphabet and Machine Learning Methods. , 2018 .