Applications of Graphical Models in Quantitative Genetics and Genomics

In this chapter, we provide a brief introduction about graphical models, with an emphasis on Bayesian networks, and discuss some of their applications in genetics and genomics studies with agricultural and livestock species. First, some key definitions regarding stochastic graphical models are provided, as well as basic principles of inference related to graphical structure and model parameters. Next is a discussion of some examples of applications, which include prediction of complex traits using genomic information or other correlated traits as well as the investigation of the flow of information from DNA polymorphisms to endpoint phenotypes, including intermediate phenotypes such as gene expression. A first example with prediction refers to the forecasting of total egg production in quails using early expressed traits (such as weekly body weight, partial egg production, and egg quality traits) as explanatory variables to support decision making (e.g., earlier culling decisions) in production/breeding systems. An additional example uses genomic information for the estimation of genetic merit of selection candidates for genetic improvement of economically important traits. An example with causal inference deals with the network underlying carcass fat deposition and muscularity in pigs by jointly modeling phenotypic, genotypic, and transcriptomic data. Some additional applications of Bayesian networks and other graphical model techniques are highlighted as well, including multitrait quantitative trait loci (QTL) analysis and structural equation models with latent variables. It is shown that graphical models such as Bayesian networks offer a powerful and insightful approach both for prediction and for causal inference, with a myriad of applications in the areas of genetics and genomics, and the study of complex phenotypic traits in agriculture.

[1]  David V Conti,et al.  Commentary: the concept of 'Mendelian Randomization'. , 2004, International journal of epidemiology.

[2]  M. Calus,et al.  Whole-Genome Regression and Prediction Methods Applied to Plant and Animal Breeding , 2013, Genetics.

[3]  R. L. Quaas,et al.  Multiple Trait Evaluation Using Relatives' Records , 1976 .

[4]  K. Weigel,et al.  The Causal Meaning of Genomic Predictors and How It Affects Construction and Comparison of Genome-Enabled Selection Models , 2013, Genetics.

[5]  Robin Thompson,et al.  Analysis of Litter Size and Average Litter Weight in Pigs Using a Recursive Model , 2007, Genetics.

[6]  D Gianola,et al.  Inferring relationships between somatic cell score and milk yield using simultaneous and recursive models. , 2007, Journal of dairy science.

[7]  Daniel Gianola,et al.  Quantitative Genetic Models for Describing Simultaneous and Recursive Relationships Between Phenotypes This article is dedicated to Arthur B. Chapman, teacher and mentor of numerous animal breeding students and disciple and friend of Sewall Wright. , 2004, Genetics.

[8]  David R. Anderson,et al.  Model selection and multimodel inference : a practical information-theoretic approach , 2003 .

[9]  Xiao-Lin Wu,et al.  Inferring causal phenotype networks using structural equation models , 2011, Genetics Selection Evolution.

[10]  Christine Sinoquet,et al.  Probabilistic graphical models for genetics, genomics and postgenomics , 2014 .

[11]  Xiao-Lin Wu,et al.  Modeling relationships between calving traits: a comparison between standard and recursive mixed models , 2010, Genetics Selection Evolution.

[12]  Y. Liu,et al.  A transcript profiling approach reveals the zinc finger transcription factor ZNF191 is a pleiotropic factor , 2009, BMC Genomics.

[13]  T. Silander,et al.  Association analyses of the MAS-QTL data set using grammar, principal components and Bayesian network methodologies , 2011, BMC proceedings.

[14]  Guilherme J M Rosa,et al.  Searching for Recursive Causal Structures in Multivariate Quantitative Genetics Mixed Models , 2010, Genetics.

[15]  Constantin F. Aliferis,et al.  The max-min hill-climbing Bayesian network structure learning algorithm , 2006, Machine Learning.

[16]  J. Pearl Causality: Models, Reasoning and Inference , 2000 .

[17]  Paola Sebastiani,et al.  Genetic dissection and prognostic modeling of overt stroke in sickle cell anemia , 2005, Nature Genetics.

[18]  W. Kruijer,et al.  Genotype-phenotype modeling considering intermediate level of biological variation: a case study involving sensory traits, metabolites and QTLs in ripe tomatoes. , 2015, Molecular bioSystems.

[19]  B. Shipley Cause and correlation in biology , 2000 .

[20]  K. Sachs,et al.  Causal Protein-Signaling Networks Derived from Multiparameter Single-Cell Data , 2005, Science.

[21]  G. Rosa,et al.  Quantitative trait loci mapping in an F2 Duroc x Pietrain resource population: I. Growth traits. , 2008, Journal of animal science.

[22]  Daniel Gianola,et al.  Additive Genetic Variability and the Bayesian Alphabet , 2009, Genetics.

[23]  B. J. Hayes,et al.  Genomic selection: Genomic selection , 2007 .

[24]  D Gianola,et al.  Predictive ability of subsets of single nucleotide polymorphisms with and without parent average in US Holsteins. , 2010, Journal of dairy science.

[25]  J. Steibel,et al.  Searching for causal networks involving latent variables in complex traits: Application to growth, carcass, and meat quality traits in pigs. , 2015, Journal of animal science.

[26]  D Gianola,et al.  An assessment of linkage disequilibrium in Holstein cattle using a Bayesian network. , 2012, Journal of animal breeding and genetics = Zeitschrift fur Tierzuchtung und Zuchtungsbiologie.

[27]  L R Schaeffer,et al.  Relationships between milk yield and somatic cell score in Canadian Holsteins from simultaneous and recursive random regression models. , 2010, Journal of dairy science.

[28]  Keith Shockley,et al.  Structural Model Analysis of Multiple Quantitative Traits , 2006, PLoS genetics.

[29]  B. Yandell,et al.  CAUSAL GRAPHICAL MODELS IN SYSTEMS GENETICS: A UNIFIED FRAMEWORK FOR JOINT INFERENCE OF CAUSAL NETWORK AND GENETIC ARCHITECTURE FOR CORRELATED PHENOTYPES. , 2010, The annals of applied statistics.

[30]  Radhakrishnan Nagarajan,et al.  Bayesian Networks in R , 2013 .

[31]  Radhakrishnan Nagarajan,et al.  Bayesian Networks in R: with Applications in Systems Biology , 2013 .

[32]  D. Gianola,et al.  Exploration of relationships between claw disorders and milk yield in Holstein cows via recursive linear and threshold models. , 2008, Journal of dairy science.

[33]  Kent A Weigel,et al.  Comparison of classification methods for detecting associations between SNPs and chick mortality , 2009, Genetics Selection Evolution.

[34]  G. Rosa,et al.  Breeding and Genetics Symposium: inferring causal effects from observational data in livestock. , 2013, Journal of animal science.

[35]  B. Valente,et al.  Exploring causal networks of bovine milk fatty acids in a multivariate mixed model context , 2014, Genetics Selection Evolution.

[36]  D. Gianola,et al.  Exploration of lagged relationships between mastitis and milk yield in dairycows using a Bayesian structural equation Gaussian-threshold model , 2008, Genetics Selection Evolution.

[37]  J. Castle,et al.  An integrative genomics approach to infer causal associations between gene expression and disease , 2005, Nature Genetics.

[38]  K. Weigel,et al.  Exploring Biological Relationships Between Calving Traits in Primiparous Cattle with a Bayesian Recursive Model , 2009, Genetics.

[39]  D. A. Kenny,et al.  Correlation and Causation , 1937, Wilmott.

[40]  Juan P. Steibel,et al.  Exploring causal networks underlying fat deposition and muscularity in pigs through the integration of phenotypic, genotypic and transcriptomic data , 2015, BMC Systems Biology.

[41]  Guilherme J M Rosa,et al.  Using multiple regression, Bayesian networks and artificial neural networks for prediction of total egg production in European quails based on earlier expressed phenotypes. , 2014, Poultry science.

[42]  J. Steibel,et al.  Genome-Wide Linkage Analysis of Global Gene Expression in Loin Muscle Tissue Identifies Candidate Genes in Pigs , 2011, PloS one.

[43]  D. Gianola,et al.  A structural equation model for describing relationships between somatic cell score and milk yield in dairy goats. , 2006, Journal of animal science.

[44]  B. Valente,et al.  Searching for phenotypic causal networks involving complex traits: an application to European quail , 2011, Genetics Selection Evolution.

[45]  D Gianola,et al.  Reproducing kernel Hilbert spaces regression: a general framework for genetic evaluation. , 2009, Journal of animal science.

[46]  D. Gianola,et al.  Inferring relationships between health and fertility in Norwegian Red cows using recursive models. , 2009, Journal of dairy science.

[47]  F. V. van Eeuwijk,et al.  A New Method to Infer Causal Phenotype Networks Using QTL and Phenotypic Information , 2014, PloS one.

[48]  D. Balding,et al.  Improving the efficiency of genomic selection , 2013, Statistical applications in genetics and molecular biology.

[49]  R. Bates,et al.  Quantitative trait locus mapping in an F2 Duroc x Pietrain resource population: II. Carcass and meat quality traits. , 2008, Journal of animal science.

[50]  Michal Linial,et al.  Using Bayesian Networks to Analyze Expression Data , 2000, J. Comput. Biol..

[51]  T. Haavelmo The Statistical Implications of a System of Simultaneous Equations , 1943 .

[52]  Kenneth A. Bollen,et al.  Structural Equations with Latent Variables , 1989 .