Novel Bayesian Networks for Genomic Prediction of Developmental Traits in Biomass Sorghum

The ability to connect genetic information between traits over time allow Bayesian networks to offer a powerful probabilistic framework to construct genomic prediction models. In this study, we phenotyped a diversity panel of 869 biomass sorghum (Sorghum bicolor (L.) Moench) lines, which had been genotyped with 100,435 SNP markers, for plant height (PH) with biweekly measurements from 30 to 120 days after planting (DAP) and for end-of-season dry biomass yield (DBY) in four environments. We evaluated five genomic prediction models: Bayesian network (BN), Pleiotropic Bayesian network (PBN), Dynamic Bayesian network (DBN), multi-trait GBLUP (MTr-GBLUP), and multi-time GBLUP (MTi-GBLUP) models. In fivefold cross-validation, prediction accuracies ranged from 0.46 (PBN) to 0.49 (MTr-GBLUP) for DBY and from 0.47 (DBN, DAP120) to 0.75 (MTi-GBLUP, DAP60) for PH. Forward-chaining cross-validation further improved prediction accuracies of the DBN, MTi-GBLUP and MTr-GBLUP models for PH (training slice: 30-45 DAP) by 36.4–52.4% relative to the BN and PBN models. Coincidence indices (target: biomass, secondary: PH) and a coincidence index based on lines (PH time series) showed that the ranking of lines by PH changed minimally after 45 DAP. These results suggest a two-level indirect selection method for PH at harvest (first-level target trait) and DBY (second-level target trait) could be conducted earlier in the season based on ranking of lines by PH at 45 DAP (secondary trait). With the advance of high-throughput phenotyping technologies, our proposed two-level indirect selection framework could be valuable for enhancing genetic gain per unit of time when selecting on developmental traits.

[1]  Samuel B. Fernandes,et al.  Deleterious Mutation Burden and Its Association with Complex Traits in Sorghum (Sorghum bicolor) , 2019, Genetics.

[2]  Girish Chowdhary,et al.  In‐Field Whole‐Plant Maize Architecture Characterized by Subcanopy Rovers and Latent Space Phenotyping , 2019, The Plant Phenome Journal.

[3]  Patrick S. Schnable,et al.  Field‐based robotic phenotyping of sorghum plant architecture using stereo vision , 2018, J. Field Robotics.

[4]  Harkamal Walia,et al.  Utilizing random regression models for genomic prediction of a longitudinal trait derived from high‐throughput phenotyping , 2018, bioRxiv.

[5]  Samuel B. Fernandes,et al.  Leveraging mutational burden for complex trait prediction in sorghum , 2018, bioRxiv.

[6]  L. F. V. Ferrão,et al.  Accurate genomic prediction of Coffea canephora in multiple environments using whole-genome statistical models , 2018, Heredity.

[7]  A. Rasheed,et al.  Fast-Forwarding Genetic Gain. , 2018, Trends in plant science.

[8]  C. T. Guimarães,et al.  Improving accuracies of genomic predictions for drought tolerance in maize by joint modeling of additive and dominance effects in multi-environment trials , 2018, Heredity.

[9]  Samuel B. Fernandes,et al.  Efficiency of multi-trait, indirect, and trait-assisted genomic selection for improvement of biomass sorghum , 2017, Theoretical and Applied Genetics.

[10]  D. Akdemir,et al.  Accuracies of univariate and multivariate genomic prediction models in African cassava , 2017, bioRxiv.

[11]  Jiqiang Guo,et al.  Stan: A Probabilistic Programming Language. , 2017, Journal of statistical software.

[12]  A. Junker,et al.  Genetic variation of growth dynamics in maize (Zea mays L.) revealed through automated non‐invasive phenotyping , 2017, The Plant journal : for cell and molecular biology.

[13]  J. Batley,et al.  Speed breeding: a powerful tool to accelerate crop research and breeding , 2017, bioRxiv.

[14]  M. Gore,et al.  rAmpSeq: Using repetitive sequences for robust genotyping , 2016, bioRxiv.

[15]  P. Schnable,et al.  Genomic prediction contributing to a promising global strategy to turbocharge gene banks , 2016, Nature Plants.

[16]  P. Mermelstein,et al.  Opposite Effects of mGluR1a and mGluR5 Activation on Nucleus Accumbens Medium Spiny Neuron Dendritic Spine Density , 2016, PloS one.

[17]  Carolyn J. Lawrence-Dill,et al.  The Quest for Understanding Phenotypic Variation via Integrated Approaches in the Field Environment1[OPEN] , 2016, Plant Physiology.

[18]  Nadia Shakoor,et al.  A Genomic Resource for the Development, Improvement, and Exploitation of Sorghum for Bioenergy , 2016, Genetics.

[19]  Osval A. Montesinos-López,et al.  A Genomic Bayesian Multi-trait and Multi-environment Model , 2016, G3: Genes, Genomes, Genetics.

[20]  M. Balestre,et al.  Genomic selection to resistance to Stenocarpella maydis in maize lines using DArTseq markers , 2016, BMC Genetics.

[21]  Paola Sebastiani,et al.  Learning Bayesian Networks from Correlated Data , 2016, Scientific Reports.

[22]  M. Balestre,et al.  Inclusion of Dominance Effects in the Multivariate GBLUP Model , 2016, PloS one.

[23]  Brian L Browning,et al.  Genotype Imputation with Millions of Reference Samples. , 2016, American journal of human genetics.

[24]  Robert D. Finn,et al.  The Pfam protein families database: towards a more sustainable future , 2015, Nucleic Acids Res..

[25]  D. Gianola,et al.  Do Molecular Markers Inform About Pleiotropy? , 2015, Genetics.

[26]  J. Klápště,et al.  A comparison of genomic selection models across time in interior spruce (Picea engelmannii × glauca) using unordered SNP imputation methods , 2015, Heredity.

[27]  Geoffrey E. Hinton,et al.  Deep Learning , 2015, Nature.

[28]  N. Loman,et al.  A complete bacterial genome assembled de novo using only nanopore sequencing data , 2015, Nature Methods.

[29]  M. Sorrells,et al.  Perspectives for Genomic Selection Applications and Research in Plants , 2015 .

[30]  Ryan F. McCormick,et al.  Energy sorghum--a genetic model for the design of C4 grass bioenergy crops. , 2014, Journal of experimental botany.

[31]  M. Stephens,et al.  fastSTRUCTURE: Variational Inference of Population Structure in Large SNP Data Sets , 2014, Genetics.

[32]  Robert J. Elshire,et al.  TASSEL-GBS: A High Capacity Genotyping by Sequencing Analysis Pipeline , 2014, PloS one.

[33]  F. De Filippis,et al.  A Selected Core Microbiome Drives the Early Stages of Three Popular Italian Cheese Manufactures , 2014, PloS one.

[34]  Xia Jiang,et al.  Modeling the Altered Expression Levels of Genes on Signaling Pathways in Tumors As Causal Bayesian Networks , 2014, Cancer informatics.

[35]  Andrew Gelman,et al.  The No-U-turn sampler: adaptively setting path lengths in Hamiltonian Monte Carlo , 2011, J. Mach. Learn. Res..

[36]  O. Serang,et al.  SNP genotyping allows an in-depth characterisation of the genome of sugarcane and other complex autopolyploids , 2013, Scientific Reports.

[37]  Shizhong Xu,et al.  Genetic Mapping and Genomic Selection Using Recombination Breakpoint Data , 2013, Genetics.

[38]  Jun Li,et al.  Whole-genome sequencing reveals untapped genetic potential in Africa’s indigenous cereal crop sorghum , 2013, Nature Communications.

[39]  D. Gianola Priors in Whole-Genome Regression: The Bayesian Alphabet Returns , 2013, Genetics.

[40]  Christopher M. Bishop,et al.  Model-based machine learning , 2013, Philosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences.

[41]  M. Calus,et al.  Whole-Genome Regression and Prediction Methods Applied to Plant and Animal Breeding , 2013, Genetics.

[42]  Mark E. Borsuk,et al.  Using Bayesian networks to discover relations between genes, environment, and disease , 2013, BioData Mining.

[43]  C. T. Hash,et al.  Population genomic and genome-wide association studies of agroclimatic traits in sorghum , 2012, Proceedings of the National Academy of Sciences.

[44]  Hua Xu,et al.  Genetic studies of complex human diseases: Characterizing SNP-disease associations using Bayesian networks , 2012, BMC Systems Biology.

[45]  Wen-Hsiung Li,et al.  MicroRNA 3' end nucleotide modification patterns and arm selection preference in liver tissues , 2012, BMC Systems Biology.

[46]  Jean-Luc Jannink,et al.  Multiple-Trait Genomic Selection Methods Increase Genetic Value Prediction Accuracy , 2012, Genetics.

[47]  David Levine,et al.  A high-performance computing toolset for relatedness and principal component analysis of SNP data , 2012, Bioinform..

[48]  M. Wolak nadiv : an R package to create relatedness matrices for estimating non‐additive genetic variances in animal models , 2012 .

[49]  Kevin P. Murphy,et al.  Machine learning - a probabilistic perspective , 2012, Adaptive computation and machine learning series.

[50]  Katherine E. Guill,et al.  The relationship between parental genetic or phenotypic divergence and progeny variation in the maize nested association mapping population , 2011, Heredity.

[51]  R. Jewkes,et al.  Perceptions and Experiences of Research Participants on Gender-Based Violence Community Based Survey: Implications for Ethical Guidelines , 2012, PloS one.

[52]  Steven L Salzberg,et al.  Fast gapped-read alignment with Bowtie 2 , 2012, Nature Methods.

[53]  Kanti V. Mardia,et al.  Bayesian Methods in Structural Bioinformatics , 2012 .

[54]  José Crossa,et al.  Genomic Prediction of Breeding Values when Modeling Genotype × Environment Interaction using Pedigree and Dense Molecular Markers , 2012 .

[55]  Oliver Serang,et al.  Efficient Exact Maximum a Posteriori Computation for Bayesian SNP Genotyping in Polyploids , 2012, PloS one.

[56]  Jeffrey B. Endelman,et al.  Ridge Regression and Other Kernels for Genomic Selection with R Package rrBLUP , 2011 .

[57]  S. Carpenter,et al.  Solutions for a cultivated planet , 2011, Nature.

[58]  M. Calus,et al.  Accuracy of multi-trait genomic selection using different methods , 2011, Genetics Selection Evolution.

[59]  M. Blaxter,et al.  Genome-wide genetic marker discovery and genotyping using next-generation sequencing , 2011, Nature Reviews Genetics.

[60]  Robert J. Elshire,et al.  A Robust, Simple Genotyping-by-Sequencing (GBS) Approach for High Diversity Species , 2011, PloS one.

[61]  J. Hamblin,et al.  Breeding Common Bean for Yield in Mixtures , 2011 .

[62]  Wilfred Vermerris,et al.  Survey of genomics approaches to improve bioenergy traits in maize, sorghum and sugarcane. , 2011, Journal of integrative plant biology.

[63]  J. Holland,et al.  Estimating and Interpreting Heritability for Plant Breeding: An Update , 2010 .

[64]  Daniel Gianola,et al.  Additive Genetic Variability and the Bayesian Alphabet , 2009, Genetics.

[65]  P. VanRaden,et al.  Efficient methods to compute genomic predictions. , 2008, Journal of dairy science.

[66]  Virginia Walbot,et al.  Translational Genomics for Bioenergy Production from Fuelstock Grasses: Maize as the Model Species , 2007, The Plant Cell Online.

[67]  R. Bernardo,et al.  Prospects for genomewide selection for quantitative traits in maize , 2007 .

[68]  S Czajka,et al.  Analyzing Multi‐environment Variety Trials Using Randomization‐Derived Mixed Models , 2005, Biometrics.

[69]  M. Goddard,et al.  Prediction of total genetic value using genome-wide dense marker maps. , 2001, Genetics.

[70]  M. Lynch,et al.  Genetics and Analysis of Quantitative Traits , 1996 .

[71]  David B. Dunson,et al.  Bayesian Data Analysis , 2010 .

[72]  R. L. Quaas,et al.  Multiple Trait Evaluation Using Relatives' Records , 1976 .

[73]  J. Harlan,et al.  A Simplified Classification of Cultivated Sorghum 1 , 1972 .

[74]  H. D. Patterson,et al.  Recovery of inter-block information when block sizes are unequal , 1971 .