Dissecting Genome-Wide Association Signals for Loss-of-Function Phenotypes in Sorghum Flavonoid Pigmentation Traits

Genome-wide association studies are a powerful method to dissect the genetic basis of traits, although in practice the effects of complex genetic architecture and population structure remain poorly understood. To compare mapping strategies, we dissected the genetic control of flavonoid pigmentation traits in the cereal grass sorghum using high-resolution genotyping-by-sequencing single-nucleotide polymorphism markers. Studying the grain tannin trait, we find that naive general linear models (GLMs) are not able to precisely map tan1-a, a known loss-of-function allele of the Tannin1 gene, with either a small (n = 142) or a large (n = 336) association panel, and that indirect associations limit the mapping of the Tannin1 locus to megabase (Mb) resolution. A GLM that accounts for population structure (Q) or a standard mixed linear model that accounts for kinship (K) can identify tan1-a, whereas a compressed mixed linear model performs worse than the naive GLM. Interestingly, a simple loss-of-function genome scan, which tests for genotype-phenotype covariation only in the putative loss-of-function allele, is able to precisely identify the Tannin1 gene without considering relatedness. We also find that the tan1-a allele can be mapped with gene resolution in a biparental recombinant inbred line family (n = 263) using genotyping-by-sequencing markers, but lower precision in the mapping of vegetative pigmentation traits suggests that consistent gene-level resolution will likely require larger families or multiple recombinant inbred lines. These findings highlight that complex association signals can emerge from even the simplest traits given epistasis and structured alleles, but that gene-resolution mapping of these traits is possible with high marker density and appropriate models.
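The loss-of-function genome scan is described here only in outline, so the following is a minimal sketch of one way such a scan could work, not the authors' implementation. It assumes inbred lines with biallelic markers coded 0/1 (None for missing calls) and a binary phenotype (True when the pigment is present). The function name `lof_scan`, the `max_exceptions` parameter, and the toy data are all hypothetical. For each marker, each allele is treated in turn as the putative loss-of-function allele: carriers must all lack the pigment, while non-carriers are left unconstrained, since other null alleles or epistatic loci could also remove the pigment.

```python
from typing import List, Optional, Tuple


def lof_scan(
    genotypes: List[List[Optional[int]]],
    phenotype: List[bool],
    max_exceptions: int = 0,
) -> List[Tuple[int, Optional[int], int]]:
    """Score each marker for a putative loss-of-function (LOF) allele.

    genotypes[i][j] is the allele (0 or 1, None if missing) of line i at marker j.
    phenotype[i] is True when line i shows the functional trait (pigment present).
    Returns one tuple per marker: (marker index, putative LOF allele or None,
    number of carriers of that allele that lack the pigment).
    """
    n_markers = len(genotypes[0])
    results = []
    for j in range(n_markers):
        best = None  # (null carriers, allele) for the best-supported allele
        for allele in (0, 1):
            null_carriers = 0
            functional_carriers = 0
            for row, has_pigment in zip(genotypes, phenotype):
                if row[j] != allele:
                    continue  # non-carriers and missing calls are ignored
                if has_pigment:
                    functional_carriers += 1
                else:
                    null_carriers += 1
            # A LOF allele should never (or almost never) occur with the
            # functional phenotype, and must be seen in at least one null line.
            if functional_carriers <= max_exceptions and null_carriers > 0:
                candidate = (null_carriers, allele)
                if best is None or candidate > best:
                    best = candidate
        if best is None:
            results.append((j, None, 0))
        else:
            results.append((j, best[1], best[0]))
    return results


# Toy data (hypothetical): six lines, three markers.
phenotype = [True, True, False, False, False, True]
genotypes = [
    [0, 0, 1],
    [0, 1, 1],
    [1, 0, 0],
    [1, 1, 1],
    [0, 1, 1],  # lacks pigment without the marker-0 allele (another cause)
    [0, 0, 1],
]
hits = lof_scan(genotypes, phenotype)
for marker, allele, support in hits:
    print(marker, allele, support)
```

Because only carriers of the candidate allele are scored, a line that lacks the pigment for an unrelated reason (the fifth line above) does not count against the marker. That asymmetry is what distinguishes this kind of scan from a symmetric association test. Population structure is not modeled in this sketch.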
