Natural variation in genome architecture among 205 Drosophila melanogaster Genetic Reference Panel lines.

The Drosophila melanogaster Genetic Reference Panel (DGRP) is a community resource of 205 sequenced inbred lines, derived to improve our understanding of the effects of naturally occurring genetic variation on molecular and organismal phenotypes. We used an integrated genotyping strategy to identify 4,853,802 single nucleotide polymorphisms (SNPs) and 1,296,080 non-SNP variants. Our molecular population genomic analyses show higher deletion than insertion mutation rates and stronger purifying selection on deletions. Weaker selection on insertions than deletions is consistent with our observed distribution of genome size determined by flow cytometry, which is skewed toward larger genomes. Insertion/deletion and single nucleotide polymorphisms are positively correlated with each other and with local recombination, suggesting that their nonrandom distributions are due to hitchhiking and background selection. Our cytogenetic analysis identified 16 polymorphic inversions in the DGRP. Common inverted and standard karyotypes are genetically divergent and account for most of the variation in relatedness among the DGRP lines. Intriguingly, variation in genome size and many quantitative traits are significantly associated with inversions. Approximately 50% of the DGRP lines are infected with Wolbachia, and four lines have germline insertions of Wolbachia sequences, but effects of Wolbachia infection on quantitative traits are rarely significant. The DGRP complements ongoing efforts to functionally annotate the Drosophila genome. Indeed, 15% of all D. melanogaster genes segregate for potentially damaged proteins in the DGRP, and genome-wide analyses of quantitative traits identify novel candidate genes. The DGRP lines, sequence data, genotypes, quality scores, phenotypes, and analysis and visualization tools are publicly available.
