Sequencing of 15,622 gene-bearing BACs reveals new features of the barley genome

Barley (Hordeum vulgare L.) possesses a large and highly repetitive genome of 5.1 Gb that has hindered the development of a complete sequence. In 2012, the International Barley Sequencing Consortium released a resource integrating whole-genome shotgun sequences with a physical and genetic framework. However, because only 6,278 BACs in the physical map were sequenced, fine-scale structural detail was limited. To gain access to the gene-containing portion of the barley genome at high resolution, we identified and sequenced 15,622 BACs representing the minimal tiling path of 72,052 physically mapped gene-bearing BACs. This generated about 1.7 Gb of genomic sequence containing 17,386 annotated barley genes. Exploration of the sequenced BACs revealed that, although the distal ends of chromosomes contain most of the gene-enriched BACs and are characterized by high rates of recombination, there are also gene-dense regions with suppressed recombination. Knowledge of these deviant regions is relevant to trait introgression, genome-wide association studies, genomic selection model development and map-based cloning strategies. Sequences and their gene and SNP annotations can be accessed and exported via http://harvest-web.org/hweb/utilmenu.wc or through the software HarvEST:Barley (available for download from harvest.ucr.edu). In HarvEST:Barley, we have implemented a synteny viewer between barley and Aegilops tauschii to aid comparative genome analysis.
