Wheat Panache: A pangenome graph database representing presence–absence variation across sixteen bread wheat genomes

Bread wheat (Triticum aestivum L.) is one of humanity's most important staple crops, characterized by a large and complex genome with a high level of gene presence–absence variation (PAV) between cultivars, hampering genomic approaches for crop improvement. With the growing global population and the increasing impact of climate change on crop yield, there is an urgent need to apply genomic approaches to accelerate wheat breeding. With recent advances in DNA sequencing technology, a growing number of high‐quality reference genomes are becoming available, reflecting the genetic content of a diverse range of cultivars. However, information on the presence or absence of genomic regions has been hard to visualize and interrogate because of the size of these genomes and the lack of suitable bioinformatics tools. To address this limitation, we have produced a wheat pangenome graph maintained within an online database to facilitate interrogation and comparison of wheat cultivar genomes. The database allows users to visualize regions of the pangenome to assess PAV between bread wheat genomes.

[1]  Erik K. Garrison,et al.  Unbiased pangenome graphs , 2022, bioRxiv.

[2]  Jordan M. Eizenga,et al.  Pangenomics enables genotyping of known structural variants in 5202 diverse genomes , 2021, Science.

[3]  S. Nahnsen,et al.  ODGI: understanding pangenome graphs , 2021, bioRxiv.

[4]  J. Keilwagen,et al.  Detecting major introgressions in wheat and their putative origins using coverage analysis , 2021, Scientific Reports.

[5]  Michael S. Barker,et al.  Modelling of gene loss propensity in the pangenomes of three Brassica species suggests different mechanisms between polyploids and diploids , 2021, Plant biotechnology journal.

[6]  P. Bayer,et al.  The pangenome of banana highlights differences between genera and genomes , 2021, The plant genome.

[7]  Nathan P. Hendricks,et al.  Decreased wheat production in the USA from climate change driven by yield losses rather than crop abandonment , 2021, PloS one.

[8]  A. Rathore,et al.  Sorghum Pan-Genome Explores the Functional Utility for Genomic-Assisted Breeding to Accelerate the Genetic Gain , 2021, Frontiers in Plant Science.

[9]  Matthieu G. Conte,et al.  Panache: a web browser-based viewer for linearized pangenomes , 2021, bioRxiv.

[10]  Bernardo J. Clavijo,et al.  Multiple wheat genomes reveal global variation in modern breeding , 2020, Nature.

[11]  Trevor W. Rife,et al.  The Aegilops ventricosa 2NvS segment in bread wheat: cytology, genomics and breeding , 2020, Theoretical and Applied Genetics.

[12]  Joseph L. Gage,et al.  A Maize Practical Haplotype Graph Leverages Diverse NAM Assemblies , 2020 .

[13]  J. Batley,et al.  Plant pan-genomes are the new reference , 2020, Nature Plants.

[14]  Chong Chu,et al.  The design and construction of reference pangenome graphs with minigraph , 2020, Genome Biology.

[15]  J. Batley,et al.  Trait associations in the pangenome of pigeon pea (Cajanus cajan) , 2020, Plant biotechnology journal.

[16]  Qingyong Yang,et al.  Eight high-quality genomes reveal pan-genome architecture and ecotype differentiation of Brassica napus , 2020, Nature Plants.

[17]  J. Batley,et al.  Pangenomics Comes of Age: From Bacteria to Plant and Animal Applications. , 2019, Trends in genetics : TIG.

[18]  Peter J. Bradbury,et al.  A sorghum Practical Haplotype Graph facilitates genome-wide imputation and cost-effective genomic prediction , 2019, bioRxiv.

[19]  Glenn Hickey,et al.  Genotyping structural variants in pangenome graphs using the vg toolkit , 2019, Genome Biology.

[20]  Jonathan D. G. Jones,et al.  Shifting the limits in wheat research and breeding using a fully annotated reference genome , 2018, Science.

[21]  Heng Li,et al.  Minimap2: pairwise alignment for nucleotide sequences , 2017, Bioinform..

[22]  Steven L Salzberg,et al.  The first near-complete assembly of the hexaploid bread wheat genome, Triticum aestivum , 2017, bioRxiv.

[23]  C. K. Chan,et al.  The pangenome of hexaploid bread wheat , 2017, The Plant journal : for cell and molecular biology.

[24]  D. Edwards,et al.  SNP Discovery Using a Pangenome: Has the Single Reference Approach Become Obsolete? , 2017, Biology.

[25]  C. K. Chan,et al.  The pangenome of an agronomically important crop plant Brassica oleracea , 2016, Nature Communications.

[26]  Suzanna E Lewis,et al.  JBrowse: a dynamic web platform for genome visualization and analysis , 2016, Genome Biology.

[27]  Brian D. Ondov,et al.  Mash: fast genome and metagenome distance estimation using MinHash , 2015, Genome Biology.

[28]  Mark B. Schultz,et al.  Bandage: interactive visualization of de novo genome assemblies , 2015, bioRxiv.

[29]  J. Batley,et al.  A chromosome-based draft sequence of the hexaploid bread wheat (Triticum aestivum) genome , 2014, Science.

[30]  Adam Skarshewski,et al.  Sequencing wheat chromosome arm 7BS delimits the 7BS/4AL translocation and reveals homoeologous gene conservation , 2012, Theoretical and Applied Genetics.

[31]  J. Batley,et al.  Sequencing and assembly of low copy and genic regions of isolated Triticum aestivum chromosome arm 7DS. , 2011, Plant biotechnology journal.

[32]  Steven J. M. Jones,et al.  Circos: an information aesthetic for comparative genomics. , 2009, Genome research.

[33]  Maureen J Donlin,et al.  Using the Generic Genome Browser (GBrowse) , 2007, Current protocols in bioinformatics.

[34]  S. Wyman,et al.  BIOINFORMATICS APPLICATIONS NOTE , 2001 .

[35]  Identification and characterization of more than 4 million intervarietal SNPs across the group 7 chromosomes of bread wheat , 2022 .