Upgrading short-read animal genome assemblies to chromosome level using comparative genomics and a universal probe set

Most recent initiatives to sequence and assemble new species' genomes de novo fail to achieve the ultimate endpoint of producing contigs that each represent one whole chromosome. Even the best-assembled genomes (using contemporary technologies) consist of subchromosomal-sized scaffolds. To circumvent this problem, we developed a novel approach that combines computational algorithms for merging scaffolds into chromosomal fragments, PCR-based scaffold verification, and physical mapping to chromosomes. Multigenome-alignment-guided probe selection led to the development of a set of universal avian BAC clones that permit rapid anchoring of multiple scaffolds to chromosomes in all avian genomes. As proof of principle, we assembled the genomes of the pigeon (Columba livia) and the peregrine falcon (Falco peregrinus) to chromosome levels comparable, in continuity, to avian reference genomes. Both species are of interest for breeding, cultural, food, and/or environmental reasons. The pigeon has a typical avian karyotype (2n = 80), while the falcon karyotype (2n = 50) is highly rearranged compared to that of the avian ancestor. Using chromosome breakpoint data, we established that avian interchromosomal breakpoints occur in regions with a low density of conserved noncoding elements (CNEs) and that chromosomal fission sites are further limited to long CNE "deserts." This is consistent with fission being the rarest type of rearrangement in avian genome evolution. High-throughput multiple hybridization and rapid capture strategies using the current BAC set provide the basis for assembling the genomes of numerous avian (and possibly other reptilian) species, while the overall strategy for scaffold assembly and mapping provides the basis for an approach that could be applied to any animal genome, provided metaphases can be generated.
