Chromosome‐level genome assembly of burbot (Lota lota) provides insights into the evolutionary adaptations in freshwater

The burbot (Lota lota) is the only member of the order Gadiformes adapted solely to freshwater. This species has the widest longitudinal range among freshwater fish worldwide. Burbot serves as a good model for studies on adaptive genome evolution from marine to freshwater environments. However, a high‐quality reference genome of burbot has not yet been released. Here, the first chromosome‐level genome of burbot was constructed using PacBio long sequencing and Hi‐C technology. A total of 95.24 Gb polished PacBio sequences were generated, and the preliminary genome assembly was 575.83 Mb in size with a contig N50 size of 2.15 Mb. The assembled sequences were anchored to 22 pseudochromosomes by using Hi‐C data. The final assembled genome after Hi‐C correction was 575.92 Mb, with a contig N50 of 2.01 Mb and a scaffold N50 of 22.10 Mb. A total of 22,067 protein‐coding genes were predicted, 94.82% of which were functionally annotated. Phylogenetic analyses indicated that burbot diverged with the Atlantic cod approximately 43.8 million years ago. In addition, 377 putative genes that appear to be under positive selection in burbot were identified. These positively selected genes might be involved in the adaptation to the freshwater environment. These genome data provide an invaluable resource for the ecological and evolutionary study of the order Gadiformes.

[1]  S. Lien,et al.  A Nanopore Based Chromosome-Level Assembly Representing Atlantic Cod from the Celtic Sea , 2019, bioRxiv.

[2]  Nicolás Bellora,et al.  Comprehensive phylogeny of ray-finned fishes (Actinopterygii) based on transcriptomic and genomic data , 2018, Proceedings of the National Academy of Sciences.

[3]  M. Říha,et al.  Assessment of burbot Lota lota (L. 1758) population sustainability in central European reservoirs. , 2018, Journal of fish biology.

[4]  A. Nederbragt,et al.  Genomic architecture of haddock (Melanogrammus aeglefinus) shows expansions of innate immune genes and short tandem repeats , 2018, BMC Genomics.

[5]  Sudhir Kumar,et al.  TimeTree: A Resource for Timelines, Timetrees, and Divergence Times. , 2017, Molecular biology and evolution.

[6]  O. Taşbozan,et al.  Fatty Acids in Fish , 2017 .

[7]  S. Wuertz,et al.  Induction of gonadal maturation at different temperatures in burbot Lota lota. , 2016, Journal of fish biology.

[8]  Reinhold Hanel,et al.  Evolution of the immune system influences speciation rates in teleost fishes , 2016, Nature Genetics.

[9]  Jeffrey T Leek,et al.  Transcript-level expression analysis of RNA-seq experiments with HISAT, StringTie and Ballgown , 2016, Nature Protocols.

[10]  James R. Knight,et al.  An improved genome assembly uncovers prolific tandem repeats in Atlantic cod , 2016, bioRxiv.

[11]  Evgeny M. Zdobnov,et al.  BUSCO: assessing genome assembly and annotation completeness with single-copy orthologs , 2015, Bioinform..

[12]  Neva C. Durand,et al.  A 3D Map of the Human Genome at Kilobase Resolution Reveals Principles of Chromatin Looping , 2014, Cell.

[13]  Christina A. Cuomo,et al.  Pilon: An Integrated Tool for Comprehensive Microbial Variant Detection and Genome Assembly Improvement , 2014, PloS one.

[14]  Alexandros Stamatakis,et al.  RAxML version 8: a tool for phylogenetic analysis and post-analysis of large phylogenies , 2014, Bioinform..

[15]  Andrew C. Adey,et al.  Chromosome-scale scaffolding of de novo genome assemblies based on chromatin interactions , 2013, Nature Biotechnology.

[16]  Jianying Yuan,et al.  Estimation of genomic characteristics by analyzing k-mer frequency in de novo genome projects , 2013, 1308.2012.

[17]  D. Eick Habitat preferences of the burbot (Lota lota) from the River Elbe: An experimental approach , 2013 .

[18]  Aaron A. Klammer,et al.  Nonhybrid, finished microbial genome assemblies from long-read SMRT sequencing data , 2013, Nature Methods.

[19]  Glenn Tesler,et al.  Mapping single molecule sequencing reads using basic local alignment with successive refinement (BLASR): application and theory , 2012, BMC Bioinformatics.

[20]  I. Goldman,et al.  Mechanisms of membrane transport of folates into cells and across epithelia. , 2011, Annual review of nutrition.

[21]  B. Langmead,et al.  Aligning Short Sequencing Reads with Bowtie , 2010, Current protocols in bioinformatics.

[22]  M. Stapanian,et al.  Recruitment of burbot (Lota lota L.) in Lake Erie: an empirical modelling approach , 2010 .

[23]  J. Jackson,et al.  Worldwide status of burbot and conservation measures , 2010 .

[24]  M. Washburn,et al.  Distinct modes of regulation of the Uch37 deubiquitinating enzyme in the proteasome and in the Ino80 chromatin-remodeling complex. , 2008, Molecular cell.

[25]  Sofia M. C. Robb,et al.  MAKER: an easy-to-use annotation pipeline designed for emerging model organism genomes. , 2007, Genome research.

[26]  Sam W. Lee,et al.  Hzf Determines Cell Survival upon Genotoxic Stress by Modulating p53 Transactivation , 2007, Cell.

[27]  Ziheng Yang PAML 4: phylogenetic analysis by maximum likelihood. , 2007, Molecular biology and evolution.

[28]  Zhao Xu,et al.  LTR_FINDER: an efficient tool for the prediction of full-length LTR retrotransposons , 2007, Nucleic Acids Res..

[29]  R. Smith,et al.  Effects of solar UV radiation on aquatic ecosystems and interactions with climate change , 2007, Photochemical & photobiological sciences : Official journal of the European Photochemistry Association and the European Society for Photobiology.

[30]  Burkhard Morgenstern,et al.  AUGUSTUS: ab initio prediction of alternative transcripts , 2006, Nucleic Acids Res..

[31]  Jeffery P. Demuth,et al.  CAFE: a computational tool for the study of gene family evolution , 2006, Bioinform..

[32]  A. Perretti,et al.  A mitogenic view on the evolutionary history of the Holarctic freshwater gadoid, burbot (Lota lota) , 2005, Molecular ecology.

[33]  J. Jurka,et al.  Repbase Update, a database of eukaryotic repetitive elements , 2005, Cytogenetic and Genome Research.

[34]  Jonathan Pevsner,et al.  Basic Local Alignment Search Tool (BLAST) , 2005 .

[35]  Sean R. Eddy,et al.  Rfam: annotating non-coding RNAs in complete genomes , 2004, Nucleic Acids Res..

[36]  D. Valentine,et al.  Omega-3 fatty acids in cellular membranes: a unified concept. , 2004, Progress in lipid research.

[37]  Robert C. Edgar,et al.  MUSCLE: multiple sequence alignment with high accuracy and high throughput. , 2004, Nucleic acids research.

[38]  Darren A. Natale,et al.  The COG database: an updated version includes eukaryotes , 2003, BMC Bioinformatics.

[39]  N. Rosenthal,et al.  Developmental basis of evolutionary digit loss in the Australian lizard Hemiergis. , 2003, Journal of experimental zoology. Part B, Molecular and developmental evolution.

[40]  Michael J. Sanderson,et al.  R8s: Inferring Absolute Rates of Molecular Evolution, Divergence times in the Absence of a Molecular Clock , 2003, Bioinform..

[41]  J. Vuorenmaa,et al.  Effects of eutrophication on fish and fisheries in Finnish lakes: a survey based on random sampling , 1999 .

[42]  S. Karlin,et al.  Prediction of complete gene structures in human genomic DNA. , 1997, Journal of molecular biology.

[43]  S. Eddy,et al.  tRNAscan-SE: a program for improved detection of transfer RNA genes in genomic sequence. , 1997, Nucleic acids research.

[44]  E. Bergersen,et al.  Winter movements of burbot (Lota lota) during an extreme drawdown in Bull Lake, Wyoming, USA , 1993 .

[45]  P. Maitland,et al.  Practical conservation of British fishes: current action on six declining species , 1990 .

[46]  Z. Jara Some aspects of excretion and osmoregulation of fishes , 1988 .

[47]  K. Pivnička Morphological Variation in the Burhot (Lota lota) and Recognition of the Subspecies: A Review , 1970 .

[48]  D. Finnegan Convergence in Diet and Morphology in Marine and Freshwater Cottoid Fishes , 2017 .

[49]  Chon-Kit Kenneth Chan,et al.  Analysis of RNA-Seq Data Using TopHat and Cufflinks. , 2016, Methods in molecular biology.

[50]  BIOINFORMATICS APPLICATIONS NOTE , 2005 .

[51]  G. Barlow,et al.  Fishes of the world , 2004, Environmental Biology of Fishes.

[52]  Maria Jesus Martin,et al.  The SWISS-PROT protein knowledgebase and its supplement TrEMBL in 2003 , 2003, Nucleic Acids Res..

[53]  Peer Bork,et al.  Systematic identification of novel protein domain families associated with nuclear functions. , 2002, Genome research.

[54]  Susumu Goto,et al.  KEGG: Kyoto Encyclopedia of Genes and Genomes , 2000, Nucleic Acids Res..

[55]  G. Benson,et al.  Tandem repeats finder: a program to analyze DNA sequences. , 1999, Nucleic acids research.

[56]  H. Lehtonen Winter biology of burbot (Lota lota L.) , 1998 .

[57]  L. P. Schultz,et al.  Contribution to the ichthyology of Alaska, with descriptions of two new fishes , 1941 .

[58]  W. Illiam,et al.  OCCASIONAL PAPERS OF THE MUSEUM OF ZOOLOGY , 2007 .