A draft reference genome of the red abalone, Haliotis rufescens, for conservation genomics

Abstract Red abalone, Haliotis rufescens, are herbivorous marine gastropods that primarily feed on kelp. They are the largest and longest-lived of abalone species with a range distribution in North America from central Oregon, United States, to Baja California, MEX. Recently, red abalone have been in decline as a consequence of overharvesting, disease, and climate change, resulting in the closure of the commercial fishery in the 1990s and the recreational fishery in 2018. Protecting this ecologically and economically important species requires an understanding of their current population dynamics and connectivity. Here, we present a new red abalone reference genome as part of the California Conservation Genomics Project (CCGP). Following the CCGP genome strategy, we used Pacific Biosciences HiFi long reads and Dovetail Omni-C data to generate a scaffold-level assembly. The assembly comprises 616 scaffolds for a total size of 1.3 Gb, a scaffold N50 of 45.7 Mb, and a BUSCO complete score of 97.3%. This genome represents a significant improvement over a previous assembly and will serve as a powerful tool for investigating seascape genomic diversity, local adaptation to temperature and ocean acidification, and informing management strategies.

[1]  B. D. Todd,et al.  Reference Genome of the Northwestern Pond Turtle, Actinemys marmorata , 2022, The Journal of heredity.

[2]  B. Shapiro,et al.  A Draft Reference Genome Assembly of the Critically Endangered Black Abalone, Haliotis cracherodii , 2022, The Journal of heredity.

[3]  Russell B. Corbett-Detig,et al.  Landscape genomics to enable conservation actions: The California Conservation Genomics Project. , 2022, The Journal of heredity.

[4]  S. Sim,et al.  HiFiAdapterFilt, a memory efficient read processing pipeline, prevents occurrence of adapter sequence in PacBio HiFi reads and their negative impacts on genome assembly , 2022, BMC Genomics.

[5]  H. B. Shaffer,et al.  Reference Genome Assembly of the Big Berry Manzanita (Arctostaphylos glauca) , 2021, The Journal of heredity.

[6]  Heng Li,et al.  Robust haplotype-resolved assembly of diploid individuals without parental data , 2021, 2109.04785.

[7]  Felipe A. Simão,et al.  BUSCO Update: Novel and Streamlined Workflows along with Broader and Deeper Phylogenetic Coverage for Scoring of Eukaryotic, Prokaryotic, and Viral Genomes , 2021, Molecular biology and evolution.

[8]  A. Whitehead,et al.  Evolved differences in energy metabolism and growth dictate the impacts of ocean acidification on abalone aquaculture , 2020, Proceedings of the National Academy of Sciences.

[9]  S. Koren,et al.  Merqury: reference-free quality, completeness, and phasing assessment for genome assemblies , 2020, Genome Biology.

[10]  Sergey Koren,et al.  Towards complete and error-free genome assemblies of all vertebrate species , 2020, Nature.

[11]  Mark Blaxter,et al.  BlobToolKit – Interactive Quality Assessment of Genome Assemblies , 2019, G3: Genes, Genomes, Genetics.

[12]  L. Rogers‐Bennett,et al.  Marine heat wave and multiple stressors tip bull kelp forest to sea urchin barrens , 2019, Scientific Reports.

[13]  J. Strugnell,et al.  Best Foot Forward: Nanopore Long Reads, Hybrid Meta-Assembly, and Haplotig Purging Optimizes the First Genome Assembly for the Southern Hemisphere Blacklip Abalone (Haliotis rubra) , 2019, Front. Genet..

[14]  M. Schatz,et al.  GenomeScope 2.0 and Smudgeplots: Reference-free profiling of polyploid genomes , 2019, bioRxiv.

[15]  F. Delsuc,et al.  MitoFinder: Efficient automated large‐scale extraction of mitogenomic data in target enrichment phylogenomics , 2019, bioRxiv.

[16]  J. Erlandson,et al.  Early Red Abalone Shell Middens, Human Subsistence, and Environmental Change on California's Northern Channel Islands , 2019, Journal of Ethnobiology.

[17]  Nezar Abdennur,et al.  Cooler: scalable storage for Hi-C data and other genomically-labeled arrays , 2019, bioRxiv.

[18]  Arun S. Seetharam,et al.  An Annotated Genome for Haliotis rufescens (Red Abalone) and Resequenced Green, Pink, Pinto, Black, and White Abalone Species , 2019, Genome biology and evolution.

[19]  Malcolm J. Hawksford,et al.  High Resolution , 2019, Colorado Review.

[20]  Sergey Koren,et al.  Integrating Hi-C links with assembly graphs for chromosome-scale assembly , 2018, bioRxiv.

[21]  Kin Chung Lam,et al.  High-resolution TADs reveal DNA sequences underlying genome organization in flies , 2017, Nature Communications.

[22]  Jonas Korlach,et al.  De novo PacBio long-read and phased avian genome assemblies correct and add to reference genes generated with intermediate and short reads , 2017, GigaScience.

[23]  S. Koren,et al.  Scaffolding of long read assemblies using long range contact information , 2016, BMC Genomics.

[24]  Jacob M. Luber,et al.  HiGlass: web-based visual exploration and analysis of genome interaction maps , 2017, Genome Biology.

[25]  Heebal Kim,et al.  Genome sequence of pacific abalone (Haliotis discus hannai): the first draft genome in family Haliotidae , 2017, GigaScience.

[26]  P. Cook Recent Trends in Worldwide Abalone Production , 2016, Journal of Shellfish Research.

[27]  Alexey A. Gurevich,et al.  QUAST: quality assessment tool for genome assemblies , 2013, Bioinform..

[28]  Heng Li Aligning sequence reads, clone sequences and assembly contigs with BWA-MEM , 2013, 1303.3997.

[29]  Ning Ma,et al.  BLAST+: architecture and applications , 2009, BMC Bioinformatics.

[30]  C. Gallardo‐Escárate,et al.  KARYOTYPE COMPOSITION IN THREE CALIFORNIA ABALONES AND THEIR RELATIONSHIP WITH GENOME SIZE , 2007 .

[31]  K. Gruenthal,et al.  Genetic structure of natural populations of California red abalone (Haliotis rufescens) using multiple genetic markers , 2007 .