The revised reference genome of the leopard gecko (Eublepharis macularius) provides insight into the considerations of genome phasing and assembly

Genomic resources across squamate reptiles (lizards and snakes) have lagged behind other vertebrate systems and high-quality reference genomes remain scarce. Of the 23 chromosome-scale reference genomes across the order, only 12 of the ~60 squamate families are represented. Within geckos (infraorder Gekkota), a species-rich clade of lizards, chromosome-level genomes are exceptionally sparse representing only two of the seven extant families. Using the latest advances in genome sequencing and assembly methods, we generated one of the highest quality squamate genomes to date for the leopard gecko, Eublepharis macularius (Eublepharidae). We compared this assembly to the previous, short-read only, E. macularius reference genome published in 2016 and examined potential factors within the assembly influencing contiguity of genome assemblies using PacBio HiFi data. Briefly, the read N50 of the PacBio HiFi reads generated for this study was equal to the contig N50 of the previous E. macularius reference genome at 20.4 kilobases. The HiFi reads were assembled into a total of 132 contigs, which was further scaffolded using HiC data into 75 total sequences representing all 19 chromosomes. We identified that 9 of the 19 chromosomes were assembled as single contigs, while the other 10 chromosomes were each scaffolded together from two or more contigs. We qualitatively identified that percent repeat content within a chromosome broadly affects its assembly contiguity prior to scaffolding. This genome assembly signifies a new age for squamate genomics where high-quality reference genomes rivaling some of the best vertebrate genome assemblies can be generated for a fraction previous cost estimates. This new E. macularius reference assembly is available on NCBI at JAOPLA010000000. The genome version and its associated annotations are also available via this Figshare repository https://doi.org/10.6084/m9.figshare.20069273.

[1]  Glennis A. Logsdon,et al.  Verkko: telomere-to-telomere assembly of diploid chromosomes , 2022, bioRxiv.

[2]  M. Kiskowski,et al.  Does colour impact responses to images in geckos? , 2022, Journal of Zoology.

[3]  T. Gamble,et al.  Chromosome-level genome assembly reveals dynamic sex chromosomes in Neotropical leaf-litter geckos (Sphaerodactylidae: Sphaerodactylus). , 2022, The Journal of heredity.

[4]  Heng Li,et al.  Haplotype-resolved assembly of diploid genomes without parental data , 2022, Nature Biotechnology.

[5]  A. Bauer,et al.  The evolutionary history of an accidental model organism, the leopard gecko Eublepharis macularius (Squamata: Eublepharidae). , 2022, Molecular phylogenetics and evolution.

[6]  S. Edwards,et al.  What Have We Learned from the First 500 Avian Genomes? , 2021, Annual Review of Ecology, Evolution, and Systematics.

[7]  Jessica S. Blackburn,et al.  Long-read sequencing of the zebrafish genome reorganizes genomic architecture , 2021, bioRxiv.

[8]  Joanna L. Kelley,et al.  Toward a genome sequence for every animal: Where are we now? , 2021, Proceedings of the National Academy of Sciences.

[9]  T. Gamble,et al.  A chromosome-level genome assembly of the parasitoid wasp, Cotesia glomerata (Hymenoptera: Braconidae). , 2021, The Journal of heredity.

[10]  Aaron M. Streets,et al.  The complete sequence of a human genome , 2021, bioRxiv.

[11]  Mitsutaka Kadota,et al.  Technical considerations in Hi‐C scaffolding and evaluation of chromosome‐scale genome assemblies , 2021, Molecular ecology.

[12]  R. Durbin,et al.  False gene and chromosome losses affected by assembly and sequence errors , 2021, bioRxiv.

[13]  T. Glimm,et al.  Capturing and analyzing pattern diversity: an example using the melanistic spotted patterns of leopard geckos , 2021, bioRxiv.

[14]  Heng Li,et al.  Haplotype-resolved de novo assembly using phased assembly graphs with hifiasm , 2021, Nature Methods.

[15]  Steven L Salzberg,et al.  Liftoff: accurate mapping of gene annotations , 2020, Bioinform..

[16]  S. Koren,et al.  Merqury: reference-free quality, completeness, and phasing assessment for genome assemblies , 2020, Genome Biology.

[17]  Cédric Feschotte,et al.  RepeatModeler2 for automated genomic discovery of transposable element families , 2020, Proceedings of the National Academy of Sciences.

[18]  J. Madej,et al.  Iridophoroma associated with the Lemon Frost colour morph of the leopard gecko (Eublepharis macularius) , 2020, Scientific Reports.

[19]  Alexander Suh,et al.  Identifying the causes and consequences of assembly gaps using a multiplatform genome assembly of a bird‐of‐paradise , 2019, bioRxiv.

[20]  Andrew G. Clark,et al.  RepeatModeler2: automated genomic discovery of transposable element families , 2019, bioRxiv.

[21]  T. Gamble Duplications in corneous beta protein genes and the evolution of gecko adhesion. , 2019, Integrative and comparative biology.

[22]  Srinivas Aluru,et al.  Efficient Architecture-Aware Acceleration of BWA-MEM for Multicore Systems , 2019, 2019 IEEE International Parallel and Distributed Processing Symposium (IPDPS).

[23]  Mosè Manni,et al.  BUSCO: Assessing Genome Assembly and Annotation Completeness. , 2019, Methods in molecular biology.

[24]  Sergey Koren,et al.  De novo assembly of haplotype-resolved genomes with trio binning , 2018, Nature Biotechnology.

[25]  Thomas J. Sanger,et al.  How a growing organismal perspective is adding new depth to integrative studies of morphological evolution , 2018, Biological reviews of the Cambridge Philosophical Society.

[26]  T. Gamble,et al.  Isolating and quantifying the role of developmental noise in generating phenotypic variation , 2018, bioRxiv.

[27]  Heng Li,et al.  Minimap2: pairwise alignment for nucleotide sequences , 2017, Bioinform..

[28]  Osamu Nishimura,et al.  gVolante for standardizing completeness assessment of genome and transcriptome assemblies , 2017, Bioinform..

[29]  Sudhir Kumar,et al.  TimeTree: A Resource for Timelines, Timetrees, and Divergence Times. , 2017, Molecular biology and evolution.

[30]  Neva C. Durand,et al.  De novo assembly of the Aedes aegypti genome using Hi-C yields chromosome-length scaffolds , 2016, Science.

[31]  Huanming Yang,et al.  Draft genome of the leopard gecko, Eublepharis macularius , 2016, GigaScience.

[32]  James T. Robinson,et al.  Juicebox Provides a Visualization System for Hi-C Contact Maps with Unlimited Zoom. , 2016, Cell systems.

[33]  M. Schatz,et al.  Phased diploid genome assembly with single-molecule real-time sequencing , 2016, Nature Methods.

[34]  Mario Baum,et al.  Homology The Hierarchical Basis Of Comparative Biology , 2016 .

[35]  J. Wiens,et al.  Combining phylogenomic and supermatrix approaches, and a time-calibrated phylogeny for squamate reptiles (lizards and snakes) based on 52 genes and 4162 species. , 2016, Molecular phylogenetics and evolution.

[36]  Evgeny M. Zdobnov,et al.  BUSCO: assessing genome assembly and annotation completeness with single-copy orthologs , 2015, Bioinform..

[37]  T. Gamble,et al.  Restriction Site-Associated DNA Sequencing (RAD-seq) Reveals an Extraordinary Number of Transitions among Gecko Sex-Determining Systems. , 2015, Molecular biology and evolution.

[38]  A. Amores,et al.  A RAD-Tag Genetic Map for the Platyfish (Xiphophorus maculatus) Reveals Mechanisms of Karyotype Evolution Among Teleost Fish , 2014, Genetics.

[39]  L. Vitt Geckos: The Animal Answer Guide , 2013, Copeia.

[40]  T. Townsend,et al.  Resolving the phylogeny of lizards and snakes (Squamata) with extensive sampling of genes and species , 2012, Biology Letters.

[41]  M. Vickaryous,et al.  Scar‐Free Wound Healing and Regeneration Following Tail Loss in the Leopard Gecko, Eublepharis macularius , 2012, Anatomical record.

[42]  Y. Benjamini,et al.  Summarizing and correcting the GC content bias in high-throughput sequencing , 2012, Nucleic acids research.

[43]  M. Vickaryous,et al.  A novel amniote model of epimorphic regeneration: the leopard gecko, Eublepharis macularius , 2011, BMC Developmental Biology.

[44]  Gonçalo R. Abecasis,et al.  The variant call format and VCFtools , 2011, Bioinform..

[45]  L. Vitt,et al.  Coming to America: multiple origins of New World geckos , 2011, Journal of evolutionary biology.

[46]  M. Wake,et al.  Integrative Biology: Science for the 21st Century , 2008 .

[47]  R. Gadagkar Nothing in Biology Makes Sense Except in the Light of Evolution , 2005 .

[48]  M. Pagel,et al.  Bayesian estimation of ancestral character states on phylogenies. , 2004, Systematic biology.

[49]  E. Louis,et al.  Molecular phylogenetics of squamata: the position of snakes, amphisbaenians, and dibamids, and the root of the squamate tree. , 2004, Systematic biology.

[50]  J. Sakata,et al.  Social experience affects territorial and reproductive behaviours in male leopard geckos, Eublepharis macularius , 2002, Animal Behaviour.

[51]  Laurence D. Hurst,et al.  The evolution of isochores , 2001, Nature Reviews Genetics.

[52]  L. Witmer Homology of facial structures in extant archosaurs (birds and crocodilians), with special reference to paranasal pneumaticity and nasal conchae , 1995, Journal of morphology.

[53]  D. Crews,et al.  Temperature-dependent sex determination in the leopard gecko, Eublepharis macularius. , 1993, The Journal of experimental zoology.

[54]  A. Russell,et al.  The Role of Phylogenetic Analysis in the Inference of Unpreserved Attributes of Extinct Taxa , 1992 .

[55]  J. Felsenstein Phylogenies and the Comparative Method , 1985, The American Naturalist.

[56]  I. Whimster,et al.  AN EXPERIMENTAL APPROACH TO THE PROBLEM OF SPOTTINESS. , 1965, The British journal of dermatology.