Six new reference-quality bat genomes illuminate the molecular basis and evolution of bat adaptations

Bats account for ~20% of all extant mammal species and are considered exceptional given their extraordinary adaptations, including biosonar, true flight, extreme longevity, and unparalleled immune systems. To understand these adaptations, we generated reference-quality genomes of six species representing the key divergent lineages. We assembled these genomes with a novel pipeline incorporating state-of-the-art long-read and long-range sequencing and assembly techniques. The genomes were annotated using a maximal evidence approach, de novo predictions, protein/mRNA alignments, Iso-seq long read and RNA-seq short read transcripts, and gene projections from our new TOGA pipeline, retrieving virtually all (>99%) mammalian BUSCO genes. Phylogenetic analyses of 12,931 protein coding-genes and 10,857 conserved non-coding elements identified across 48 mammalian genomes helped to resolve bats’ closest extant relatives within Laurasiatheria, supporting a basal position for bats within Scrotifera. Genome-wide screens along the bat ancestral branch revealed (a) selection on hearing-involved genes (e.g LRP2, SERPINB6, TJP2), which suggest that laryngeal echolocation is a shared ancestral trait of bats; (b) selection (e.g INAVA, CXCL13, NPSR1) and loss of immunity related proteins (e.g. LRRC70, IL36G), including pro-inflammatory NF-kB signalling; and (c) expansion of the APOBEC family, associated with restricting viral infection, transposon activity and interferon signalling. We also identified unique integrated viruses, indicating that bats have a history of tolerating viral pathogens, lethal to other mammal species. Non-coding RNA analyses identified variant and novel microRNAs, revealing regulatory relationships that may contribute to phenotypic diversity in bats. Together, our reference-quality genomes, high-quality annotations, genome-wide screens and in-vitro tests revealed previously unknown genomic adaptations in bats that may explain their extraordinary traits.

[1]  David W. Burt,et al.  Illuminating the dark side of the human transcriptome with TAMA Iso-Seq analysis , 2019, bioRxiv.

[2]  M. Springer,et al.  An ABBA-BABA Test for Introgression Using Retroposon Insertion Data , 2019, bioRxiv.

[3]  A. Katzourakis,et al.  The evolution of endogenous retroviral envelope genes in bats and their potential contribution to host biology. , 2019, Virus research.

[4]  S. Puechmaille,et al.  Longitudinal comparative transcriptomics reveals unique mechanisms underlying extended healthspan in bats , 2019, Nature Ecology & Evolution.

[5]  G. Wilkinson,et al.  Recurrent evolution of extreme longevity in bats , 2019, Biology Letters.

[6]  R. Butler,et al.  Quantifying the completeness of the bat fossil record , 2019, Palaeontology.

[7]  Anna L. Brown,et al.  APOBEC Mutagenesis and Copy-Number Alterations Are Drivers of Proteogenomic Tumor Evolution and Heterogeneity in Metastatic Thoracic Tumors , 2019, Cell reports.

[8]  V. Misra,et al.  Immune System Modulation and Viral Persistence in Bats: Understanding Viral Spillover , 2019, Viruses.

[9]  A reference standard for genome biology , 2018, Nature Biotechnology.

[10]  N. Simmons,et al.  Hibernation in bats (Mammalia: Chiroptera) did not evolve through positive selection of leptin , 2018, Ecology and evolution.

[11]  C. Abraham,et al.  INAVA-ARNO complexes bridge mucosal barrier function with inflammatory signaling , 2018, eLife.

[12]  Judith N. Mandl,et al.  Going to Bat(s) for Studies of Disease Tolerance , 2018, Front. Immunol..

[13]  Sergey Koren,et al.  De novo assembly of haplotype-resolved genomes with trio binning , 2018, Nature Biotechnology.

[14]  R. Pfundt,et al.  Genome-wide investigation of an ID cohort reveals de novo 3′UTR variants affecting gene expression , 2018, Human Genetics.

[15]  Michael Hiller,et al.  The axolotl genome and the evolution of key tissue formation regulators , 2018, Nature.

[16]  Alex Dornburg,et al.  Optimal Rates for Phylogenetic Inference and Experimental Design in the Era of Genome‐Scale Data Sets , 2018, Systematic biology.

[17]  Guan-Zhu Han,et al.  Endogenous retroviruses of non-avian/mammalian vertebrates illuminate diversity and deep history of retroviruses , 2018, PLoS pathogens.

[18]  Juan Li,et al.  Upregulation of miR‐374a promotes tumor metastasis and progression by downregulating LACTB and predicts unfavorable prognosis in breast cancer , 2018, Cancer medicine.

[19]  T. Kepler,et al.  The Egyptian Rousette Genome Reveals Unexpected Features of Bat Antiviral Immunity , 2018, Cell.

[20]  D. Elleder,et al.  Remnants of an Ancient Deltaretrovirus in the Genomes of Horseshoe Bats (Rhinolophidae) , 2018, Viruses.

[21]  R. Harris,et al.  Differential Evolution of Antiretroviral Restriction Factors in Pteropid Bats as Revealed by APOBEC3 Gene Complexity , 2018, Molecular biology and evolution.

[22]  Bjoern E. Langer,et al.  A genomics approach reveals insights into the importance of gene losses for mammalian adaptations , 2018, Nature Communications.

[23]  M. Daly,et al.  C1orf106 is a colitis risk gene that regulates stability of epithelial adherens junctions , 2018, Science.

[24]  E. Myers,et al.  Bat Biology, Genomes, and the Bat1K Project: To Generate Chromosome-Level Genomes for All Living Bat Species. , 2018, Annual review of animal biosciences.

[25]  Graham M. Hughes,et al.  Growing old, yet staying young: The role of telomeres in bats’ exceptional longevity , 2018, Science Advances.

[26]  Michael Hiller,et al.  Author Correction: The axolotl genome and the evolution of key tissue formation regulators , 2018, Nature.

[27]  Michael Hiller,et al.  CESAR 2.0 substantially improves speed and accuracy of comparative gene annotation , 2017, Bioinform..

[28]  Graham M. Hughes,et al.  A Potent Anti-Inflammatory Response in Bat Macrophages May Be Linked to Extended Longevity and Viral Tolerance , 2017, Acta Chiropterologica.

[29]  German Tischler-Höhle,et al.  Haplotype and Repeat Separation in Long Reads , 2017, CIBB.

[30]  Robert M. Waterhouse,et al.  BUSCO Applications from Quality Assessments to Gene Prediction and Phylogenomics , 2017, bioRxiv.

[31]  A. von Haeseler,et al.  UFBoot2: Improving the Ultrafast Bootstrap Approximation , 2017, bioRxiv.

[32]  German Tischler Haplotype and Repeat Separation in Long Reads , 2017, bioRxiv.

[33]  A. Alase,et al.  IL‐36γ has proinflammatory effects on human endothelial cells , 2017, Experimental dermatology.

[34]  Thomas K. F. Wong,et al.  ModelFinder: Fast Model Selection for Accurate Phylogenetic Estimates , 2017, Nature Methods.

[35]  R. Baertsch,et al.  Speciation network in Laurasiatheria: retrophylogenomic signals , 2017, Genome research.

[36]  C. Francks,et al.  Next-gen sequencing identifies non-coding variation disrupting miRNA-binding sites in neurological disorders , 2017, Molecular Psychiatry.

[37]  D. Elleder,et al.  Discovery of an endogenous Deltaretrovirus in the genome of long-fingered bats (Chiroptera: Miniopteridae) , 2017, Proceedings of the National Academy of Sciences.

[38]  Eugene W. Myers,et al.  Non Hybrid Long Read Consensus Using Local De Bruijn Graph Assembly , 2017, bioRxiv.

[39]  Emma C. Teeling,et al.  Prenatal development supports a single origin of laryngeal echolocation in bats , 2017, Nature Ecology &Evolution.

[40]  D. Hayman Bats as Viral Reservoirs. , 2016, Annual review of virology.

[41]  D. Ray,et al.  Pinpointing the vesper bat transposon revolution using the Miniopterus natalensis genome , 2016, Mobile DNA.

[42]  E. Teeling,et al.  Mammal madness: is the mammal tree of life not yet resolved? , 2016, Philosophical Transactions of the Royal Society B: Biological Sciences.

[43]  S. Vernes What bats have to say about speech and language , 2016, Psychonomic bulletin & review.

[44]  HaroldC. Smith,et al.  The APOBEC Protein Family: United by Structure, Divergent in Function. , 2016, Trends in biochemical sciences.

[45]  Sarah C. Ayling,et al.  The Ensembl gene annotation system , 2016, Database J. Biol. Databases Curation.

[46]  O. Harismendy,et al.  Nrf2 Induces IL-17D to Mediate Tumor and Virus Surveillance. , 2016, Cell reports.

[47]  J. Wall,et al.  Transcriptomic and epigenomic characterization of the developing bat wing , 2016, Nature Genetics.

[48]  W. Choi,et al.  Clinical and Biologic Significance of MYC Genetic Mutations in De Novo Diffuse Large B-cell Lymphoma , 2016, Clinical Cancer Research.

[49]  Jie Cui,et al.  Unique Loss of the PYHIN Gene Family in Bats Amongst Mammals: Implications for Inflammasome Sensing , 2016, Scientific Reports.

[50]  O. Inatomi,et al.  Increased Expression of Interleukin-36, a Member of the Interleukin-1 Cytokine Family, in Inflammatory Bowel Disease , 2016, Inflammatory bowel diseases.

[51]  D. Ray,et al.  Accurate Transposable Element Annotation Is Vital When Analyzing New Genome Assemblies , 2016, Genome biology and evolution.

[52]  Raymond J. Moran,et al.  The Interrelationships of Placental Mammals and the Limits of Phylogenetic Inference , 2016, Genome biology and evolution.

[53]  Michael G. Nute,et al.  A comparative study of SVDquartets and other coalescent-based species tree estimation methods , 2015, BMC Genomics.

[54]  B. Di Camillo,et al.  FunPat: function-based pattern analysis on RNA-seq time series data , 2015, BMC Genomics.

[55]  Sergei L. Kosakovsky Pond,et al.  UC Office of the President Recent Work Title Less Is More : An Adaptive Branch-Site Random Effects Model for Efficient Detection of Episodic Diversifying Selection Permalink , 2015 .

[56]  A. Dobson,et al.  Bats as ‘special’ reservoirs for emerging zoonotic pathogens , 2015, Trends in Microbiology.

[57]  Laura Salter Kubatko,et al.  Quartet Inference from SNP Data Under the Coalescent Model , 2014, Bioinform..

[58]  A. von Haeseler,et al.  IQ-TREE: A Fast and Effective Stochastic Algorithm for Estimating Maximum-Likelihood Phylogenies , 2014, Molecular biology and evolution.

[59]  R. Belshaw,et al.  Larger Mammalian Body Size Leads to Lower Retroviral Activity , 2014, PLoS pathogens.

[60]  R. Plowright,et al.  Bat Flight and Zoonotic Viruses , 2014, Emerging infectious diseases.

[61]  M. Grabherr,et al.  Broad-scale phylogenomics provides insights into retrovirus–host evolution , 2013, Proceedings of the National Academy of Sciences.

[62]  E. Stupka,et al.  Phylogenomic Analyses Elucidate the Evolutionary Relationships of Bats , 2013, Current Biology.

[63]  Justin Tan,et al.  Absence of SERPINB6A causes sensorineural hearing loss with multiple histopathologies in the mouse inner ear. , 2013, The American journal of pathology.

[64]  Gareth Jones,et al.  From the ultrasonic to the infrared: molecular evolution and the sensory biology of bats , 2013, Front. Physiol..

[65]  A. Katzourakis,et al.  Paleovirology and virally derived immunity. , 2012, Trends in ecology & evolution.

[66]  Shuyi Zhang,et al.  Virome Analysis for Identification of Novel Mammalian Viruses in Bat Species from Chinese Provinces , 2012, Journal of Virology.

[67]  C. Münk,et al.  An ancient history of gene duplications, fusions and losses in the evolution of APOBEC3 mutators in mammals , 2012, BMC Evolutionary Biology.

[68]  Eugene Berezikov,et al.  Evolution of microRNA diversity and regulation in animals , 2011, Nature Reviews Genetics.

[69]  K. Dittmar,et al.  Evolutionary maintenance of filovirus-like genes in bat genomes , 2011, BMC Evolutionary Biology.

[70]  Brianne R. Barker,et al.  Cross-regulation between the IL-1β/IL-18 processing inflammasome and other inflammatory cytokines. , 2011, Current opinion in immunology.

[71]  Thomas Mailund,et al.  On Computing the Coalescence Time Density in an Isolation-With-Migration Model With Few Samples , 2011, Genetics.

[72]  H. Philippe,et al.  Resolving Difficult Phylogenetic Questions: Why More Sequences Are Not Enough , 2011, PLoS biology.

[73]  James T. Elder,et al.  IL-1F5, -F6, -F8, and -F9: A Novel IL-1 Family Signaling System That Is Active in Psoriasis and Promotes Keratinocyte Antimicrobial Peptide Expression , 2011, The Journal of Immunology.

[74]  A. Katzourakis,et al.  Endogenous Viral Elements in Animal Genomes , 2010, PLoS genetics.

[75]  J. Sebat,et al.  Genomic duplication and overexpression of TJP2/ZO-2 leads to altered expression of apoptosis genes in progressive nonsyndromic hearing loss DFNA51. , 2010, American journal of human genetics.

[76]  M. King,et al.  A truncating mutation in SERPINB6 is associated with autosomal-recessive nonsyndromic sensorineural hearing loss. , 2010, American journal of human genetics.

[77]  Shunsuke Kimura,et al.  Uptake through glycoprotein 2 of FimH+ bacteria by M cells initiates mucosal immune response , 2009, Nature.

[78]  O. Pybus,et al.  Macroevolution of Complex Retroviruses , 2009, Science.

[79]  B. Charlesworth Effective population size and patterns of molecular evolution and variation , 2009, Nature Reviews Genetics.

[80]  Chris Mungall,et al.  AmiGO: online access to ontology and annotation data , 2008, Bioinform..

[81]  C. Petit,et al.  Mutation analysis of TMC1 identifies four new mutations and suggests an additional deafness gene at loci DFNA36 and DFNB7/11 , 2008, Clinical genetics.

[82]  Philip C. J. Donoghue,et al.  MicroRNAs and the advent of vertebrate morphological complexity , 2008, Proceedings of the National Academy of Sciences.

[83]  L. Rüttiger,et al.  Estrogen and the inner ear: megalin knockout mice suffer progressive hearing loss , 2008, FASEB journal : official publication of the Federation of American Societies for Experimental Biology.

[84]  Ziheng Yang PAML 4: phylogenetic analysis by maximum likelihood. , 2007, Molecular biology and evolution.

[85]  Dian Donnai,et al.  Mutations in LRP2, which encodes the multiligand receptor megalin, cause Donnai-Barrow and facio-oculo-acoustico-renal syndromes , 2007, Nature Genetics.

[86]  O. Pybus,et al.  Discovery and analysis of the first endogenous lentivirus , 2007, Proceedings of the National Academy of Sciences.

[87]  C. Feschotte,et al.  The evolutionary history of human DNA transposons: evidence for intense activity in the primate lineage. , 2007, Genome research.

[88]  J. Kere,et al.  Downstream target genes of the neuropeptide S-NPSR1 pathway. , 2006, Human molecular genetics.

[89]  M. Hasegawa,et al.  Pegasoferae, an unexpected mammalian clade revealed by tracking ancient retroposon insertions. , 2006, Proceedings of the National Academy of Sciences of the United States of America.

[90]  Faisal Ababneh,et al.  Matched-pairs tests of homogeneity with applications to homologous nucleotide sequences , 2006, Bioinform..

[91]  N. Rosenberg,et al.  Discordance of Species Trees with Their Most Likely Gene Trees , 2006, PLoS genetics.

[92]  Burkhard Morgenstern,et al.  Gene prediction in eukaryotes with a generalized hidden Markov model that uses hints from external sources , 2006, BMC Bioinformatics.

[93]  L. Poladian,et al.  Is the "Big Bang" in Animal Evolution Real? , 2005, Science.

[94]  Sergei L. Kosakovsky Pond,et al.  HyPhy: hypothesis testing using phylogenies , 2005, Bioinform..

[95]  S. O’Brien,et al.  A Molecular Phylogeny for Bats Illuminates Biogeography and the Fossil Record , 2005, Science.

[96]  Shizuo Akira,et al.  Lipocalin 2 mediates an innate immune response to bacterial infection by sequestrating iron , 2004, Nature.

[97]  J. de Magalhães,et al.  GenAge: a genomic and proteomic network map of human ageing , 2004, FEBS letters.

[98]  D. Haussler,et al.  Ultraconserved Elements in the Human Genome , 2004, Science.

[99]  Wei Wang,et al.  Synleurin, a novel leucine-rich repeat protein that increases the intensity of pleiotropic cytokine responses. , 2003, Biochemical and biophysical research communications.

[100]  Colin N. Dewey,et al.  Initial sequencing and comparative analysis of the mouse genome. , 2002 .

[101]  J. V. Moran,et al.  Initial sequencing and analysis of the human genome. , 2001, Nature.

[102]  M. Nagura,et al.  Ultrastructural localization of megalin in the rat cochlear duct , 1999, Hearing Research.

[103]  Jason G. Cyster,et al.  A B-cell-homing chemokine made in lymphoid follicles activates Burkitt's lymphoma receptor-1 , 1998, Nature.

[104]  Genome-wide investigation of an ID cohort reveals de novo 3′UTR variants affecting gene expression , 2019 .

[105]  E. Myers,et al.  Annual Review of Animal Biosciences Bat Biology , Genomes , and the Bat 1 K Project : To Generate Chromosome-Level Genomes for All Living Bat Species , 2018 .

[106]  John Robinson,et al.  Identifying Optimal Models of Evolution. , 2017, Methods in molecular biology.

[107]  Yehu Moran,et al.  The evolutionary origin of plant and animal microRNAs. , 2017, Nature ecology & evolution.

[108]  David C. Wilson,et al.  Host-microbe interactions have shaped the genetic architecture of inflammatory bowel disease , 2012, Nature.

[109]  Mouse Genome Sequencing Consortium Initial sequencing and comparative analysis of the mouse genome , 2002, Nature.