Bee core venom genes predominantly originated before aculeate stingers evolved

Venoms have convergently evolved in all major animal lineages and are ideal candidates to unravel the genomic processes underlying convergent trait evolution. However, few animal groups have been studied in detail, and large-scale comparative genomic analyses to address toxin gene evolution are rare. The hyper-diverse hymenopterans are the most speciose group of venomous animals, but the origin of their toxin genes has been largely overlooked. We combined proteo-transcriptomics with comparative genomics compiling an up-to-date list of core bee venom proteins to investigate the origin of 11 venom genes in 30 hymenopteran genomes including two newly sequenced genomes of stingless bees. We found a more distinct pattern in which toxin genes originated predominantly by single gene co-option, a prevalent mechanism in parasitoid wasps. These are always accompanied by parallel expansion events in other bee and hymenopteran sister gene groups. The short toxic peptides melittin and apamin appear to be unique to bees, a result supported by a novel machine learning approach. Based on the syntenic pattern we propose here Anthophilin1 as a bee-unique gene family that includes apamin and MCDP. It appears that bees co-adapted their venom genes which predominantly already existed before the aculeate ovipositor evolved to a stinger which exclusively injects venom. Our results provide insight into the large-scale evolution of bee venom compounds, and we present here the first study revealing general processes of venom evolution at a comparative genomic level for this mega-diverse Hymenoptera.

[1]  B. Rost,et al.  Protein embeddings and deep learning predict binding residues for various ligand classes , 2021, Scientific Reports.

[2]  B. Rost,et al.  Protein language model embeddings for fast, accurate, alignment-free protein structure prediction , 2021, bioRxiv.

[3]  B. Berger,et al.  Learning the protein language: Evolution, structure, and function. , 2021, Cell systems.

[4]  Benjamin A. Neely,et al.  Proteogenomic Assessment of Intraspecific Venom Variability: Molecular Adaptations in the Venom Arsenal of Conus purpurascens , 2021, Molecular & cellular proteomics : MCP.

[5]  Michael Broe,et al.  Tracking the recruitment and evolution of snake toxins using the evolutionary context provided by the Bothrops jararaca genome , 2021, Proceedings of the National Academy of Sciences.

[6]  Kevin K. Yang,et al.  Learned Embeddings from Deep Learning to Visualize and Predict Protein Sets , 2021, Current protocols.

[7]  B. Rost,et al.  Clustering FunFams using sequence embeddings improves EC purity , 2021, bioRxiv.

[8]  Myle Ott,et al.  Biological structure and function emerge from scaling unsupervised learning to 250 million protein sequences , 2019, Proceedings of the National Academy of Sciences.

[9]  B. Devreese,et al.  Standard methods for Apis mellifera venom research , 2021 .

[10]  Tom Sercu,et al.  Transformer protein language models are unsupervised structure learners , 2020, bioRxiv.

[11]  I. Koludarov,et al.  How the Toxin got its Toxicity , 2020, Frontiers in Pharmacology.

[12]  Paul H. Williams,et al.  Genus-Wide Characterization of Bumblebee Genomes Provides Insights into Their Evolution and Variation in Ecological and Behavioral Traits , 2020, Molecular biology and evolution.

[13]  Burkhard Rost,et al.  Embeddings from deep learning transfer GO annotations beyond homology , 2020, Scientific Reports.

[14]  A. Walker The evolutionary dynamics of venom toxins made by insects and other animals. , 2020, Biochemical Society transactions.

[15]  B. Rost,et al.  ProtTrans: Towards Cracking the Language of Life’s Code Through Self-Supervised Deep Learning and High Performance Computing , 2020, bioRxiv.

[16]  Modeling Aspects , 2020, Finite Elements for Engineers with ANSYS Applications.

[17]  Beryl M. Jones,et al.  Developmental plasticity shapes social traits and selection in a facultatively eusocial bee , 2020, Proceedings of the National Academy of Sciences.

[18]  Giulia Zancolli,et al.  Venom systems as models for studying the origin and regulation of evolutionary novelties. , 2020, Molecular biology and evolution.

[19]  Burkhard Rost,et al.  Modeling aspects of the language of life through transfer-learning protein sequences , 2019, BMC Bioinformatics.

[20]  R. Jenner,et al.  The Diversity of Venom: The Importance of Behavior and Venom System Morphology in Understanding Its Ecology and Evolution , 2019, Toxins.

[21]  E. Undheim,et al.  Toxins from scratch? Diverse, multimodal gene origins in the predatory robber fly Dasypogon diadema indicate a dynamic venom evolution in dipteran insects , 2019, GigaScience.

[22]  B. V. von Reumont,et al.  The Significance of Comparative Genomics in Modern Evolutionary Venomics , 2019, Front. Ecol. Evol..

[23]  George M. Church,et al.  Unified rational protein engineering with sequence-only deep representation learning , 2019, bioRxiv.

[24]  K. Sunagar,et al.  Reconstructing the evolutionary history of a functionally diverse gene family reveals complexity at the genetic origins of novelty , 2019, bioRxiv.

[25]  Bonnie Berger,et al.  Learning protein sequence embeddings using information from structure , 2019, ICLR.

[26]  Ming-Wei Chang,et al.  BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding , 2019, NAACL.

[27]  G. King,et al.  Entomo‐venomics: The evolution, biology and biochemistry of insect venoms , 2018, Toxicon : official journal of the International Society on Toxinology.

[28]  I. Vetter,et al.  A comprehensive portrait of the venom of the giant red bull ant, Myrmecia gulosa, reveals a hyperdiverse hymenopteran toxin gene family , 2018, Science Advances.

[29]  R. Norton,et al.  Venoms to the rescue , 2018, Science.

[30]  Johannes Söding,et al.  Protein-level assembly increases protein sequence recovery from metagenomic samples manyfold , 2018, Nature Methods.

[31]  Matthew D MacManes,et al.  The Oyster River Protocol: a multi-assembler and kmer approach for de novo transcriptome assembly , 2017, bioRxiv.

[32]  Alexander S. Mikheyev,et al.  A hybrid de novo genome assembly of the honeybee, Apis mellifera, with chromosome-length scaffolds , 2018, BMC Genomics.

[33]  José Roberto Aparecido dos Santos-Pinto,et al.  Diversity of peptidic and proteinaceous toxins from social Hymenoptera venoms. , 2018, Toxicon : official journal of the International Society on Toxinology.

[34]  S. Carroll,et al.  Extremely Divergent Haplotypes in Two Toxin Gene Complexes Encode Alternative Venom Types within Rattlesnake Species , 2018, Current Biology.

[35]  Andrew J. Mason,et al.  A Single Mutation Unlocks Cascading Exaptations in the Origin of a Potent Pitviper Neurotoxin , 2018, Molecular biology and evolution.

[36]  Johannes Söding,et al.  Clustering huge protein sequence sets in linear time , 2017, Nature Communications.

[37]  Aditi Tayal,et al.  Hymenoptera Genome Database: Using HymenopteraMine to Enhance Genomic Studies of Hymenopteran Insects. , 2018, Methods in molecular biology.

[38]  D. Elieh Ali Komi,et al.  Immunology of Bee Venom , 2018, Clinical Reviews in Allergy & Immunology.

[39]  A. Lemmon,et al.  Quantity, Not Quality: Rapid Adaptation in a Polygenic Trait Proceeded Exclusively through Expression Differentiation , 2017, Molecular biology and evolution.

[40]  Mrinalini,et al.  The Evolution of Venom by Co-option of Single-Copy Genes , 2017, Current Biology.

[41]  T. Shinada,et al.  Isolation of biologically active peptides from the venom of Japanese carpenter bee, Xylocopa appendiculata , 2017, Journal of Venomous Animals and Toxins including Tropical Diseases.

[42]  Alexey M. Kozlov,et al.  Evolutionary History of the Hymenoptera , 2017, Current Biology.

[43]  B. Fry,et al.  Rapid Radiations and the Race to Redundancy: An Investigation of the Evolution of Australian Elapid Snake Venoms , 2016, Toxins.

[44]  C. Dunn,et al.  Comparative genomics and the diversity of life , 2016 .

[45]  S. Carroll,et al.  The Deep Origin and Recent Loss of Venom Toxin Genes in Rattlesnakes , 2016, Current Biology.

[46]  Min Zhao,et al.  Proteomic analysis of the venom and venom sac of the woodwasp, Sirex noctilio - Towards understanding its biological impact. , 2016, Journal of proteomics.

[47]  Lior Pachter,et al.  Erratum: Near-optimal probabilistic RNA-seq quantification , 2016, Nature Biotechnology.

[48]  Dave Wheeler,et al.  The venom gland transcriptome of the parasitoid wasp Nasonia vitripennis highlights the importance of novel genes in venom function , 2016, BMC Genomics.

[49]  Pei Zhang,et al.  Evolutionary trajectories of snake genes and genomes revealed by comparative analyses of five-pacer viper , 2016, Nature Communications.

[50]  Jun Chen,et al.  Melittin, the Major Pain-Producing Substance of Bee Venom , 2016, Neuroscience Bulletin.

[51]  Ji Hyeong Baek,et al.  Differential Properties of Venom Peptides and Proteins in Solitary vs. Social Hunting Wasps , 2016, Toxins.

[52]  B. Devreese,et al.  Honeybee Venom Proteome Profile of Queens and Winter Bees as Determined by a Mass Spectrometric Approach , 2015, Toxins.

[53]  Evgeny M. Zdobnov,et al.  BUSCO: assessing genome assembly and annotation completeness with single-copy orthologs , 2015, Bioinform..

[54]  Matthew E. Hudson,et al.  Genomic signatures of evolutionary transitions from solitary to group living , 2015, Science.

[55]  E. Giralt,et al.  Three Valuable Peptides from Bee and Wasp Venoms for Therapeutic and Biotechnological Use: Melittin, Apamin and Mastoparan , 2015, Toxins.

[56]  V. Čeřovský,et al.  Interaction of a novel antimicrobial peptide isolated from the venom of solitary bee Colletes daviesanus with phospholipid vesicles and Escherichia coli cells , 2014, Journal of peptide science : an official publication of the European Peptide Society.

[57]  H. Zakon,et al.  The Evolution of the Four Subunits of Voltage-Gated Calcium Channels: Ancient Roots, Increasing Complexity, and Multiple Losses , 2014, Genome biology and evolution.

[58]  Alexandros Stamatakis,et al.  ExaBayes: Massively Parallel Bayesian Tree Inference for the Whole-Genome Era , 2014, Molecular biology and evolution.

[59]  J. Logan,et al.  The king cobra genome reveals dynamic gene evolution and adaptation in the snake venom system , 2013, Proceedings of the National Academy of Sciences.

[60]  Nicholas R Casewell,et al.  Complex cocktails: the evolutionary novelty of venoms. , 2013, Trends in ecology & evolution.

[61]  K. Katoh,et al.  MAFFT Multiple Sequence Alignment Software Version 7: Improvements in Performance and Usability , 2013, Molecular biology and evolution.

[62]  Emily S. Wong,et al.  Venom evolution through gene duplications. , 2012, Gene.

[63]  Jaideep P. Sundaram,et al.  The Hymenoptera Genome Database , 2010 .

[64]  Seon-Woo Lee,et al.  Molecular cloning and antimicrobial activity of bombolitin, a component of bumblebee Bombus ignitus venom. , 2010, Comparative biochemistry and physiology. Part B, Biochemistry & molecular biology.

[65]  D. D. de Graaf,et al.  Venom Proteins of the Parasitoid Wasp Nasonia vitripennis: Recent Discovery of an Untapped Pharmacopee , 2010, Toxins.

[66]  F. Kondrashov,et al.  The evolution of gene duplications: classifying and distinguishing between models , 2010, Nature Reviews Genetics.

[67]  R. Stöcklin,et al.  Structural identification by mass spectrometry of a novel antimicrobial peptide from the venom of the solitary bee Osmia rufa (Hymenoptera: Megachilidae). , 2010, Toxicon : official journal of the International Society on Toxinology.

[68]  Ian S. Dunn,et al.  Exploring the Limits , 2009 .

[69]  R. Norton,et al.  The toxicogenomic multiverse: convergent recruitment of proteins into animal venoms. , 2009, Annual review of genomics and human genetics.

[70]  Z. Voburka,et al.  Lasioglossins: Three Novel Antimicrobial Peptides from the Venom of the Eusocial Bee Lasioglossum laticeps (Hymenoptera: Halictidae) , 2009, Chembiochem : a European journal of chemical biology.

[71]  Z. Voburka,et al.  Melectin: A Novel Antimicrobial Peptide from the Venom of the Cleptoparasitic Bee Melecta albifrons , 2008, Chembiochem : a European journal of chemical biology.

[72]  J. Finnerty,et al.  Concerted evolution of sea anemone neurotoxin genes is revealed through analysis of the Nematostella vectensis genome. , 2008, Molecular biology and evolution.

[73]  A. Chattopadhyay,et al.  Melittin: a Membrane-active Peptide with Diverse Functions , 2007, Bioscience reports.

[74]  Donald R Hoffman Hymenoptera venom allergens , 2006, Clinical reviews in allergy & immunology.

[75]  M. Nei,et al.  Concerted and birth-and-death evolution of multigene families. , 2005, Annual review of genetics.

[76]  M. Brochetto-Braga,et al.  Hymenoptera venom review focusing on Apis mellifera , 2003 .

[77]  E. Myers,et al.  Basic local alignment search tool. , 1990, Journal of molecular biology.

[78]  Dr. Susumu Ohno Evolution by Gene Duplication , 1970, Springer Berlin Heidelberg.