Revisiting rustrela virus – new cases of encephalitis and a solution to the capsid enigma

Rustrela virus (RusV, species Rubivirus strelense) is a recently discovered relative of rubella virus (RuV) that has been detected in cases of encephalitis across a wide spectrum of mammals, including placental and marsupial animals. Here we diagnosed two additional cases of fatal RusV-associated meningoencephalitis in a South American coati (Nasua nasua) and a Eurasian otter (Lutra lutra) that were detected in a zoological garden with history of prior RusV infections. Both animals showed abnormal movement or unusual behaviour and their brains tested positive for RusV using specific RT-qPCR and RNA in situ hybridization. As previous sequencing of RusV proved to be very challenging, we employed a sophisticated target-specific capture enrichment with specifically designed RNA baits to generate complete RusV genome sequences from both detected encephalitic animals and apparently healthy wild yellow-necked field mice (Apodemus flavicollis). Furthermore, the technique was used to revise three previously published RusV genomes from two encephalitic animals and a wild yellow-necked field mouse. Virus-to-host sequence ratio and thereby sequence coverage improved markedly using the enrichment method as compared to standard procedures. When comparing the newly generated RusV sequences to the previously published RusV genomes, we identified a previously undetected stretch of 309 nucleotides predicted to represent the intergenic region and the sequence encoding the N-terminus of the capsid protein. This indicated that the original RusV sequence was likely incomplete due to misassembly of the genome at a region with an exceptionally high G+C content of >80 mol%, which could not be resolved even by enormous sequencing efforts with standard methods. The updated capsid protein amino acid sequence now resembles those of RuV and ruhugu virus in size and harbours a predicted RNA binding domain that was not encoded in the original RusV genome version. The new sequence data indicate that RusV has the largest overall genome (9,631 nucleotides), intergenic region (290 nucleotides) and capsid protein-encoding sequence (331 codons) within the genus Rubivirus.

[1]  A. Pic-Taylor,et al.  Insights into how environment shapes post-mortem RNA transcription in mouse brain , 2021, Scientific Reports.

[2]  P. Das,et al.  The Enigmatic Capsid Protein of an Encephalitic Rubivirus , 2021, Journal of Virology.

[3]  D. Höper,et al.  A theoretical and generalized approach for the assessment of the sample-specific limit of detection for clinical metagenomics , 2020, Computational and structural biotechnology journal.

[4]  M. Beer,et al.  Author Correction: Relatives of rubella virus in diverse mammals , 2020, Nature.

[5]  M. Beer,et al.  Next-generation diagnostics: virus capture facilitates a sensitive viral diagnosis for epizootic and zoonotic pathogens including SARS-CoV-2 , 2020, Microbiome.

[6]  L. Hansen,et al.  GC bias affects genomic and metagenomic reconstructions, underrepresenting GC-poor organisms , 2020, GigaScience.

[7]  Leonie F. Forth,et al.  Highly efficient library preparation for Ion Torrent sequencing using Y-adapters. , 2019, BioTechniques.

[8]  Martin Beer,et al.  A Versatile Sample Processing Workflow for Metagenomic Pathogen Detection , 2018, Scientific Reports.

[9]  Shanrong Zhao,et al.  Evaluation of two main RNA-seq approaches for gene quantification in clinical RNA sequencing: polyA+ selection versus rRNA depletion , 2018, Scientific Reports.

[10]  Hayden C. Metsky,et al.  Rapid Detection of Powassan Virus in a Patient With Encephalitis by Metagenomic Sequencing , 2017, Clinical infectious diseases : an official publication of the Infectious Diseases Society of America.

[11]  Karthik Gangavarapu,et al.  Genome sequencing reveals Zika virus diversity and spread in the Americas , 2017, bioRxiv.

[12]  M. Kubista,et al.  Effects of post-mortem and physical degradation on RNA integrity and quality , 2015, Biomolecular detection and quantification.

[13]  Tarjei S Mikkelsen,et al.  Enhanced methods for unbiased deep sequencing of Lassa and Ebola RNA viruses from clinical and biological samples , 2014, Genome Biology.

[14]  M. Rossmann,et al.  Rubella virus capsid protein structure and its role in virus assembly and infection , 2013, Proceedings of the National Academy of Sciences.

[15]  Tsunglin Liu,et al.  Effects of GC Bias in Next-Generation-Sequencing Data on De Novo Genome Assembly , 2013, PloS one.

[16]  K. Katoh,et al.  MAFFT Multiple Sequence Alignment Software Version 7: Improvements in Performance and Usability , 2013, Molecular biology and evolution.

[17]  H. Swerdlow,et al.  A tale of three next generation sequencing platforms: comparison of Ion Torrent, Pacific Biosciences and Illumina MiSeq sequencers , 2012, BMC Genomics.

[18]  Sergey I. Nikolenko,et al.  SPAdes: A New Genome Assembly Algorithm and Its Applications to Single-Cell Sequencing , 2012, J. Comput. Biol..

[19]  Y. Benjamini,et al.  Summarizing and correcting the GC content bias in high-throughput sequencing , 2012, Nucleic acids research.

[20]  Songnian Hu,et al.  A comparison between ribo-minus RNA-sequencing and polyA-selected RNA-sequencing. , 2010, Genomics.

[21]  Paramvir S. Dehal,et al.  FastTree 2 – Approximately Maximum-Likelihood Trees for Large Alignments , 2010, PloS one.

[22]  Gonçalo R. Abecasis,et al.  The Sequence Alignment/Map format and SAMtools , 2009, Bioinform..

[23]  Juliane C. Dohm,et al.  Substantial biases in ultra-short read data sets from high-throughput DNA sequencing , 2008, Nucleic acids research.

[24]  M. Bauer,et al.  Quantification of mRNA degradation as possible indicator of postmortem interval--a pilot study. , 2003, Legal medicine.

[25]  T. Frey,et al.  Mapping the Rubella Virus Subgenomic Promoter , 2002, Journal of Virology.

[26]  P. Chong,et al.  Identification of domains in rubella virus genomic RNA and capsid protein necessary for specific interaction , 1996, Journal of virology.

[27]  D. Jarvis,et al.  Translocation and cleavage of rubella virus envelope glycoproteins: identification and role of the E2 signal sequence. , 1990, The Journal of general virology.

[28]  M. Suomalainen,et al.  The E2 signal sequence of rubella virus remains part of the capsid protein and confers membrane association in vitro , 1990, Journal of virology.

[29]  T. Frey,et al.  Sequence of the genome RNA of rubella virus: Evidence for genetic rearrangement during togavirus evolution☆ , 1990, Virology.

[30]  C. Oker-Blom,et al.  The gene order for rubella virus structural proteins is NH2-C-E2-E1-COOH , 1984, Journal of virology.

[31]  R. Pettersson,et al.  Rubella virus 40S genome RNA specifies a 24S subgenomic mRNA that codes for a precursor to structural proteins , 1984, Journal of virology.