Leptospira transcriptome sequencing using long-read technology reveals unannotated transcripts and potential polyadenylation of mRNA molecules

Leptospirosis caused by the spirochete bacteria Leptospira is an emerging zoonosis that causes life-threatening disease in humans and animals. However, a detailed understanding of Leptospira’s RNA profiles is limited. In this study, we sequenced and analyzed the transcriptome of multiple Leptospira serovars using Oxford Nanopore Technologies’ direct cDNA and direct RNA sequencing methods. We identified several operon transcriptional units, novel RNA coding regions, and evidence of potential posttranscriptional polyadenylation in the Leptospira transcriptome. Some coding regions of multiple RNA molecules that have not been previously annotated could be potential sRNA or ncRNA molecules that support gene expression regulation purposes in Leptospira. Many of the relative positions of these unannotated RNA coding regions were consistent in their neighboring coding regions across the reference genomes of two pathogenic Leptospira analyzed in this study. Besides, the majority of the unannotated coding regions and operon transcriptional units were not detected in the nonpathogenic Leptospira, suggesting potential virulence-related functions for these RNA molecules coding regions. Overall, our study confirms the utility of ONT’s sequencing in studying prokaryotic transcriptome profiles and offers a tool to improve our understanding of the structural composition of RNA molecules and prokaryotic polyadenylation. However, the findings from our study also warrant that the presence of homopolymers of adenine bases in the transcripts may interfere with the interpretation of bacterial transcriptome profiles. Carefully designed experiments are needed to unravel the role of the features described in this study in Leptospira virulence and pathogenesis. Author Summary Leptospirosis caused by the spirochete bacteria, Leptospira, is recognized as one of the most widespread zoonotic diseases. Leptospirosis is a neglected disease and has been estimated to cause over one million annual human clinical cases and over 60,000 deaths. Leptospira are maintained in the renal tubules of asymptomatic animal reservoirs, and in contaminated environments, such as soil and water. Studying gene expression profiles is an important component of bacterial pathogenesis studies, and a number of methods are available to do it. Our study compared various transcriptome sequencing methods using one of the latest next generation sequencing tools (Oxford Nanopore Technologies) and evaluated the compositions of RNA in different Leptospira’s transcriptomes. We identified many previously undescribed and potentially significant features within the Leptospira transcriptome that may contribute to the virulence and pathogenesis of Leptospira sp. Our findings lead to new opportunities for researchers to sequence prokaryotic RNA molecules and to unravel many regulatory mechanisms.

[1]  Joshua A. Arribere,et al.  Poly(a) selection introduces bias and undue noise in direct RNA-sequencing , 2022, BMC genomics.

[2]  Dina Grohmann,et al.  Nanopore sequencing of RNA and cDNA molecules in Escherichia coli , 2021, RNA.

[3]  Paul Theodor Pyl,et al.  Analysing high-throughput sequencing data in Python with HTSeq 2.0 , 2021, Bioinform..

[4]  S. R. Kushner,et al.  Regulation of mRNA decay in E. coli , 2021, Critical reviews in biochemistry and molecular biology.

[5]  E. Putz,et al.  Distinct transcriptional profiles of Leptospira borgpetersenii serovar Hardjo strains JB197 and HB203 cultured at different temperatures , 2021, PLoS neglected tropical diseases.

[6]  George H. Jones Acquisition of pcnB [poly(A) polymerase I] genes via horizontal transfer from the β, γ-Proteobacteria , 2021, Microbial genomics.

[7]  M. Rodnina,et al.  Translational Control by Ribosome Pausing in Bacteria: How a Non-uniform Pace of Translation Affects Protein Production and Folding , 2021, Frontiers in Microbiology.

[8]  Eric P. Nawrocki,et al.  Rfam 14: expanded coverage of metagenomic, viral and microRNA families , 2020, Nucleic Acids Res..

[9]  Chenhao Li,et al.  Determination of isoform-specific RNA structure with nanopore long reads , 2020, Nature Biotechnology.

[10]  O. Amster-Choder,et al.  RNA localization in prokaryotes: Where, when, how, and why , 2020, Wiley interdisciplinary reviews. RNA.

[11]  Jiannis Ragoussis,et al.  Methodologies for Transcript Profiling Using Long-Read Technologies , 2020, Frontiers in Genetics.

[12]  M. Ibba,et al.  Translational regulation of environmental adaptation in bacteria , 2020, The Journal of Biological Chemistry.

[13]  Christopher J. Tonkin,et al.  Direct Nanopore Sequencing of mRNA Reveals Landscape of Transcript Isoforms in Apicomplexan Parasites , 2020, mSystems.

[14]  Michael P. Snyder,et al.  Long-read assays shed new light on the transcriptome complexity of a viral pathogen , 2020, Scientific Reports.

[15]  C. Goarant,et al.  A systematic review of Leptospira in water and soil environments , 2020, PloS one.

[16]  Charles Cole,et al.  Realizing the potential of full-length transcriptome sequencing , 2019, Philosophical Transactions of the Royal Society B.

[17]  Kanae Shiokawa,et al.  Leptospira infection in rats: A literature review of global prevalence and distribution , 2019, PLoS neglected tropical diseases.

[18]  A. T. Vincent,et al.  Revisiting the taxonomy and evolution of pathogenicity of the genus Leptospira through the prism of genomics , 2019, PLoS neglected tropical diseases.

[19]  S. Brisse,et al.  Genus-wide Leptospira core genome multilocus sequence typing for strain taxonomy and global surveillance , 2019, PLoS neglected tropical diseases.

[20]  Kornel Labun,et al.  tailfindr: alignment-free poly(A) length measurement for Oxford Nanopore RNA and DNA sequencing , 2019, bioRxiv.

[21]  T. Derrien,et al.  Transcriptome profiling of mouse samples using nanopore sequencing of cDNA and RNA molecules , 2019, Scientific Reports.

[22]  A. Pasquinelli,et al.  Tales of Detailed Poly(A) Tails. , 2019, Trends in cell biology.

[23]  Angela N. Brooks,et al.  Nanopore native RNA sequencing of a human poly(A) transcriptome , 2018, bioRxiv.

[24]  V. Kaberdin,et al.  RNA polyadenylation and its consequences in prokaryotes , 2018, Philosophical Transactions of the Royal Society B: Biological Sciences.

[25]  W. Wang,et al.  MinIONQC: fast and simple quality control for MinION sequencing data , 2018, Bioinform..

[26]  Heng Li,et al.  Minimap2: pairwise alignment for nucleotide sequences , 2017, Bioinform..

[27]  A. Serganov,et al.  A Novel RNA Phosphorylation State Enables 5' End-Dependent Degradation in Escherichia coli. , 2017, Molecular cell.

[28]  Winston Timp,et al.  Detecting DNA cytosine methylation using nanopore sequencing , 2017, Nature Methods.

[29]  J. Coppee,et al.  Genome-Wide Transcriptional Start Site Mapping and sRNA Identification in the Pathogen Leptospira interrogans , 2017, Front. Cell. Infect. Microbiol..

[30]  Minjun Yang,et al.  Whole genome sequencing revealed host adaptation-focused genomic plasticity of pathogenic Leptospira , 2016, Scientific Reports.

[31]  N. Loman,et al.  A complete bacterial genome assembled de novo using only nanopore sequencing data , 2015, Nature Methods.

[32]  Alexander F. Palazzo,et al.  Non-coding RNA: what is functional and what is junk? , 2015, Front. Genet..

[33]  K. Hokamp,et al.  A Model System for Studying the Transcriptomic and Physiological Changes Associated with Mammalian Host-Adaptation by Leptospira interrogans Serovar Copenhageni , 2014, PLoS pathogens.

[34]  Byung-Kwan Cho,et al.  Current Challenges in Bacterial Transcriptomics , 2013, Genomics & informatics.

[35]  S. R. Kushner,et al.  Polyadenylation helps regulate functional tRNA levels in Escherichia coli , 2012, Nucleic acids research.

[36]  A. J. Carpousis,et al.  A tale of two mRNA degradation pathways mediated by RNase E , 2011, Molecular microbiology.

[37]  T. Fennell,et al.  Analyzing and minimizing PCR amplification bias in Illumina sequencing libraries , 2011, Genome Biology.

[38]  Aaron R. Quinlan,et al.  Bioinformatics Applications Note Genome Analysis Bedtools: a Flexible Suite of Utilities for Comparing Genomic Features , 2022 .

[39]  Gonçalo R. Abecasis,et al.  The Sequence Alignment/Map format and SAMtools , 2009, Bioinform..

[40]  M. Irimia,et al.  When good transcripts go bad: artifactual RT-PCR 'splicing' and genome analysis. , 2008, BioEssays : news and reviews in molecular, cellular and developmental biology.

[41]  F. Haddad,et al.  Potential pitfalls in the accuracy of analysis of natural sense-antisense RNA pairs by reverse transcription-PCR , 2007, BMC biotechnology.

[42]  F. Repoila,et al.  Small noncoding RNAs controlling pathogenesis. , 2007, Current opinion in microbiology.

[43]  S. R. Kushner,et al.  The majority of Escherichia coli mRNAs undergo post-transcriptional modification in exponentially growing cells , 2006, Nucleic acids research.

[44]  T. Gingeras,et al.  Prokaryotic RNA preparation methods useful for high density array analysis: comparison of two approaches. , 2001, Nucleic acids research.

[45]  B. Adler,et al.  Leptospira and Leptospirosis , 2001, Current Topics in Microbiology and Immunology.

[46]  S. R. Kushner,et al.  Polyadenylylation helps regulate mRNA decay in Escherichia coli. , 1995, Proceedings of the National Academy of Sciences of the United States of America.

[47]  G. Cao,et al.  Identification of the gene for an Escherichia coli poly(A) polymerase. , 1992, Proceedings of the National Academy of Sciences of the United States of America.

[48]  M. Picardeau Toolbox of Molecular Techniques for Studying Leptospira Spp. , 2018, Current topics in microbiology and immunology.

[49]  D. Bulach,et al.  Leptospiral Genomics and Pathogenesis. , 2018, Current topics in microbiology and immunology.

[50]  B. Adler Spirochete Biology: The Post Genomic Era , 2018, Current Topics in Microbiology and Immunology.

[51]  R Core Team,et al.  R: A language and environment for statistical computing. , 2014 .

[52]  R. Sorek,et al.  Prokaryotic transcriptomics: a new view on regulation, physiology and pathogenicity , 2010, Nature Reviews Genetics.

[53]  Darren Swanson,et al.  Adaptation in a , 2003 .

[54]  N. Sarkar,et al.  Polyadenylation of mRNA in prokaryotes. , 1997, Annual review of biochemistry.