PIANO: A Web Server for Pseudouridine-Site (Ψ) Identification and Functional Annotation

Known as the “fifth RNA nucleotide”, pseudouridine (Ψ or psi) is the first-discovered and most abundant RNA modification occurring at the Uridine site, and it plays a prominent role in a number of biological processes. Thousands of Ψ sites have been identified within different biological contexts thanks to the advancement in high-throughput sequencing technology; nevertheless, the transcriptome-wide distribution, biomolecular functions, regulatory mechanisms, and disease relevance of pseudouridylation are largely elusive. We report here a web server—PIANO—for p seudouridine site (Ψ) i dentification a nd fu n ctional ann o tation. PIANO was built upon a high-accuracy predictor that takes advantage of both conventional sequence features and 42 additional genomic features. When tested on six independent datasets generated from four independent Ψ-profiling technologies (Ψ-seq, RBS-seq, Pseudo-seq, and CeU-seq) as benchmarks, PIANO achieved an average AUC of 0.955 and 0.838 under the full transcript and mature mRNA models, respectively, marking a substantial improvement in accuracy compared to the existing in silico Ψ-site prediction methods, i.e., PPUS (0.713 and 0.707), iRNA-PseU (0.713 and 0.712), and PseUI (0.634 and 0.652). Besides, PIANO web server systematically annotates the predicted Ψ sites with post-transcriptional regulatory mechanisms (miRNA-targets, RBP-binding regions, and splicing sites) in its prediction report to help the users explore potential machinery of Ψ. Moreover, a concise query interface was also built for 4,303 known Ψ sites, which is currently the largest collection of experimentally validated human Ψ sites. The PIANO website is freely accessible at: http://piano.rnamd.com.

[1]  Chengqi Yi,et al.  Transcriptome-wide dynamics of RNA pseudouridylation , 2015, Nature Reviews Molecular Cell Biology.

[2]  H. Hakonarson,et al.  ANNOVAR: functional annotation of genetic variants from high-throughput sequencing data , 2010, Nucleic acids research.

[3]  Y. Bykhovskaya,et al.  Missense mutation in pseudouridine synthase 1 (PUS1) causes mitochondrial myopathy and sideroblastic anemia (MLASA). , 2004, American journal of human genetics.

[4]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[5]  L. Montanaro,et al.  RNA Pseudouridylation in Physiology and Medicine: For Better and for Worse , 2017, Genes.

[6]  Xiujuan Lei,et al.  WITMSG: Large-scale Prediction of Human Intronic m6A RNA Methylation Sites from Sequence and Genomic Features , 2020, Current genomics.

[7]  Hao Lin,et al.  XG-PseU: an eXtreme Gradient Boosting based method for identifying pseudouridine sites , 2019, Molecular Genetics and Genomics.

[8]  Yan-Hui Li,et al.  PPUS: a web server to predict PUS-specific pseudouridine sites , 2015, Bioinform..

[9]  W. Gilbert,et al.  Pseudouridine profiling reveals regulated mRNA pseudouridylation in yeast and human cells , 2014, Nature.

[10]  Glen Spraggon,et al.  Crystal structure of human Pus10, a novel pseudouridine synthase. , 2007, Journal of molecular biology.

[11]  Janusz M. Bujnicki,et al.  MODOMICS: a database of RNA modification pathways. 2017 update , 2017, Nucleic Acids Res..

[12]  Data production leads,et al.  An integrated encyclopedia of DNA elements in the human genome , 2012 .

[13]  Davide Ruggero,et al.  rRNA pseudouridylation defects affect ribosomal ligand binding and translational fidelity from yeast to human cells. , 2011, Molecular cell.

[14]  Maryam Zaringhalam,et al.  Pseudouridylation meets next-generation sequencing. , 2016, Methods.

[15]  Peter F. Stadler,et al.  ViennaRNA Package 2.0 , 2011, Algorithms for Molecular Biology.

[16]  Yi-Tao Yu,et al.  Post-transcriptional pseudouridylation in mRNA as well as in some major types of noncoding RNAs. , 2019, Biochimica et biophysica acta. Gene regulatory mechanisms.

[17]  Hsien-Da Huang,et al.  KinasePhos 2.0: a web server for identifying protein kinase-specific phosphorylation sites based on sequences and coupling patterns , 2007, Nucleic Acids Res..

[18]  ENCODEConsortium,et al.  An Integrated Encyclopedia of DNA Elements in the Human Genome , 2012, Nature.

[19]  D. Haussler,et al.  Evolutionarily conserved elements in vertebrate, insect, worm, and yeast genomes. , 2005, Genome research.

[20]  Gang Xu,et al.  POSTAR2: deciphering the post-transcriptional regulatory logics , 2018, Nucleic Acids Res..

[21]  Dong Yue,et al.  Improving performance of mammalian microRNA target prediction , 2010, BMC Bioinformatics.

[22]  Wei Chen,et al.  iRNA-PseU: Identifying RNA pseudouridine sites , 2016, Molecular therapy. Nucleic acids.

[23]  F. Jiang,et al.  Small nucleolar RNA 42 acts as an oncogene in lung tumorigenesis , 2011, Oncogene.

[24]  Gideon Rechavi,et al.  The dynamic N1-methyladenosine methylome in eukaryotic messenger RNA , 2016, Nature.

[25]  Shiqing Ma,et al.  Chemical pulldown reveals dynamic pseudouridylation of the mammalian transcriptome. , 2015, Nature chemical biology.

[26]  Yi Xiong,et al.  PseUI: Pseudouridine sites identification based on RNA sequence information , 2018, BMC Bioinformatics.

[27]  Tony Gutschner,et al.  The Dark Side of the Epitranscriptome: Chemical Modifications in Long Non-Coding RNAs , 2017, International journal of molecular sciences.

[28]  Maxwell R. Mumbach,et al.  Transcriptome-wide Mapping Reveals Widespread Dynamic-Regulated Pseudouridylation of ncRNA and mRNA , 2014, Cell.

[29]  S. L. Davis,et al.  Regulation of nuclear receptor activity by a pseudouridine synthase through posttranscriptional modification of steroid receptor RNA activator. , 2004, Molecular cell.

[30]  Yi-Tao Yu,et al.  Detection and Quantification of Pseudouridine in RNA. , 2018, Methods in molecular biology.

[31]  Samie R Jaffrey,et al.  Rethinking m6A Readers, Writers, and Erasers. , 2017, Annual review of cell and developmental biology.

[32]  V. Ramakrishnan,et al.  Unusual base pairing during the decoding of a stop codon by the ribosome , 2013, Nature.

[33]  Jionglong Su,et al.  WHISTLE: a high-accuracy map of the human N6-methyladenosine (m6A) epitranscriptome predicted using a machine learning approach , 2019, Nucleic acids research.

[34]  Ya-Zhou Sun,et al.  RNA methylation and diseases: experimental results, databases, Web servers and computational models , 2019, Briefings Bioinform..

[35]  Fowzan S Alkuraya,et al.  PUS7 mutations impair pseudouridylation in humans and cause intellectual disability and microcephaly , 2019, Human Genetics.

[36]  John Karijolich,et al.  Modifying the genetic code: Converting nonsense codons into sense codons by targeted pseudouridylation , 2011, Nature.

[37]  Jenny Hansson,et al.  Pseudouridylation of tRNA-Derived Fragments Steers Translational Control in Stem Cells , 2018, Cell.

[38]  Hsien-Da Huang,et al.  miRTarBase update 2018: a resource for experimentally validated microRNA-target interactions , 2017, Nucleic Acids Res..

[39]  Bradley R. Cairns,et al.  Transcriptome-wide profiling of multiple RNA modifications simultaneously at single-base resolution , 2019, Proceedings of the National Academy of Sciences.

[40]  A. Siepel,et al.  Probabilities of Fitness Consequences for Point Mutations Across the Human Genome , 2014, Nature Genetics.

[41]  Yi-Tao Yu,et al.  RNA pseudouridylation: new insights into an old modification. , 2013, Trends in biochemical sciences.

[42]  Shun Liu,et al.  RMBase v2.0: deciphering the map of RNA modifications from epitranscriptome sequencing data , 2017, Nucleic Acids Res..

[43]  P. Brown,et al.  Transcriptome-Wide Mapping of Pseudouridines: Pseudouridine Synthases Modify Specific mRNAs in S. cerevisiae , 2014, PloS one.

[44]  WALDO E. COHN,et al.  Nucleoside-5′-Phosphates from Ribonucleic Acid , 1951, Nature.

[45]  Douglas H. Turner,et al.  The contribution of pseudouridine to stabilities and structure of RNAs , 2013, Nucleic acids research.

[46]  Schraga Schwartz,et al.  TRUB1 is the predominant pseudouridine synthase acting on mammalian mRNA via a predictable and conserved code. , 2017, Genome research.

[47]  Douglas L Black,et al.  m6A mRNA modifications are deposited in nascent pre-mRNA and are not required for splicing but do specify cytoplasmic turnover , 2017, Genes & development.

[48]  Fowzan S. Alkuraya,et al.  A homozygous truncating mutation in PUS3 expands the role of tRNA modification in normal cognition , 2016, Human Genetics.

[49]  J. Chen,et al.  Pseudouridine synthase 3 from mouse modifies the anticodon loop of tRNA. , 2000, Biochemistry.

[50]  Jiangning Song,et al.  Comprehensive review and assessment of computational methods for predicting RNA post-transcriptional modification sites from RNA sequences , 2019, Briefings Bioinform..

[51]  Marcin Feder,et al.  MODOMICS: a database of RNA modification pathways , 2005, Nucleic Acids Res..

[52]  Hui Zhou,et al.  starBase v2.0: decoding miRNA-ceRNA, miRNA-ncRNA and protein–RNA interaction networks from large-scale CLIP-Seq data , 2013, Nucleic Acids Res..

[53]  Robert Gentleman,et al.  Software for Computing and Annotating Genomic Ranges , 2013, PLoS Comput. Biol..

[54]  M. Gevrey,et al.  Review and comparison of methods to study the contribution of variables in artificial neural network models , 2003 .

[55]  Wei Chen,et al.  MethyRNA: a web server for identification of N6-methyladenosine sites , 2017, Journal of biomolecular structure & dynamics.

[56]  Wei Chen,et al.  iRNA-2OM: A Sequence-Based Predictor for Identifying 2′-O-Methylation Sites in Homo sapiens , 2018, J. Comput. Biol..

[57]  D. Bartel,et al.  Predicting effective microRNA target sites in mammalian mRNAs , 2015, eLife.

[58]  Jionglong Su,et al.  Prediction of RNA Methylation Status From Gene Expression Data Using Classification and Regression Methods , 2020, Evolutionary bioinformatics online.