Structure alignment-based classification of RNA-binding pockets reveals regional RNA recognition motifs on protein surfaces

BackgroundMany critical biological processes are strongly related to protein-RNA interactions. Revealing the protein structure motifs for RNA-binding will provide valuable information for deciphering protein-RNA recognition mechanisms and benefit complementary structural design in bioengineering. RNA-binding events often take place at pockets on protein surfaces. The structural classification of local binding pockets determines the major patterns of RNA recognition.ResultsIn this work, we provide a novel framework for systematically identifying the structure motifs of protein-RNA binding sites in the form of pockets on regional protein surfaces via a structure alignment-based method. We first construct a similarity network of RNA-binding pockets based on a non-sequential-order structure alignment method for local structure alignment. By using network community decomposition, the RNA-binding pockets on protein surfaces are clustered into groups with structural similarity. With a multiple structure alignment strategy, the consensus RNA-binding pockets in each group are identified. The crucial recognition patterns, as well as the protein-RNA binding motifs, are then identified and analyzed.ConclusionsLarge-scale RNA-binding pockets on protein surfaces are grouped by measuring their structural similarities. This similarity network-based framework provides a convenient method for modeling the structural relationships of functional pockets. The local structural patterns identified serve as structure motifs for the recognition with RNA on protein surfaces.

[1]  R. Bahadur,et al.  Hydration of protein–RNA recognition sites , 2014, Nucleic acids research.

[2]  Zhi-Ping Liu,et al.  Predicting gene ontology functions from protein's regional surface structures , 2007, BMC Bioinformatics.

[3]  Jae-Hyung Lee,et al.  RNABindR: a server for analyzing and predicting RNA-binding sites in proteins , 2007, Nucleic Acids Res..

[4]  Luonan Chen,et al.  Revealing divergent evolution, identifying circular permutations and detecting active-sites by protein structure comparison , 2006, BMC Structural Biology.

[5]  Yael Mandel-Gutfreund,et al.  RBPmap: a web server for mapping binding sites of RNA-binding proteins , 2014, Nucleic Acids Res..

[6]  T. Glisovic,et al.  RNA‐binding proteins and post‐transcriptional gene regulation , 2008, FEBS letters.

[7]  Howard Y. Chang,et al.  Structural imprints in vivo decode RNA regulatory mechanisms , 2015, Nature.

[8]  Mohsen Khorshid,et al.  CLIPZ: a database and analysis environment for experimentally determined binding sites of RNA-binding proteins , 2010, Nucleic Acids Res..

[9]  E. Skordalakes,et al.  Structure of the RNA-binding domain of telomerase: implications for RNA recognition and binding. , 2007, Structure.

[10]  Jonathan J. Ellis,et al.  Protein–RNA interactions: Structural analysis and functional classes , 2006, Proteins.

[11]  Jian Zhu,et al.  Systematic identification of transcriptional and post-transcriptional regulations in human respiratory epithelial cells during influenza A virus infection , 2014, BMC Bioinformatics.

[12]  Hongyu Miao,et al.  Prediction of protein-RNA interactions using sequence and structure descriptors , 2016, Neurocomputing.

[13]  M. Wickens,et al.  A 5′ cytosine binding pocket in Puf3p specifies regulation of mitochondrial mRNAs , 2009, Proceedings of the National Academy of Sciences.

[14]  Patrice Koehl,et al.  The ASTRAL compendium for protein structure and sequence analysis , 2000, Nucleic Acids Res..

[15]  Zhi-Ping Liu,et al.  Prediction of protein-RNA binding sites by a random forest method with combined features , 2010, Bioinform..

[16]  Zhiping Weng,et al.  FAST: A novel protein structure alignment algorithm , 2004, Proteins.

[17]  Daniel Herschlag,et al.  Diverse RNA-Binding Proteins Interact with Functionally Related Sets of RNAs, Suggesting an Extensive Regulatory System , 2008, PLoS biology.

[18]  T. Hall Expanding the RNA-recognition code of PUF proteins , 2014, Nature Structural &Molecular Biology.

[19]  K Henrick,et al.  Electronic Reprint Biological Crystallography Secondary-structure Matching (ssm), a New Tool for Fast Protein Structure Alignment in Three Dimensions Biological Crystallography Secondary-structure Matching (ssm), a New Tool for Fast Protein Structure Alignment in Three Dimensions , 2022 .

[20]  Y. Shamoo,et al.  Structure-based analysis of protein-RNA interactions using the program ENTANGLE. , 2001, Journal of molecular biology.

[21]  Y. Wang,et al.  PRINTR: Prediction of RNA binding sites in proteins using SVM and profiles , 2008, Amino Acids.

[22]  Robert D. Finn,et al.  The Pfam protein families database: towards a more sustainable future , 2015, Nucleic Acids Res..

[23]  T. N. Bhat,et al.  The Protein Data Bank , 2000, Nucleic Acids Res..

[24]  S. Jones,et al.  Protein-RNA interactions: a structural analysis. , 2001, Nucleic acids research.

[25]  Kevin Struhl,et al.  Transcriptome-scale RNase-footprinting of RNA-protein complexes , 2015, Nature Biotechnology.

[26]  Brad T. Sherman,et al.  Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources , 2008, Nature Protocols.

[27]  M. Ashburner,et al.  Gene Ontology: tool for the unification of biology , 2000, Nature Genetics.

[28]  Lourdes Peña Castillo,et al.  Rapid and systematic analysis of the RNA recognition specificities of RNA-binding proteins , 2009, Nature Biotechnology.

[29]  M. Newman,et al.  Finding community structure in very large networks. , 2004, Physical review. E, Statistical, nonlinear, and soft matter physics.

[30]  Christopher R. Sibley,et al.  iCLIP: Protein–RNA interactions at nucleotide resolution , 2014, Methods.

[31]  Nikolaus Rajewsky,et al.  Competition between target sites of regulators shapes post-transcriptional gene regulation , 2014, Nature Reviews Genetics.

[32]  Wei Wu,et al.  NPInter v3.0: an upgraded database of noncoding RNA-associated interactions , 2016, Database J. Biol. Databases Curation.

[33]  Canglin Wu,et al.  RegNetwork: an integrated database of transcriptional and post-transcriptional regulatory networks in human and mouse , 2015, Database J. Biol. Databases Curation.

[34]  S Thirup,et al.  The crystal structure of Cys-tRNACys-EF-Tu-GDPNP reveals general and specific features in the ternary complex and in tRNA. , 1999, Structure.

[35]  Jiehua Zhu,et al.  National Natural Science Foundation of China (NSFC) , 2013 .

[36]  Xiang-Sun Zhang,et al.  Bridging protein local structures and protein functions , 2008, Amino Acids.

[37]  Ruth Nussinov,et al.  Prediction of interacting single-stranded RNA bases by protein-binding patterns. , 2008, Journal of molecular biology.

[38]  M. Kiebler,et al.  Faculty Opinions recommendation of Argonaute HITS-CLIP decodes microRNA-mRNA interaction maps. , 2009 .

[39]  Jie Liang,et al.  CASTp: Computed Atlas of Surface Topography of proteins , 2003, Nucleic Acids Res..

[40]  Anne-Marie Alleaume,et al.  Improved binding site assignment by high-resolution mapping of RNA–protein interactions using iCLIP , 2015, Nature Communications.

[41]  M E J Newman,et al.  Fast algorithm for detecting community structure in networks. , 2003, Physical review. E, Statistical, nonlinear, and soft matter physics.

[42]  Gabriele Varani,et al.  RNA is rarely at a loss for companions; as soon as RNA , 2008 .

[43]  Brendan J. Frey,et al.  A compendium of RNA-binding motifs for decoding gene regulation , 2013, Nature.

[44]  ZVI GALIL,et al.  Efficient algorithms for finding maximum matching in graphs , 1986, CSUR.

[45]  E. Jankowsky,et al.  Specificity and nonspecificity in RNA–protein interactions , 2015, Nature Reviews Molecular Cell Biology.

[46]  S. Gerstberger,et al.  A census of human RNA-binding proteins , 2014, Nature Reviews Genetics.

[47]  U. Hobohm,et al.  Selection of representative protein data sets , 1992, Protein science : a publication of the Protein Society.

[48]  S. Karlin,et al.  Methods for assessing the statistical significance of molecular sequence features by using general scoring schemes. , 1990, Proceedings of the National Academy of Sciences of the United States of America.

[49]  Lin He,et al.  MicroRNAs: small RNAs with a big role in gene regulation , 2004, Nature Reviews Genetics.

[50]  Liangjiang Wang,et al.  BindN: a web-based tool for efficient prediction of DNA and RNA binding sites in amino acid sequences , 2006, Nucleic Acids Res..