A nonredundant structure dataset for benchmarking protein‐RNA computational docking

Protein–RNA interactions play an important role in many biological processes. The ability to predict the molecular structures of protein–RNA complexes from docking would be valuable for understanding the underlying chemical mechanisms. We have developed a novel nonredundant benchmark dataset for protein–RNA docking and scoring. The diverse dataset of 72 targets consists of 52 unbound–unbound test complexes, and 20 unbound–bound test complexes. Here, unbound–unbound complexes refer to cases in which both binding partners of the cocrystallized complex are either in apo form or in a conformation taken from a different protein–RNA complex, whereas unbound–bound complexes are cases in which only one of the two binding partners has another experimentally determined conformation. The dataset is classified into three categories according to the interface root mean square deviation and the percentage of native contacts in the unbound structures: 49 easy, 16 medium, and 7 difficult targets. The bound and unbound cases of the benchmark dataset are expected to benefit the development and improvement of docking and scoring algorithms for the docking community. All the easy‐to‐view structures are freely available to the public at http://zoulab.dalton.missouri.edu/RNAbenchmark/. © 2012 Wiley Periodicals, Inc.

[1]  M. Sternberg,et al.  Prediction of protein-protein interactions by docking methods. , 2002, Current opinion in structural biology.

[2]  Z. Weng,et al.  Protein–protein docking benchmark version 3.0 , 2008, Proteins.

[3]  Alexandre M. J. J. Bonvin,et al.  Pushing the limits of what is achievable in protein–DNA docking: benchmarking HADDOCK’s performance , 2010, Nucleic acids research.

[4]  Z. Weng,et al.  Protein–protein docking benchmark 2.0: An update , 2005, Proteins.

[5]  W. Filipowicz,et al.  Regulation of mRNA translation and stability by microRNAs. , 2010, Annual review of biochemistry.

[6]  Xiaoqin Zou,et al.  Scoring and Lessons Learned with the CSAR Benchmark Using an Improved Iterative Knowledge-Based Scoring Function , 2011, J. Chem. Inf. Model..

[7]  Dominique Douguet,et al.  DOCKGROUND system of databases for protein recognition studies: Unbound structures for docking , 2007, Proteins.

[8]  Kai-Wei Chang,et al.  RNA-binding proteins in human genetic disease. , 2008, Trends in genetics : TIG.

[9]  Xiaoqin Zou,et al.  Advances and Challenges in Protein-Ligand Docking , 2010, International journal of molecular sciences.

[10]  Ruth Nussinov,et al.  Principles of docking: An overview of search algorithms and a guide to scoring functions , 2002, Proteins.

[11]  Z. Weng,et al.  A structure‐based benchmark for protein–protein binding affinity , 2011, Protein science : a publication of the Protein Society.

[12]  Xiaoqin Zou,et al.  Construction and Test of Ligand Decoy Sets Using MDock: Community Structure-Activity Resource Benchmarks for Binding Mode Prediction , 2011, J. Chem. Inf. Model..

[13]  Alexandre M. J. J. Bonvin,et al.  A protein–DNA docking benchmark , 2008, Nucleic acids research.

[14]  J. Janin,et al.  Computer analysis of protein-protein interaction. , 1978, Journal of molecular biology.

[15]  J. Irwin,et al.  Docking and chemoinformatic screens for new ligands and targets. , 2009, Current opinion in biotechnology.

[16]  Jeffrey J. Gray,et al.  High-resolution protein-protein docking. , 2006, Current opinion in structural biology.

[17]  Zhiping Weng,et al.  A protein–protein docking benchmark , 2003, Proteins.

[18]  J. Bajorath,et al.  Docking and scoring in virtual screening for drug discovery: methods and applications , 2004, Nature Reviews Drug Discovery.

[19]  Vijay S. Pande,et al.  A-Site Residues Move Independently from P-Site Residues in all-Atom Molecular Dynamics Simulations of the 70S Bacterial Ribosome , 2012, PloS one.

[20]  Sarath Chandra Janga,et al.  Dissecting the expression dynamics of RNA-binding proteins in posttranscriptional regulatory networks , 2009, Proceedings of the National Academy of Sciences.

[21]  Conrad C. Huang,et al.  UCSF Chimera—A visualization system for exploratory research and analysis , 2004, J. Comput. Chem..

[22]  G. Kapler,et al.  Tetrahymena ORC contains a ribosomal RNA fragment that participates in rDNA origin recognition , 2007, The EMBO journal.

[23]  Alexandre M J J Bonvin,et al.  Flexible protein-protein docking. , 2006, Current opinion in structural biology.

[24]  Emidio Capriotti,et al.  Computational RNA Structure Prediction , 2008 .

[25]  Xiaoqin Zou,et al.  Scoring functions and their evaluation methods for protein-ligand docking: recent advances and future directions. , 2010, Physical chemistry chemical physics : PCCP.

[26]  Matthias Rarey,et al.  Small Molecule Docking and Scoring , 2001 .

[27]  T. N. Bhat,et al.  The Protein Data Bank , 2000, Nucleic Acids Res..

[28]  Eric Westhof,et al.  The non-Watson-Crick base pairs and their associated isostericity matrices. , 2002, Nucleic acids research.

[29]  Ruth Nussinov,et al.  Predicting molecular interactions in silico: II. Protein-protein and protein-drug docking. , 2003 .

[30]  Richard D. Smith,et al.  CSAR Benchmark Exercise of 2010: Selection of the Protein–Ligand Complexes , 2011, J. Chem. Inf. Model..

[31]  Natasja Brooijmans,et al.  Molecular recognition and docking algorithms. , 2003, Annual review of biophysics and biomolecular structure.

[32]  Gabriele Varani,et al.  RNA is rarely at a loss for companions; as soon as RNA , 2008 .

[33]  Thomas L. Madden,et al.  Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. , 1997, Nucleic acids research.

[34]  Sandor Vajda,et al.  CAPRI: A Critical Assessment of PRedicted Interactions , 2003, Proteins.

[35]  J. Irwin,et al.  Lead discovery using molecular docking. , 2002, Current opinion in chemical biology.

[36]  H. Wolfson,et al.  Principles of flexible protein–protein docking , 2008, Proteins.

[37]  Donny D. Licatalosi,et al.  RNA processing and its regulation: global insights into biological networks , 2010, Nature Reviews Genetics.

[38]  Zhiping Weng,et al.  Protein–protein docking benchmark version 4.0 , 2010, Proteins.

[39]  J. Keene,et al.  The ribonome: a dominant force in co‐ordinating gene expression , 2009, Biology of the cell.

[40]  Brian K. Shoichet,et al.  ZINC - A Free Database of Commercially Available Compounds for Virtual Screening , 2005, J. Chem. Inf. Model..

[41]  Z. Lorković,et al.  Role of plant RNA-binding proteins in development, stress response and genome organization. , 2009, Trends in plant science.

[42]  S. Wodak,et al.  Assessment of CAPRI predictions in rounds 3–5 shows progress in docking procedures , 2005, Proteins.

[43]  Marc F Lensink,et al.  Docking and scoring protein interactions: CAPRI 2009 , 2010, Proteins.

[44]  Xiaoqin Zou,et al.  MDockPP: A hierarchical approach for protein‐protein docking and its application to CAPRI rounds 15–19 , 2010, Proteins.

[45]  Pedro Alexandrino Fernandes,et al.  Protein–ligand docking: Current status and future challenges , 2006, Proteins.

[46]  Daniel Herschlag,et al.  Diverse RNA-Binding Proteins Interact with Functionally Related Sets of RNAs, Suggesting an Extensive Regulatory System , 2008, PLoS biology.