In silico λ-dynamics predicts protein binding specificities to modified RNAs

RNA modifications shape gene expression through a smorgasbord of chemical changes to canonical RNA bases. Although numbering in the hundreds, only a few RNA modifications are well characterized, in part due to the absence of methods to identify modification sites. Antibodies remain a common tool to identify modified RNA and infer modification sites through straightforward applications. However, specificity issues can result in off-target binding and confound conclusions. This work utilizes in silico λ-dynamics to efficiently estimate binding free energy differences of modification-targeting antibodies between a variety of naturally occurring RNA modifications. Crystal structures of inosine and N6-methyladenosine (m6A) targeting antibodies bound to their modified ribonucleosides were determined and served as structural starting points. λ-Dynamics was utilized to predict RNA modifications that permit or inhibit binding to these antibodies. In vitro RNA-antibody binding assays supported the accuracy of these in silico results. High agreement between experimental and computed binding propensities demonstrated that λ-dynamics can serve as a predictive screen for antibody specificity against libraries of RNA modifications. More importantly, this strategy is an innovative way to elucidate how hundreds of known RNA modifications interact with biological molecules without the limitations imposed by in vitro or in vivo methodologies.

[1]  Guan-Zheng Luo,et al.  Systematic comparison of tools used for m6A mapping from nanopore direct RNA sequencing , 2023, Nature Communications.

[2]  Jonah Z. Vilseck,et al.  Identification of nonhistone substrates of the lysine methyltransferase PRDM9 , 2023, The Journal of biological chemistry.

[3]  Jonah Z. Vilseck,et al.  Optimizing Multisite λ-Dynamics Throughput with Charge Renormalization , 2022, J. Chem. Inf. Model..

[4]  Ploy N. Pratanwanich,et al.  Detection of m6A from direct RNA sequencing using a multiple instance learning framework , 2021, bioRxiv.

[5]  Ploy N. Pratanwanich,et al.  Identification of differential RNA modifications from nanopore direct RNA sequencing with xPore , 2021, Nature Biotechnology.

[6]  Ryan L. Hayes,et al.  A strategy for proline and glycine mutations to proteins with alchemical free energy calculations , 2021, J. Comput. Chem..

[7]  A. Reddy,et al.  Quantitative profiling of N6-methyladenosine at single-base resolution in stem-differentiating xylem of Populus trichocarpa using Nanopore direct RNA sequencing , 2021, Genome Biology.

[8]  Jonah Z. Vilseck,et al.  Mutant thermal proteome profiling for characterization of missense protein variants and their associated phenotypes within the proteome , 2020, The Journal of Biological Chemistry.

[9]  Anton J. Enright,et al.  RNA modifications detection by comparative Nanopore direct RNA sequencing , 2019, Nature Communications.

[10]  Christopher J. Williams,et al.  Macromolecular structure determination using X-rays, neutrons and electrons: recent developments in Phenix , 2019, Acta crystallographica. Section D, Structural biology.

[11]  Jonah Z. Vilseck,et al.  Overcoming Challenging Substituent Perturbations with Multisite λ-Dynamics: A Case Study Targeting β-Secretase 1. , 2019, The journal of physical chemistry letters.

[12]  Xing Li,et al.  Antibody cross-reactivity accounts for widespread appearance of m1A in 5’UTRs , 2019, Nature Communications.

[13]  Schraga Schwartz,et al.  Accurate detection of m6A RNA modifications in native RNA sequences , 2019, Nature Communications.

[14]  I. Nookaew,et al.  Decoding the epitranscriptional landscape from native RNA sequences , 2018, bioRxiv.

[15]  Jonah Z. Vilseck,et al.  Approaching protein design with multisite λ dynamics: Accurate and scalable mutational folding free energies in T4 lysozyme , 2018, Protein science : a publication of the Protein Society.

[16]  Jonah Z. Vilseck,et al.  Predicting Binding Free Energies in a Large Combinatorial Chemical Space Using Multisite λ Dynamics. , 2018, The journal of physical chemistry letters.

[17]  Christopher J. Williams,et al.  MolProbity: More and better reference data for improved all‐atom structure validation , 2018, Protein science : a publication of the Protein Society.

[18]  B. L. de Groot,et al.  Alchemical Free Energy Calculations for Nucleotide Mutations in Protein-DNA Complexes. , 2017, Journal of chemical theory and computation.

[19]  Schraga Schwartz,et al.  The m1A landscape on cytosolic and mitochondrial mRNA at single-base resolution , 2017, Nature.

[20]  Jonah Z. Vilseck,et al.  Adaptive Landscape Flattening Accelerates Sampling of Alchemical Space in Multisite λ Dynamics. , 2017, The journal of physical chemistry. B.

[21]  Laura Pérez-Benito,et al.  Acylguanidine Beta Secretase 1 Inhibitors: A Combined Experimental and Free Energy Perturbation Study. , 2017, Journal of chemical theory and computation.

[22]  R. Micura,et al.  Distinct 5-methylcytosine profiles in poly(A) RNA from mouse embryonic stem cells and brain , 2017, Genome Biology.

[23]  B. L. de Groot,et al.  CHARMM36m: an improved force field for folded and intrinsically disordered proteins , 2016, Nature Methods.

[24]  Jana K. Shen,et al.  All-Atom Continuous Constant pH Molecular Dynamics With Particle Mesh Ewald and Titratable Water. , 2016, Journal of chemical theory and computation.

[25]  Chengqi Yi,et al.  Transcriptome-wide mapping reveals reversible and dynamic N(1)-methyladenosine methylome. , 2016, Nature chemical biology.

[26]  Benedict Leimkuhler,et al.  Efficient molecular dynamics using geodesic integration and solvent–solute splitting , 2016, Proceedings of the Royal Society A: Mathematical, Physical and Engineering Sciences.

[27]  Gideon Rechavi,et al.  The dynamic N1-methyladenosine methylome in eukaryotic messenger RNA , 2016, Nature.

[28]  Alexander D. MacKerell,et al.  Additive CHARMM force field for naturally occurring modified ribonucleotides , 2016, J. Comput. Chem..

[29]  Fei Wang,et al.  Transcriptome-wide distribution and function of RNA hydroxymethylcytosine , 2016, Science.

[30]  Matthieu Schapira,et al.  Structural Basis for the Discriminative Recognition of N6-Methyladenosine RNA by the Human YT521-B Homology Domain Family of Proteins* , 2015, The Journal of Biological Chemistry.

[31]  W. L. Jorgensen,et al.  Picomolar Inhibitors of HIV-1 Reverse Transcriptase: Design and Crystallography of Naphthyl Phenyl Ethers. , 2014, ACS medicinal chemistry letters.

[32]  William L Jorgensen,et al.  Picomolar inhibitors of HIV reverse transcriptase featuring bicyclic replacement of a cyanovinylphenyl group. , 2013, Journal of the American Chemical Society.

[33]  Alexander D. MacKerell,et al.  Optimization of the additive CHARMM all-atom protein force field targeting improved sampling of the backbone φ, ψ and side-chain χ(1) and χ(2) dihedral angles. , 2012, Journal of chemical theory and computation.

[34]  M. Kupiec,et al.  Topology of the human and mouse m6A RNA methylomes revealed by m6A-seq , 2012, Nature.

[35]  T. Clark,et al.  Predicting the effects of basepair mutations in DNA-protein complexes by thermodynamic integration. , 2011, Biophysical journal.

[36]  Alexander D. MacKerell,et al.  Impact of 2′‐hydroxyl sampling on the conformational properties of RNA: Update of the CHARMM all‐atom additive force field for RNA , 2011, J. Comput. Chem..

[37]  S. Fanning,et al.  An anti‐hapten camelid antibody reveals a cryptic binding site with significant energetic contributions from a nonhypervariable loop , 2011, Protein science : a publication of the Protein Society.

[38]  Jan H. Jensen,et al.  Improved Treatment of Ligands and Coupling Effects in Empirical Calculation and Rationalization of pKa Values. , 2011, Journal of chemical theory and computation.

[39]  David L Mobley,et al.  Alchemical free energy methods for drug discovery: progress and challenges. , 2011, Current opinion in structural biology.

[40]  Jan H. Jensen,et al.  PROPKA3: Consistent Treatment of Internal and Surface Residues in Empirical pKa Predictions. , 2011, Journal of chemical theory and computation.

[41]  V. Bafna,et al.  Template Proteogenomics: Sequencing Whole Proteins Using an Imperfect Database* , 2010, Molecular & Cellular Proteomics.

[42]  Jennifer L. Knight,et al.  λ‐Dynamics free energy simulation methods , 2009, J. Comput. Chem..

[43]  Jianpeng Ma,et al.  CHARMM: The biomolecular simulation program , 2009, J. Comput. Chem..

[44]  P. Pevzner,et al.  Automated de novo protein sequencing of monoclonal antibodies , 2008, Nature Biotechnology.

[45]  Taehoon Kim,et al.  CHARMM‐GUI: A web‐based graphical user interface for CHARMM , 2008, J. Comput. Chem..

[46]  Alexander D. MacKerell,et al.  Extending the treatment of backbone energetics in protein force fields: Limitations of gas‐phase quantum mechanics in reproducing protein conformational distributions in molecular dynamics simulations , 2004, J. Comput. Chem..

[47]  J. Åqvist,et al.  Molecular Dynamics Simulations of Water and Biomolecules with a Monte Carlo Constant Pressure Algorithm , 2004 .

[48]  Hung Lee,et al.  Emerging trends in the synthesis and improvement of hapten-specific recombinant antibodies. , 2003, Biotechnology advances.

[49]  P. Hünenberger,et al.  A fast SHAKE algorithm to solve distance constraint equations for small molecules in molecular dynamics simulations , 2001, J. Comput. Chem..

[50]  Charles L. Brooks,et al.  λ‐dynamics: A new approach to free energy calculations , 1996 .

[51]  T. Darden,et al.  A smooth particle mesh Ewald method , 1995 .

[52]  D. Ferguson,et al.  Isothermal-isobaric molecular dynamics simulations with Monte Carlo volume sampling , 1995 .

[53]  Y. Li,et al.  Preparation, characterization and crystallization of an antibody Fab fragment that recognizes RNA. Crystal structures of native Fab and three Fab-mononucleotide complexes. , 1995, Journal of molecular biology.

[54]  Bernard R. Brooks,et al.  New spherical‐cutoff methods for long‐range forces in macromolecular simulation , 1994, J. Comput. Chem..

[55]  Peter A. Kollman,et al.  FREE ENERGY CALCULATIONS : APPLICATIONS TO CHEMICAL AND BIOCHEMICAL PHENOMENA , 1993 .

[56]  T. Darden,et al.  Particle mesh Ewald: An N⋅log(N) method for Ewald sums in large systems , 1993 .

[57]  R. Swendsen,et al.  THE weighted histogram analysis method for free‐energy calculations on biomolecules. I. The method , 1992 .

[58]  H. C. Andersen Rattle: A “velocity” version of the shake algorithm for molecular dynamics calculations , 1983 .

[59]  W. L. Jorgensen,et al.  Comparison of simple potential functions for simulating liquid water , 1983 .

[60]  M. Karplus,et al.  CHARMM: A program for macromolecular energy, minimization, and dynamics calculations , 1983 .

[61]  H. Berendsen,et al.  ALGORITHMS FOR MACROMOLECULAR DYNAMICS AND CONSTRAINT DYNAMICS , 1977 .

[62]  S. Cohen,et al.  Antibody structure , 2006 .

[63]  L. Trudel,et al.  Monoclonal versus polyclonal antibodies: distinguishing characteristics, applications, and information resources. , 2005, ILAR journal.

[64]  Alexander D. MacKerell,et al.  All-atom empirical force field for nucleic acids: I. Parameter optimization based on small molecule and condensed phase macromolecular target data , 2000, J. Comput. Chem..