MATCH: An atom‐typing toolset for molecular mechanics force fields

We introduce a toolset of program libraries collectively titled multipurpose atom‐typer for CHARMM (MATCH) for the automated assignment of atom types and force field parameters for molecular mechanics simulation of organic molecules. The toolset includes utilities for the conversion of multiple chemical structure file formats into a molecular graph. A general chemical pattern‐matching engine using this graph has been implemented whereby assignment of molecular mechanics atom types, charges, and force field parameters are achieved by comparison against a customizable list of chemical fragments. While initially designed to complement the CHARMM simulation package and force fields by generating the necessary input topology and atom‐type data files, MATCH can be expanded to any force field and program, and has core functionality that makes it extendable to other applications such as fragment‐based property prediction. In this work, we demonstrate the accurate construction of atomic parameters of molecules within each force field included in CHARMM36 through exhaustive cross validation studies illustrating that bond charge increment rules derived from one force field can be transferred to another. In addition, using leave‐one‐out substitution it is shown that it is also possible to substitute missing intra and intermolecular parameters with ones included in a force field to complete the parameterization of novel molecules. Finally, to demonstrate the robustness of MATCH and the coverage of chemical space offered by the recent CHARMM general force field (Vanommeslaeghe, et al., J Comput Chem 2010, 31, 671), one million molecules from the PubChem database of small molecules are typed, parameterized, and minimized. © 2011 Wiley Periodicals, Inc. J Comput Chem, 2011

[1]  Chris Oostenbrink,et al.  A biomolecular force field based on the free enthalpy of hydration and solvation: The GROMOS force‐field parameter sets 53A5 and 53A6 , 2004, J. Comput. Chem..

[2]  R. Dror,et al.  Long-timescale molecular dynamics simulations of protein structure and function. , 2009, Current opinion in structural biology.

[3]  C. Brooks,et al.  Novel generalized Born methods , 2002 .

[4]  Michael F. Lynch,et al.  Review of ring perception algorithms for chemical graphs , 1989, J. Chem. Inf. Comput. Sci..

[5]  Michael F. Lynch,et al.  Computer storage and retrieval of generic chemical structures in patents. 7. Parallel simulation of a relaxation algorithm for chemical substructure search , 1986, Journal of chemical information and computer sciences.

[6]  Junmei Wang,et al.  Development and testing of a general amber force field , 2004, J. Comput. Chem..

[7]  P. Kollman,et al.  Automatic atom type and bond type perception in molecular mechanical calculations. , 2006, Journal of molecular graphics & modelling.

[8]  Richard A. Friesner,et al.  A mixed quantum mechanics/molecular mechanics (QM/MM) method for large‐scale modeling of chemistry in protein environments , 2000, J. Comput. Chem..

[9]  U. Singh,et al.  A NEW FORCE FIELD FOR MOLECULAR MECHANICAL SIMULATION OF NUCLEIC ACIDS AND PROTEINS , 1984 .

[10]  A Srinivas Reddy,et al.  Virtual screening in drug discovery -- a computational perspective. , 2007, Current protein & peptide science.

[11]  P. Kollman,et al.  A Second Generation Force Field for the Simulation of Proteins, Nucleic Acids, and Organic Molecules J. Am. Chem. Soc. 1995, 117, 5179−5197 , 1996 .

[12]  Alexander D. MacKerell Empirical force fields for biological macromolecules: Overview and issues , 2004, J. Comput. Chem..

[13]  Jianpeng Ma,et al.  CHARMM: The biomolecular simulation program , 2009, J. Comput. Chem..

[14]  Michael F. Lynch,et al.  Computer storage and retrieval of generic chemical structures in patents. 10. Assignment and logical bubble-up of ring screens for structurally explicit generics , 1989, J. Chem. Inf. Comput. Sci..

[15]  Michael F. Lynch,et al.  Computer storage and retrieval of generic chemical structures in patents. 5. Algorithmic generation of fragment descriptors for generic structure screening , 1984, J. Chem. Inf. Comput. Sci..

[16]  Richard A. Sykes,et al.  SMILES Extensions for Pattern Matching and Molecular Transformations: Applications in Chemoinformatics , 1999, J. Chem. Inf. Comput. Sci..

[17]  Michael F. Lynch,et al.  Computer storage and retrieval of generic chemical structures in patents. 14. Fragment generation from generic structures , 1992, J. Chem. Inf. Comput. Sci..

[18]  Gerard J. Kleywegt,et al.  Crystallographic refinement of ligand complexes , 2006, Acta crystallographica. Section D, Biological crystallography.

[19]  Alexander D. MacKerell,et al.  Molecular dynamics simulations of nucleic acid-protein complexes. , 2008, Current opinion in structural biology.

[20]  W. L. Jorgensen,et al.  Development and Testing of the OPLS All-Atom Force Field on Conformational Energetics and Properties of Organic Liquids , 1996 .

[21]  James C. Tiernan,et al.  An efficient search algorithm to find the elementary circuits of a graph , 1970, CACM.

[22]  Michael F. Lynch,et al.  Computer storage and retrieval of generic chemical structures in patents. 15. Generation of topological fragment descriptors from nontopological representations of generic structure components , 1993, J. Chem. Inf. Comput. Sci..

[23]  S. Schreiber,et al.  Target-oriented and diversity-oriented organic synthesis in drug discovery. , 2000, Science.

[24]  M. Karplus,et al.  CHARMM: A program for macromolecular energy, minimization, and dynamics calculations , 1983 .

[25]  Alexander D. MacKerell,et al.  CHARMM general force field: A force field for drug‐like molecules compatible with the CHARMM all‐atom additive biological force fields , 2009, J. Comput. Chem..

[26]  Thomas A. Halgren,et al.  Merck molecular force field. V. Extension of MMFF94 using experimental data, additional computational data, and empirical rules , 1996, J. Comput. Chem..

[27]  David A. Dixon,et al.  Annual reports in computational chemistry , 2007 .

[28]  A. W. Schüttelkopf,et al.  PRODRG: a tool for high-throughput crystallography of protein-ligand complexes. , 2004, Acta crystallographica. Section D, Biological crystallography.