Handling of Tautomerism and Stereochemistry in Compound Registration

Automated registration of compounds from external sources has become necessary because of the large number of compound acquisitions from vendors and the growing number of collaborations with external partners. A prerequisite for automating compound registration is a robust module for determining the structural novelty of the input structures. Any such tool must account for uncertainty about stereochemistry and recognize tautomeric forms of the same compound. It must also validate structures for potential mistakes in connectivity and stereochemistry. Genentech has implemented a Structure Normalization Module based on toolkits offered by OpenEye Scientific Software. The module is incorporated in a graphical application for single compound registration and in scripts for bulk registration. It is also used to check compounds submitted by our collaborators via partner-specific Internet sites. The Genentech Structure Normalization Module uses the widely used V2000 molfile format to accommodate structures received from a wide variety of sources. To establish how much is known about the stereochemistry of each compound, the module requires a separate stereochemical assignment. Structural uniqueness is checked by comparing the canonical SMILES of a standard tautomer. This paper discusses the steps taken to validate the chemical structure and to generate the canonical SMILES of the standard tautomer. It also describes the integration of the validation module into compound registration pathways.
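The uniqueness check described above can be illustrated with a minimal Python sketch. It is not the Genentech module and does not call the OpenEye toolkits: the class `Registry`, the function `toy_standard_smiles`, the compound IDs, and the small lookup table are all hypothetical stand-ins. In practice the normalization step would be a toolkit routine that returns the canonical SMILES of the standard tautomer; here a hand-written table plays that role so that the example runs on its own.

```python
from typing import Callable, Dict, Optional

# Hypothetical stand-in for a toolkit call that returns the canonical SMILES
# of the standard tautomer. A real module would compute this; this table only
# covers the three inputs used in the demonstration below.
_TOY_STANDARD_TAUTOMER: Dict[str, str] = {
    "Oc1ccccn1": "O=c1cccc[nH]1",      # 2-hydroxypyridine, mapped to the 2-pyridone form
    "O=c1cccc[nH]1": "O=c1cccc[nH]1",  # 2-pyridone, already in the chosen form
    "CCO": "CCO",                      # ethanol, no tautomer to consider
}


def toy_standard_smiles(smiles: str) -> str:
    """Return the toy 'standard tautomer' SMILES, or fail on unknown input."""
    try:
        return _TOY_STANDARD_TAUTOMER[smiles]
    except KeyError:
        raise ValueError(f"no toy normalization available for {smiles!r}")


class Registry:
    """Registry keyed on the normalized SMILES of each compound (assumed design)."""

    def __init__(self, normalize: Callable[[str], str]) -> None:
        self._normalize = normalize
        self._id_by_key: Dict[str, str] = {}

    def find_existing(self, smiles: str) -> Optional[str]:
        """Return the ID already registered for this structure, if any."""
        return self._id_by_key.get(self._normalize(smiles))

    def register(self, compound_id: str, smiles: str) -> str:
        """Register a novel structure, or return the ID of the existing match."""
        key = self._normalize(smiles)
        return self._id_by_key.setdefault(key, compound_id)


registry = Registry(toy_standard_smiles)
first = registry.register("CPD-1", "Oc1ccccn1")       # novel structure
second = registry.register("CPD-2", "O=c1cccc[nH]1")  # tautomer of CPD-1
third = registry.register("CPD-3", "CCO")             # unrelated structure
print(first, second, third)
```

Because both tautomeric inputs normalize to the same key, the second submission resolves to the first compound's ID instead of creating a duplicate. Passing the normalization function as a parameter is a choice made for this sketch, so that the toy table can be swapped for a real toolkit call without changing the registry logic.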
