From Structure Diagrams to Visual Chemical Patterns

The intuitive way of chemists to communicate molecules is via two-dimensional structure diagrams. The straightforward visual representations are mostly preferred to the often complicated systematic chemical names. For chemical patterns, however, no comparable visualization standards have evolved so far. Chemical patterns denoting descriptions of chemical features are needed whenever a set of molecules is filtered for certain properties. The currently available representations are constrained to linear molecular pattern languages which are hardly human readable and therefore keep chemists without computational background from systematically formulating patterns. Therefore, we introduce a new visualization concept for chemical patterns. The common standard concept of structure diagrams is extended to account for property descriptions and logic combinations of chemical features in patterns. As a first application of the new concept, we developed the SMARTSviewer, a tool that converts chemical patterns encoded in SMARTS strings to a visual representation. The graphic pattern depiction provides an overview of the specified chemical features, variations, and similarities without needing to decode the often cryptic linear expressions. Taking recent chemical publications from various fields, we demonstrate the wide application range of a graphical chemical pattern language.

[1]  Michael M. Hann,et al.  RECAP-Retrosynthetic Combinatorial Analysis Procedure: A Powerful New Technique for Identifying Privileged Molecular Fragments with Useful Applications in Combinatorial Chemistry , 1998, J. Chem. Inf. Comput. Sci..

[2]  Eric J. Martin,et al.  Conformational Sampling of Bioactive Molecules: A Comparative Study , 2007, J. Chem. Inf. Model..

[3]  Robin Taylor,et al.  New software for searching the Cambridge Structural Database and visualizing crystal structures. , 2002, Acta crystallographica. Section B, Structural science.

[4]  Jonathan Brecher Graphical representation standards for chemical structure diagrams (IUPAC Recommendations 2008) , 2008 .

[5]  Xile Hu,et al.  The nickel/copper-catalyzed direct alkylation of heterocyclic C-H bonds. , 2010, Angewandte Chemie.

[6]  K. Meguellati,et al.  DNA-templated synthesis of trimethine cyanine dyes: a versatile fluorogenic reaction for sensing G-quadruplex formation. , 2010, Angewandte Chemie.

[7]  Robert D. Clark,et al.  SYBYL Line Notation (SLN): A Single Notation to Represent Chemical Structures, Queries, Reactions, and Virtual Libraries. , 2009 .

[8]  David Weininger,et al.  SMILES, a chemical language and information system. 1. Introduction to methodology and encoding rules , 1988, J. Chem. Inf. Comput. Sci..

[9]  Harold E. Helson Structure Diagram Generation , 2010 .

[10]  Ian A. Watson,et al.  Characteristic physical properties and structural fragments of marketed oral drugs. , 2004, Journal of medicinal chemistry.

[11]  Matthias Rarey,et al.  Automated Drawing of Structural Molecular Formulas under Constraints , 2004, J. Chem. Inf. Model..

[12]  Evan Bolton,et al.  The PubChem chemical structure sketcher , 2009, J. Cheminformatics.

[13]  Tudor I. Oprea,et al.  An automated PLS search for biologically relevant QSAR descriptors , 2004, J. Comput. Aided Mol. Des..

[14]  Jörg K. Wegner,et al.  Molecular Query Language (MQL)A Context-Free Grammar for Substructure Matching , 2007, J. Chem. Inf. Model..

[15]  Andreas Zell,et al.  The compressed feature matrix—a fast method for feature based substructure search , 2003, Journal of molecular modeling.

[16]  Lynda B. M. Ellis,et al.  Microbial Pathway Prediction: A Functional Group Approach , 2003, J. Chem. Inf. Comput. Sci..

[17]  Yvonne C. Martin,et al.  ALADDIN: An integrated tool for computer-assisted molecular design and pharmacophore recognition from geometric, steric, and substructure searching of three-dimensional molecular structures , 1989, J. Comput. Aided Mol. Des..

[18]  Alfred V. Aho,et al.  The Theory of Parsing, Translation, and Compiling , 1972 .

[19]  Nicolas Foloppe,et al.  Drug-like Annotation and Duplicate Analysis of a 23-Supplier Chemical Database Totalling 2.7 Million Compounds , 2004, J. Chem. Inf. Model..

[20]  Brian Hudson,et al.  Strategic Pooling of Compounds for High-Throughput Screening , 1999, J. Chem. Inf. Comput. Sci..

[21]  S. Enoch,et al.  Identification of mechanisms of toxic action for skin sensitisation using a SMARTS pattern based approach , 2008, SAR and QSAR in environmental research.

[22]  Matthias Rarey,et al.  Recore: A Fast and Versatile Method for Scaffold Hopping Based on Small Molecule Crystal Structure Conformations , 2007, J. Chem. Inf. Model..

[23]  W Patrick Walters,et al.  Prediction of 'drug-likeness'. , 2002, Advanced drug delivery reviews.

[24]  Gisbert Schneider,et al.  Computer-based de novo design of drug-like molecules , 2005, Nature Reviews Drug Discovery.

[25]  Ian Collins,et al.  Creating an Antibacterial with in Vivo Efficacy: Synthesis and Characterization of Potent Inhibitors of the Bacterial Cell Division Protein FtsZ with Improved Pharmaceutical Properties , 2010, Journal of medicinal chemistry.

[26]  Michael F. Lynch,et al.  Computer storage and retrieval of generic chemical structures in patents, 1. Introduction and general strategy , 1981, J. Chem. Inf. Comput. Sci..

[27]  Franziska Berger,et al.  Counterexamples in Chemical Ring Perception , 2004, J. Chem. Inf. Model..

[28]  Michael M. Hann,et al.  RECAP — Retrosynthetic Combinatorial Analysis Procedure: A Powerful New Technique for Identifying Privileged Molecular Fragments with Useful Applications in Combinatorial Chemistry. , 1998 .