An Additive Definition of Molecular Complexity

A framework for molecular complexity is established that is based on information theory and consistent with chemical knowledge. The resulting complexity index Cm is derived by abstracting the information content of a molecule as the degrees of freedom in the microenvironment of each atom, which allows the molecular complexity to be calculated in a simple, additive way. The index can be applied to any molecule and is sensitive to stereochemistry, heteroatoms, and symmetry. Its performance is evaluated and compared against the current state of the art. Because the index is additive, it gives consistent values even for very large molecules and supports direct comparisons of chemical reactions. Finally, the approach may provide a useful tool for medicinal chemistry in drug design and lead selection, as demonstrated by correlating the molecular complexities of antibiotics with compound-specific parameters.
