Trends in information theory-based chemical structure codification

This report offers a chronological review of the most relevant applications of information theory in the codification of chemical structure information, through the so-called information indices. Basically, these are derived from the analysis of the statistical patterns of molecular structure representations, which include primitive global chemical formulae, chemical graphs, or matrix representations. Finally, new approaches that attempt to go “back to the roots” of information theory, in order to integrate other information-theoretic measures in chemical structure coding are discussed.

[1]  Lemont B. Kier,et al.  Electrotopological State Indices for Atom Types: A Novel Combination of Electronic, Topological, and Valence State Information , 1995, J. Chem. Inf. Comput. Sci..

[2]  E. Cayley,et al.  Ueber die analytischen Figuren, welche in der Mathematik Bäume genannt werden und ihre Anwendung auf die Theorie chemischer Verbindungen , 1875 .

[3]  Paul W Ayers,et al.  What is an atom in a molecule? , 2005, The journal of physical chemistry. A.

[4]  I. M. Kogan,et al.  Applied Information Theory , 1987 .

[5]  Gilles Klopman,et al.  A new approach to structure-activity using distance information content of graph vertices: A study with phenylalkylamines , 1988 .

[6]  Eugenio Uriarte,et al.  Markovian Backbone Negentropies: Molecular descriptors for protein research. I. Predicting protein stability in Arc repressor mutants , 2004, Proteins.

[7]  A. Balaban,et al.  New vertex invariants and topological indices of chemical graphs based on information on distances , 1991 .

[8]  E. Trucco,et al.  On the information content of graphs: Compound symbols; Different states for each point , 1956 .

[9]  Jordi Mestres,et al.  SHED: Shannon Entropy Descriptors from Topological Feature Distributions , 2006, J. Chem. Inf. Model..

[10]  A. Balaban,et al.  Topological Indices and Related Descriptors in QSAR and QSPR , 2003 .

[11]  Richard W. Hamming,et al.  Coding and Information Theory , 1980 .

[12]  E. Maasoumi A compendium to information theory in economics and econometrics , 1993 .

[13]  A. T. Balaban and O. Ivanciuc,et al.  Historical Development of Topological Indices , 2000 .

[14]  W. Ebeling,et al.  On grammars, complexity, and information measures of biological macromolecules , 1980 .

[15]  Roberto Todeschini,et al.  Molecular descriptors for chemoinformatics , 2009 .

[16]  Matthias Dehmer,et al.  Information Indices with High Discriminative Power for Graphs , 2012, PloS one.

[17]  Dusanka Janezic,et al.  Graph-Theoretical Matrices in Chemistry , 2015 .

[18]  Roberto Todeschini,et al.  Structure/Response Correlations and Similarity/Diversity Analysis by GETAWAY Descriptors, 1. Theory of the Novel 3D Molecular Descriptors , 2002, J. Chem. Inf. Comput. Sci..

[19]  C. Raychaudhury,et al.  Discrimination of isomeric structures using information theoretic topological indices , 1984 .

[20]  Francisco Torrens,et al.  Relations frequency hypermatrices in mutual, conditional, and joint entropy‐based information indices , 2013, J. Comput. Chem..

[21]  Henri Poincaré,et al.  Second Complément à l'Analysis Situs , 1900 .

[22]  E. Desurvire Classical and Quantum Information Theory: An Introduction for the Telecom Scientist , 2009 .

[23]  Frank Harary,et al.  Graph Theory , 2016 .

[24]  Gilles Klopman,et al.  A novel approach to the use of graph theory in structure–activity relationship studies. Application to the qualitative evaluation of mutagenicity in a series of nonfused ring aromatic compounds , 1988 .

[25]  Arieh Ben-Naim,et al.  Entropy: Order or Information , 2011 .

[26]  N. Trinajstic,et al.  Information theory, distance matrix, and molecular branching , 1977 .

[27]  Alexandru T. Balaban,et al.  Chemical graphs , 1979 .

[28]  Shu Lin,et al.  Error control coding : fundamentals and applications , 1983 .

[29]  Paola Gramatica,et al.  Structure/Response Correlations and Similarity/Diversity Analysis by GETAWAY Descriptors, 2. Application of the Novel 3D Molecular Descriptors to QSAR/QSPR Studies , 2002, J. Chem. Inf. Comput. Sci..

[30]  D. H. Rouvray The pioneering contributions of cayley and sylvester to the mathematical description of chemical structure , 1989 .

[31]  Steven H. Bertz,et al.  The first general index of molecular complexity , 1981 .

[32]  Aurel A. Lazar,et al.  Information theory in neuroscience , 2011, Journal of Computational Neuroscience.

[33]  Milan Randić,et al.  Aromaticity of Polycyclic Conjugated Hydrocarbons , 2003 .

[34]  P. Bernaola-Galván,et al.  Compositional segmentation and long-range fractal correlations in DNA sequences. , 1996, Physical review. E, Statistical physics, plasmas, fluids, and related interdisciplinary topics.

[35]  Daniel Cabrol-Bass,et al.  Evaluation in Quantitative Structure-Property Relationship Models of Structural Descriptors Derived from Information-Theory Operators , 2000, J. Chem. Inf. Comput. Sci..

[36]  A. Michalos,et al.  Readings in Mathematical Social Science , 1968 .

[37]  J. Gálvez,et al.  Event-based criteria in GT-STAF information indices: theory, exploratory diversity analysis and QSPR applications , 2013, SAR and QSAR in environmental research.

[38]  A. Mowshowitz,et al.  Entropy and the complexity of graphs. I. An index of the relative complexity of a graph. , 1968, The Bulletin of mathematical biophysics.

[39]  Stephan Borgert,et al.  On Entropy-Based Molecular Descriptors: Statistical Analysis of Real and Synthetic Chemical Structures , 2009, J. Chem. Inf. Model..

[40]  R. García-Domenech,et al.  Some new trends in chemical graph theory. , 2008, Chemical reviews.

[41]  Danail Bonchev,et al.  Informationsgehalt chemischer Elemente , 1977 .

[42]  Matthias Dehmer,et al.  A history of graph entropy measures , 2011, Inf. Sci..

[43]  R. Todeschini,et al.  Molecular Descriptors for Chemoinformatics: Volume I: Alphabetical Listing / Volume II: Appendices, References , 2009 .

[44]  Danail Bonchev,et al.  Generalization of the Graph Center Concept, and Derived Topological Centric Indexes , 1980, J. Chem. Inf. Comput. Sci..

[45]  C Cosmi,et al.  Characterization of nucleotidic sequences using maximum entropy techniques. , 1990, Journal of theoretical biology.

[46]  Stephan Borgert,et al.  Entropy Bounds for Molecular Hierarchical Networks , 2008 .

[47]  S C Basak,et al.  Molecular topology and narcosis. A quantitative structure-activity relationship (QSAR) study of alcohols using complementary information content (CIC). , 1983, Arzneimittel-Forschung.

[48]  Subhash C. Basak,et al.  A quantitative structure activity relationship (QSAR) analysis of carbomoyl piperidines, barbiturates and alkanes using information – theoretic topological indices-1 , 1981 .

[49]  Thomas D. Schneider,et al.  Fast Multiple Alignment of Ungapped DNA Sequences Using Information Theory and a Relaxation Method , 1996, Discret. Appl. Math..

[50]  H. Quastler,et al.  Essays on the use of information theory in biology , 1953 .

[51]  F. Harary,et al.  Chemical graphs—V : Enumeration and proposed nomenclature of benzenoid cata-condensed polycyclic aromatic hydrocarbons , 1968 .

[52]  N. Rashevsky Life, information theory, and topology , 1955 .

[53]  H. Hosoya Topological Index. A Newly Proposed Quantity Characterizing the Topological Nature of Structural Isomers of Saturated Hydrocarbons , 1971 .

[54]  Francisco Torrens,et al.  Shannon's, mutual, conditional and joint entropy information indices: generalization of global indices defined from local vertex invariants. , 2013, Current computer-aided drug design.

[55]  Thomas M. Cover,et al.  Elements of Information Theory , 2005 .

[56]  E. Jaynes Information Theory and Statistical Mechanics , 1957 .

[57]  R. Parr,et al.  Information Theory Thermodynamics of Molecules and Their Hirshfeld Fragments , 2001 .

[58]  Abbe Mowshowitz,et al.  Entropy and the complexity of graphs: IV. Entropy measures and graphical structure , 1968 .

[59]  Danail G. Bonchev,et al.  Information Theoretic Complexity Measures , 2009, Encyclopedia of Complexity and Systems Science.

[60]  R. A. Leibler,et al.  On Information and Sufficiency , 1951 .

[61]  R. Nalewajski Applications of the Information Theory to Problems of Molecular Electronic Structure and Chemical Reactivity , 2002 .

[62]  Danail Bonchev,et al.  Topological indices for molecular fragments and new graph invariants , 1988 .

[63]  Ovidiu Ivanciuc,et al.  DESIGN OF TOPOLOGICAL INDICES. PART 3. NEW IDENTIFICATION NUMBERS FOR CHEMICAL STRUCTURES : MINID AND MINSID , 1996 .

[64]  Alexandru T Balaban,et al.  Graphical representation of proteins. , 2011, Chemical reviews.

[65]  L. Pogliani From molecular connectivity indices to semiempirical connectivity terms: recent trends in graph theoretical descriptors. , 2000, Chemical reviews.

[66]  E. Broniatowska,et al.  Entropy displacement and information distance analysis of electron distributions in molecules and their Hirshfeld atoms , 2003 .

[67]  Danail Bonchev,et al.  The concept for the centre of a chemical structure and its applications , 1989 .

[68]  H. Wiener Structural determination of paraffin boiling points. , 1947, Journal of the American Chemical Society.

[69]  G. Whitesides,et al.  Complexity in chemistry. , 1999, Science.

[70]  M. Dehmer,et al.  Entropy Bounds for Hierarchical Molecular Networks , 2008, PloS one.

[71]  E. Trucco A note on the information content of graphs , 1956 .

[72]  Richard Wesley Hamming,et al.  Coding and information theory (2. ed.) , 1986 .

[73]  A. Balaban,et al.  Vertex- and Edge-Weighted Molecular Graphs and Derived Structural Descriptors , 2000 .

[74]  Ovidiu Ivanciuc,et al.  Chemical graphs with degenerate topological indices based on information on distances , 1993 .

[75]  Danail Bonchev,et al.  Information theoretic indices for characterization of chemical structures , 1983 .

[76]  D. Kamenski,et al.  Symmetry and information content of chemical structures , 1976 .

[77]  D. H. Rouvray,et al.  Complexity : Introduction and Fundamentals , 2003 .

[78]  Sang Joon Kim,et al.  A Mathematical Theory of Communication , 2006 .

[79]  Robert E. Ulanowicz,et al.  The Central Role of Information Theory in Ecology , 2011, Towards an Information Theory of Complex Networks.

[80]  R. Blahut Theory and practice of error control codes , 1983 .

[81]  Matthias Dehmer,et al.  Structural information content of networks: Graph entropy based on local vertex functionals , 2008, Comput. Biol. Chem..