Graphs and networks in chemical and biological informatics: past, present and future.

Chemical and biological network analysis has recently garnered intense interest from the perspective of drug design and discovery. While graph theoretic concepts have a long history in chemistry - predating quantum mechanics - and graphical measures of chemical structures date back to the 1970s, it is only recently with the advent of public repositories of information and availability of high-throughput assays and computational resources that network analysis of large-scale chemical networks, such as protein-protein interaction networks, has become possible. Drug design and discovery are undergoing a paradigm shift, from the notion of 'one target, one drug' to a much more nuanced view that relies on multiple sources of information: genomic, proteomic, metabolomic and so on. This holistic view of drug design is an incredibly daunting undertaking still very much in its infancy. Here, we focus on current developments in graph- and network-centric approaches in chemical and biological informatics, with particular reference to applications in the fields of SAR modeling and drug design. Key insights from the past suggest a path forward via visualization and fusion of multiple sources of chemical network data.

[1]  Valerie J Gillet,et al.  Reduced graphs and their applications in chemoinformatics. , 2011, Methods in molecular biology.

[2]  M. Ashburner,et al.  Gene Ontology: tool for the unification of biology , 2000, Nature Genetics.

[3]  T. Klabunde Chemogenomic approaches to drug discovery: similar receptors bind similar ligands , 2007, British journal of pharmacology.

[4]  Hanah Margalit,et al.  Characterization and prediction of protein–protein interactions within and between complexes , 2006, Proceedings of the National Academy of Sciences.

[5]  Pierre Baldi,et al.  Discovery of Power-Laws in Chemical Space , 2008, J. Chem. Inf. Model..

[6]  Peter Murray-Rust Semantic science and its communication - a personal view , 2011, J. Cheminformatics.

[7]  Sourav Das,et al.  Rapid Comparison of Protein Binding Site Surfaces with Property Encoded Shape Distributions , 2009, J. Chem. Inf. Model..

[8]  Norman C. LeDonne,et al.  Use of structure-activity landscape index curves and curve integrals to evaluate the performance of multiple machine learning prediction models , 2011, J. Cheminformatics.

[9]  Valerie J. Gillet,et al.  Computer storage and retrieval of generic chemical structures in patents. 8. Reduced chemical graphs and their applications in generic chemical structure retrieval , 1987, J. Chem. Inf. Comput. Sci..

[10]  Michael P. Krein,et al.  Exploration of the topology of chemical spaces with network measures. , 2011, The journal of physical chemistry. A.

[11]  Herbert Waldmann,et al.  Bioactivity-guided navigation of chemical space. , 2010, Accounts of chemical research.

[12]  Egon L. Willighagen,et al.  Linked open drug data for pharmaceutical research and development , 2011, J. Cheminformatics.

[13]  Peter Murray-Rust,et al.  Minimum information about a bioactive entity (MIABE) , 2011, Nature Reviews Drug Discovery.

[14]  Rajarshi Guha,et al.  Assessing How Well a Modeling Protocol Captures a Structure-Activity Landscape , 2008, J. Chem. Inf. Model..

[15]  G. V. Paolini,et al.  Global mapping of pharmacological space , 2006, Nature Biotechnology.

[16]  John Quackenbush,et al.  Extracting biology from high-dimensional biological data , 2007, Journal of Experimental Biology.

[17]  Philip E. Bourne,et al.  Drug Discovery Using Chemical Systems Biology: Repositioning the Safe Medicine Comtan to Treat Multi-Drug and Extensively Drug Resistant Tuberculosis , 2009, PLoS Comput. Biol..

[18]  Jon Winter,et al.  Better compounds faster: the development and exploitation of a desktop predictive chemistry toolkit. , 2012, Drug discovery today.

[19]  Naoki Tanaka,et al.  Small-World Phenomena in Chemical Library Networks: Application to Fragment-Based Drug Discovery , 2009, J. Chem. Inf. Model..

[20]  Gavin Harper,et al.  Evolving Interpretable Structure-Activity Relationships. 1. Reduced Graph Queries , 2008, J. Chem. Inf. Model..

[21]  Anne Mai Wassermann,et al.  Design of Multitarget Activity Landscapes That Capture Hierarchical Activity Cliff Distributions , 2011, J. Chem. Inf. Model..

[22]  A. Barabasi,et al.  Drug—target network , 2007, Nature Biotechnology.

[23]  Ernesto Estrada Universality in protein residue networks. , 2010, Biophysical journal.

[24]  A. Fliri,et al.  Cause-effect relationships in medicine: a protein network perspective. , 2010, Trends in pharmacological sciences.

[25]  Philip M. Kim,et al.  Relating Three-Dimensional Structures to Protein Networks Provides Evolutionary Insights , 2006, Science.

[26]  D. Rognan Chemogenomic approaches to rational drug design , 2007, British journal of pharmacology.

[27]  J. Bajorath,et al.  Local structural changes, global data views: graphical substructure-activity relationship trailing. , 2011, Journal of medicinal chemistry.

[28]  G. Harper,et al.  The reduced graph descriptor in virtual screening and data-driven clustering of high-throughput screening data. , 2004, Journal of chemical information and computer sciences.

[29]  Mathias Wawer,et al.  Navigating structure-activity landscapes. , 2009, Drug discovery today.

[30]  T. Jenssen,et al.  A literature network of human genes for high-throughput analysis of gene expression , 2001 .

[31]  Saraswathi Vishveshwara,et al.  Correlation of the Side-Chain Hubs with the Functional Residues in DNA Binding Protein Structures , 2006, J. Chem. Inf. Model..

[32]  Jürgen Bajorath,et al.  Exploring activity cliffs in medicinal chemistry. , 2012, Journal of medicinal chemistry.

[33]  Jürgen Bajorath,et al.  Methods for SAR visualization , 2012 .

[34]  J. Bajorath,et al.  Structure-activity relationship anatomy by network-like similarity graphs and local structure-activity relationship indices. , 2008, Journal of medicinal chemistry.

[35]  Michael J. Keiser,et al.  Predicting new molecular targets for known drugs , 2009, Nature.

[36]  David S. Wishart,et al.  DrugBank: a knowledgebase for drugs, drug actions and drug targets , 2007, Nucleic Acids Res..

[37]  Jürgen Bajorath,et al.  Exploring SAR Continuity in the Vicinity of Activity Cliffs , 2012, Chemical biology & drug design.

[38]  R. Guimerà,et al.  Functional cartography of complex metabolic networks , 2005, Nature.

[39]  José L. Medina-Franco,et al.  Characterization of Activity Landscapes Using 2D and 3D Similarity Methods: Consensus Activity Cliffs , 2009, J. Chem. Inf. Model..

[40]  Dimitris K Agrafiotis,et al.  SAR maps: a new SAR visualization technique for medicinal chemists. , 2007, Journal of medicinal chemistry.

[41]  Sourav Das,et al.  Binding Affinity Prediction with Property-Encoded Shape Distribution Signatures , 2010, J. Chem. Inf. Model..

[42]  Rajarshi Guha,et al.  Structure-Activity Landscape Index: Identifying and Quantifying Activity Cliffs , 2008, J. Chem. Inf. Model..

[43]  Petra Schneider,et al.  From Virtual Screening to Bioactive Compounds by Visualizing and Clustering of Chemical Space , 2012, Molecular informatics.