Benchmarking of HPCC: A novel 3D molecular representation combining shape and pharmacophoric descriptors for efficient molecular similarity assessments.

Since 3D molecular shape is an important determinant of biological activity, designing accurate 3D molecular representations is still of high interest. Several chemoinformatic approaches have been developed to try to describe accurate molecular shapes. Here, we present a novel 3D molecular description, namely harmonic pharma chemistry coefficient (HPCC), combining a ligand-centric pharmacophoric description projected onto a spherical harmonic based shape of a ligand. The performance of HPCC was evaluated by comparison to the standard ROCS software in a ligand-based virtual screening (VS) approach using the publicly available directory of useful decoys (DUD) data set comprising over 100,000 compounds distributed across 40 protein targets. Our results were analyzed using commonly reported statistics such as the area under the curve (AUC) and normalized sum of logarithms of ranks (NSLR) metrics. Overall, our HPCC 3D method is globally as efficient as the state-of-the-art ROCS software in terms of enrichment and slightly better for more than half of the DUD targets. Since it is largely admitted that VS results depend strongly on the nature of the protein families, we believe that the present HPCC solution is of interest over the current ligand-based VS methods.

[1]  J. Irwin,et al.  Benchmarking sets for molecular docking. , 2006, Journal of medicinal chemistry.

[2]  Lazaros Mavridis,et al.  Comprehensive Comparison of Ligand-Based Virtual Screening Tools Against the DUD Data set Reveals Limitations of Current 3D Methods , 2010, J. Chem. Inf. Model..

[3]  Lazaros Mavridis,et al.  Toward High Throughput 3D Virtual Screening Using Spherical Harmonic Surface Representations , 2007, J. Chem. Inf. Model..

[4]  B. Fan,et al.  Molecular similarity and diversity in chemoinformatics: From theory to applications , 2006, Molecular Diversity.

[5]  Wolfgang H. B. Sauer,et al.  Molecular Shape Diversity of Combinatorial Libraries: A Prerequisite for Broad Bioactivity , 2003, J. Chem. Inf. Comput. Sci..

[6]  Andreas Bender,et al.  Alpha Shapes Applied to Molecular Shape Characterization Exhibit Novel Properties Compared to Established Shape Descriptors , 2009, J. Chem. Inf. Model..

[7]  Ajay N. Jain,et al.  Molecular Shape and Medicinal Chemistry: A Perspective , 2010, Journal of medicinal chemistry.

[8]  A. Nicholls,et al.  How to do an evaluation: pitfalls and traps , 2008, J. Comput. Aided Mol. Des..

[9]  Peter Willett,et al.  Effectiveness of 2D fingerprints for scaffold hopping. , 2011, Future medicinal chemistry.

[10]  W. Cai,et al.  Molecular simulations using spherical harmonics , 2010 .

[11]  Andy Jennings,et al.  Selection of Molecules Based on Shape and Electrostatic Similarity: Proof of Concept of "Electroforms" , 2007, J. Chem. Inf. Model..

[12]  Karthik Ramani,et al.  Using diffusion distances for flexible molecular shape comparison , 2010, BMC Bioinformatics.

[13]  Christopher I. Bayly,et al.  Evaluating Virtual Screening Methods: Good and Bad Metrics for the "Early Recognition" Problem , 2007, J. Chem. Inf. Model..

[14]  Michael Nilges,et al.  Comparative Evaluation of 3D Virtual Ligand Screening Methods: Impact of the Molecular Alignment on Enrichment , 2010, J. Chem. Inf. Model..

[15]  Peter Willett,et al.  Similarity searching using 2D structural fingerprints. , 2011, Methods in molecular biology.

[16]  Irwin D. Kuntz,et al.  A fast and efficient method for 2D and 3D molecular shape description , 1992, J. Comput. Aided Mol. Des..

[17]  Frank Weinhold,et al.  Chemistry: A new twist on molecular shape , 2001, Nature.

[18]  Jerry O Ebalunode,et al.  Molecular shape technologies in drug discovery: methods and applications. , 2010, Current topics in medicinal chemistry.

[19]  Robert L. Jernigan,et al.  A New Class of Molecular Shape Descriptors, 1. Theory and Properties , 2002, J. Chem. Inf. Comput. Sci..

[20]  Karthik Ramani,et al.  IDSS: deformation invariant signatures for molecular shape comparison , 2009, BMC Bioinformatics.

[21]  Daisuke Kihara,et al.  Application of 3D Zernike descriptors to shape-based ligand similarity searching , 2009, J. Cheminformatics.

[22]  Raman Sharma,et al.  ElectroShape: fast molecular similarity calculations incorporating shape, chirality and electrostatics , 2010, J. Comput. Aided Mol. Des..

[23]  Alexander M. Lewis,et al.  Identification of a chemical probe for NAADP by virtual screening , 2009, Nature chemical biology.

[24]  Yegor Zyrianov Distribution-Based Descriptors of the Molecular Shape , 2005, J. Chem. Inf. Model..

[25]  Robert P. Sheridan Alternative Global Goodness Metrics and Sensitivity Analysis: Heuristics to Check the Robustness of Conclusions from Studies Comparing Virtual Screening Methods , 2008, J. Chem. Inf. Model..

[26]  Wensheng Cai,et al.  New approach for representation of molecular surface , 1998, J. Comput. Chem..

[27]  R. Nussinov,et al.  Molecular shape comparisons in searches for active sites and functional similarity. , 1998, Protein engineering.

[28]  W. Cai,et al.  SHEF: a vHTS geometrical filter using coefficients of spherical harmonic molecular surfaces , 2008, Journal of molecular modeling.

[29]  David W. Ritchie,et al.  Using Consensus-Shape Clustering To Identify Promiscuous Ligands and Protein Targets and To Choose the Right Query for Shape-Based Virtual Screening , 2011, J. Chem. Inf. Model..

[30]  Simona Distinto,et al.  Evaluation of the performance of 3D virtual screening protocols: RMSD comparisons, enrichment assessments, and decoy selection—What can we learn from earlier mistakes? , 2008, J. Comput. Aided Mol. Des..

[31]  Sean Ekins,et al.  The importance of discerning shape in molecular pharmacology. , 2009, Trends in pharmacological sciences.

[32]  Pierre Baldi,et al.  A CROC stronger than ROC: measuring, visualizing and optimizing early retrieval , 2010, Bioinform..

[33]  C. Wermuth,et al.  Similarity in drugs: reflections on analogue design. , 2006, Drug discovery today.

[34]  Tom Fawcett,et al.  An introduction to ROC analysis , 2006, Pattern Recognit. Lett..

[35]  Honglin Li,et al.  Identification of novel falcipain-2 inhibitors as potential antimalarial agents through structure-based virtual screening. , 2009, Journal of medicinal chemistry.

[36]  Jürgen Bajorath,et al.  Molecular similarity concepts and search calculations. , 2008, Methods in molecular biology.

[37]  S Subramaniam,et al.  Analytical shape computation of macromolecules: I. molecular area and volume through alpha shape , 1998, Proteins.

[38]  Ferran Sanz,et al.  Incorporating molecular shape into the alignment-free Grid-Independent Descriptors. , 2004, Journal of medicinal chemistry.

[39]  Guan-Hua Du,et al.  [Research progress of virtual screening aided drug discovery]. , 2009, Yao xue xue bao = Acta pharmaceutica Sinica.

[40]  J. Hanley,et al.  The meaning and use of the area under a receiver operating characteristic (ROC) curve. , 1982, Radiology.

[41]  Milan Randic,et al.  Novel Shape Descriptors for Molecular Graphs , 2001, J. Chem. Inf. Comput. Sci..

[42]  Jianhua Yao,et al.  Multiple-step virtual screening using VSM-G: overview and validation of fast geometrical matching enrichment , 2008, Journal of molecular modeling.

[43]  Andrew C. Good,et al.  New molecular shape descriptors: Application in database screening , 1995, J. Comput. Aided Mol. Des..

[44]  Y. Martin,et al.  Do structurally similar molecules have similar biological activity? , 2002, Journal of medicinal chemistry.

[45]  James L. Melville,et al.  Better than Random? The Chemotype Enrichment Problem , 2009, J. Chem. Inf. Model..

[46]  J. A. Grant,et al.  A fast method of molecular shape comparison: A simple application of a Gaussian description of molecular shape , 1996, J. Comput. Chem..

[47]  Yat T Tang,et al.  Virtual screening for lead discovery. , 2011, Methods in molecular biology.

[48]  W. Graham Richards,et al.  Similarity of molecular shape , 1991, J. Comput. Aided Mol. Des..

[49]  Xueguang Shao,et al.  Protein-ligand recognition using spherical harmonic molecular surfaces: towards a fast and efficient filter for large virtual throughput screening. , 2002, Journal of molecular graphics & modelling.

[50]  David W. Ritchie,et al.  Comparison of Ligand-Based and Receptor-Based Virtual Screening of HIV Entry Inhibitors for the CXCR4 and CCR5 Receptors Using 3D Ligand Shape Matching and Ligand-Receptor Docking , 2008, J. Chem. Inf. Model..

[51]  S. Muchmore,et al.  The Use of Three‐Dimensional Shape and Electrostatic Similarity Searching in the Identification of a Melanin‐Concentrating Hormone Receptor 1 Antagonist , 2006, Chemical biology & drug design.

[52]  Simona Distinto,et al.  How To Optimize Shape-Based Virtual Screening: Choosing the Right Query and Including Chemical Information , 2009, J. Chem. Inf. Model..

[53]  Jürgen Bajorath,et al.  Molecular similarity analysis in virtual screening: foundations, limitations and novel approaches. , 2007, Drug discovery today.

[54]  J. Andrew Grant,et al.  Molecular shape and electrostatics in the encoding of relevant chemical information , 2005, J. Comput. Aided Mol. Des..

[55]  S Hazout,et al.  Searching for geometric molecular shape complementarity using bidimensional surface profiles. , 1992, Journal of molecular graphics.

[56]  Graham J. L. Kemp,et al.  Fast computation, rotation, and comparison of low resolution spherical harmonic molecular surfaces , 1999, J. Comput. Chem..

[57]  Takayuki Kotani,et al.  Rapid Evaluation of Molecular Shape Similarity Index Using Pairwise Calculation of the Nearest Atomic Distances , 2002, J. Chem. Inf. Comput. Sci..

[58]  Santosh Putta,et al.  Shapes of things: computer modeling of molecular shape in drug discovery. , 2007, Current topics in medicinal chemistry.

[59]  David A. Cosgrove,et al.  A novel method of aligning molecules by local surface shape similarity , 2000, J. Comput. Aided Mol. Des..

[60]  Nabil H. Mustafa,et al.  Fast Molecular Shape Matching Using Contact Maps , 2007, J. Comput. Biol..

[61]  David W. Ritchie,et al.  Clustering and Classifying Diverse HIV Entry Inhibitors Using a Novel Consensus Shape-Based Virtual Screening Approach: Further Evidence for Multiple Binding Sites within the CCR5 Extracellular Pocket , 2008, J. Chem. Inf. Model..

[62]  Mark S. Johnson,et al.  ShaEP: Molecular Overlay Based on Shape and Electrostatic Potential , 2009, J. Chem. Inf. Model..

[63]  A. Olson,et al.  Shape analysis of molecular surfaces , 1993, Biopolymers.

[64]  Ron Kohavi,et al.  The Case against Accuracy Estimation for Comparing Induction Algorithms , 1998, ICML.

[65]  W. Graham Richards,et al.  Ultrafast shape recognition to search compound databases for similar molecular shapes , 2007, J. Comput. Chem..

[66]  Wei Zhao,et al.  A statistical framework to evaluate virtual screening , 2009, BMC Bioinformatics.

[67]  I. Kuntz,et al.  Matching chemistry and shape in molecular docking. , 1993, Protein engineering.

[68]  Pedro J Ballester,et al.  Prospective virtual screening with Ultrafast Shape Recognition: the identification of novel inhibitors of arylamine N-acetyltransferases , 2010, Journal of The Royal Society Interface.

[69]  Janet M. Thornton,et al.  Real spherical harmonic expansion coefficients as 3D shape descriptors for protein binding pocket and ligand comparisons , 2005, Bioinform..

[70]  Goran Neshich,et al.  A fast surface-matching procedure for protein–ligand docking , 2006, Journal of molecular modeling.

[71]  H. Wolfson,et al.  Small molecule recognition: solid angles surface representation and molecular shape complementarity. , 1999, Combinatorial chemistry & high throughput screening.