mRAISE: an alternative algorithmic approach to ligand-based virtual screening

Ligand-based virtual screening is a well established method to find new lead molecules in todays drug discovery process. In order to be applicable in day to day practice, such methods have to face multiple challenges. The most important part is the reliability of the results, which can be shown and compared in retrospective studies. Furthermore, in the case of 3D methods, they need to provide biologically relevant molecular alignments of the ligands, that can be further investigated by a medicinal chemist. Last but not least, they have to be able to screen large databases in reasonable time. Many algorithms for ligand-based virtual screening have been proposed in the past, most of them based on pairwise comparisons. Here, a new method is introduced called mRAISE. Based on structural alignments, it uses a descriptor-based bitmap search engine (RAISE) to achieve efficiency. Alignments created on the fly by the search engine get evaluated with an independent shape-based scoring function also used for ranking of compounds. The correct ranking as well as the alignment quality of the method are evaluated and compared to other state of the art methods. On the commonly used Directory of Useful Decoys dataset mRAISE achieves an average area under the ROC curve of 0.76, an average enrichment factor at 1 % of 20.2 and an average hit rate at 1 % of 55.5. With these results, mRAISE is always among the top performing methods with available data for comparison. To access the quality of the alignments calculated by ligand-based virtual screening methods, we introduce a new dataset containing 180 prealigned ligands for 11 diverse targets. Within the top ten ranked conformations, the alignment closest to X-ray structure calculated with mRAISE has a root-mean-square deviation of less than 2.0 Å for 80.8 % of alignment pairs and achieves a median of less than 2.0 Å for eight of the 11 cases. The dataset used to rate the quality of the calculated alignments is freely available at http://www.zbh.uni-hamburg.de/mraise-dataset.html. The table of all PDB codes contained in the ensembles can be found in the supplementary material. The software tool mRAISE is freely available for evaluation purposes and academic use (see http://www.zbh.uni-hamburg.de/raise).

[1]  Matthias Rarey,et al.  Fast Protein Binding Site Comparison via an Index-Based Screening Technology , 2013, J. Chem. Inf. Model..

[2]  Simona Distinto,et al.  How To Optimize Shape-Based Virtual Screening: Choosing the Right Query and Including Chemical Information , 2009, J. Chem. Inf. Model..

[3]  Matthias Rarey,et al.  Facing the Challenges of Structure-Based Target Prediction by Inverse Virtual Screening , 2014, J. Chem. Inf. Model..

[4]  M. Rarey,et al.  SIENA: Efficient Compilation of Selective Protein Binding Site Ensembles , 2016, J. Chem. Inf. Model..

[5]  Michael M. Mysinger,et al.  Directory of Useful Decoys, Enhanced (DUD-E): Better Ligands and Decoys for Better Benchmarking , 2012, Journal of medicinal chemistry.

[6]  A. S. Nascimento,et al.  MolShaCS: a free and open source tool for ligand similarity identification based on Gaussian descriptors. , 2013, European journal of medicinal chemistry.

[7]  Matthias Rarey,et al.  MONA 2: A Light Cheminformatics Platform for Interactive Compound Library Processing , 2015, J. Chem. Inf. Model..

[8]  Mark S. Johnson,et al.  ShaEP: Molecular Overlay Based on Shape and Electrostatic Potential , 2009, J. Chem. Inf. Model..

[9]  David Rogers,et al.  Extended-Connectivity Fingerprints , 2010, J. Chem. Inf. Model..

[10]  Jeffrey Skolnick,et al.  LIGSIFT: an open-source tool for ligand structural alignment and virtual screening , 2015, Bioinform..

[11]  Matthias Rarey,et al.  TrixX: structure-based molecule indexing for large-scale virtual screening in sublinear time , 2007, J. Comput. Aided Mol. Des..

[12]  Matthias Rarey,et al.  Evidence of Water Molecules - A Statistical Evaluation of Water Molecules Based on Electron Density , 2015, J. Chem. Inf. Model..

[13]  Garrett M. Morris,et al.  Shape‐based similarity searching in chemical databases , 2013 .

[14]  Matthias Rarey,et al.  CONFECT: Conformations from an Expert Collection of Torsion Patterns , 2013, ChemMedChem.

[15]  Matthias Rarey,et al.  An integrated approach to knowledge-driven structure-based virtual screening , 2014, Journal of Computer-Aided Molecular Design.

[16]  Gert Thijs,et al.  Pharao: pharmacophore alignment and optimization. , 2008, Journal of molecular graphics & modelling.

[17]  J. A. Grant,et al.  A fast method of molecular shape comparison: A simple application of a Gaussian description of molecular shape , 1996, J. Comput. Chem..

[18]  Matthias Rarey,et al.  Inside Cover: CONFECT: Conformations from an Expert Collection of Torsion Patterns (ChemMedChem 10/2013) , 2013 .

[19]  Katrin Stierand,et al.  Molecular complexes at a glance: automated generation of two-dimensional complex diagrams , 2006, Bioinform..

[20]  Maxim Totrov,et al.  Atomic Property Fields: Generalized 3D Pharmacophoric Potential for Automated Ligand Superposition, Pharmacophore Elucidation and 3D QSAR , 2007, Chemical biology & drug design.

[21]  Ruben Abagyan,et al.  ICM—A new method for protein modeling and design: Applications to docking and structure prediction from the distorted native conformation , 1994, J. Comput. Chem..

[22]  C. Lemmen,et al.  FLEXS: a method for fast flexible ligand superposition. , 1998, Journal of medicinal chemistry.

[23]  J. Irwin,et al.  Benchmarking sets for molecular docking. , 2006, Journal of medicinal chemistry.

[24]  Ajay N. Jain,et al.  Ligand-based structural hypotheses for virtual screening. , 2004, Journal of medicinal chemistry.

[25]  Michael Nilges,et al.  Comparative Evaluation of 3D Virtual Ligand Screening Methods: Impact of the Molecular Alignment on Enrichment , 2010, J. Chem. Inf. Model..