De novo Drug Design – Ye olde Scoring Problem Revisited

Generating molecules by computational means is common practice in drug discovery. New chemical entities with the desired properties may serve both as tool compounds for chemogenomics studies and as starting points for hit-tolead expansion. The three challenges for automated de novo design are i) the assembly of synthetically accessible structures, ii) scoring and property prediction, and finally iii) the systematic optimization of promising molecules in adaptive learning cycles. During the past three decades, numerous methods, algorithms and heuristics have been proposed to address each of these problems. While the generation of new chemical entities with attractive chemical scaffolds has become feasible by reaction-driven fragment assembly, and the in silico optimization problem may also be considered largely solved, the persisting issue of compound scoring remains difficult. Scoring means picking the best compounds from a large pool of accessible possibilities. This process typically includes both ligandand structure(receptor)-based virtual screening of the computationally generated molecules. This large virtual compound pool contains many more inactive or problematic chemical structures than desirable ones. While compound elimination by appropriate scoring models discards the bulk of the designs (“negative design”) with acceptable accuracy, the selection of the best or most promising ones (“positive design”) remains error-prone. The conventional techniques employed at this step of the selection process include coarse-grained and application-specific heuristics, physicochemical property calculation, quantitative and qualitative structure-activity relationship models, similarity calculations on various levels of detail and with different molecular representations, shape matching and automated ligand docking, as well as the detection of potentially toxic and otherwise unwanted chemical structures. More recently, qualitative and quantitative onand off-target prediction methods have been added to the molecular designer’s tool chest. During the PacifiChem2015 conference in Honolulu, HI, USA, we organized a symposium to share experience and discuss the progress in computer-based de novo compound design and scoring (Figure 1). Several symposium papers and closely related contributions are compiled in this focused special issue of Molecular Informatics. It is evident that project-specific, customized scoring functions will help reduce the false-positive prediction rate. In this special issue, we highlight methods that diverge from mainstream approaches and point towards future developments in molecular informatics research. In a Methods Corner article, Kaneko and Funatsu review a structure generation method that can be used to design molecules that lie within the applicability domain of a scoring function. This “inverse QSAR” approach to de novo design is now awaiting practical application. Fukunishi et al. present methods to improve docking scores by regression-based correction for use in structure-based virtual screening and molecule design. In a cross-validation study, the weighted scoring functions showed improved accuracy over simpler methods. Multidimensional compound optimization by Pareto ranking is showcased in the article by Daeyaert and Deem. They show that Pareto sorting improves the performance of de novo design algorithms in generating molecules with hard-tomeet constraints. This tackles the important problem of Figure 1. Researchers from across the globe discussed the status and future possibilities of de novo drug design in a special symposium held at PacifiChem2015 in Honolulu, Hawaii.

[1]  P Reddanna,et al.  Free energy calculations to estimate ligand-binding affinities in structure-based drug design. , 2014, Current pharmaceutical design.

[2]  Tao Wang,et al.  Quantitative structure–activity relationship: promising advances in drug discovery platforms , 2015, Expert opinion on drug discovery.

[3]  Petra Schneider,et al.  De Novo Design at the Edge of Chaos. , 2016, Journal of medicinal chemistry.

[4]  Jung-Hsin Lin,et al.  Scoring functions for prediction of protein-ligand interactions. , 2013, Current pharmaceutical design.

[5]  T. Le,et al.  A Bright Future for Evolutionary Methods in Drug Design , 2015, ChemMedChem.

[6]  Danishuddin,et al.  Descriptors and their selection methods in QSAR analysis: paradigm for drug design. , 2016, Drug discovery today.

[7]  Peter Willett,et al.  The Calculation of Molecular Structural Similarity: Principles and Practice , 2014, Molecular informatics.

[8]  Dieter Lang,et al.  Predicting drug metabolism: experiment and/or computation? , 2015, Nature Reviews Drug Discovery.

[9]  P Schneider,et al.  Spotting and designing promiscuous ligands for drug discovery. , 2016, Chemical communications.

[10]  Pedro J Ballester,et al.  Machine‐learning scoring functions to improve structure‐based binding affinity prediction and virtual screening , 2015, Wiley interdisciplinary reviews. Computational molecular science.

[11]  Vladimir B Bajic,et al.  In silico toxicology: computational methods for the prediction of chemical toxicity , 2016, Wiley interdisciplinary reviews. Computational molecular science.

[12]  G. Schneider,et al.  Voyages to the (un)known: adaptive design of bioactive compounds. , 2009, Trends in biotechnology.

[13]  Tanja Van Mourik,et al.  Uniting Cheminformatics and Chemical Theory To Predict the Intrinsic Aqueous Solubility of Crystalline Druglike Molecules , 2014, J. Chem. Inf. Model..

[14]  G. V. Paolini,et al.  Quantifying the chemical beauty of drugs. , 2012, Nature chemistry.

[15]  David A. Winkler,et al.  Understanding the Roles of the "Two QSARs" , 2016, J. Chem. Inf. Model..

[16]  Vinícius Gonçalves Maltarollo,et al.  Applying machine learning techniques for ADME-Tox prediction: a review , 2015, Expert opinion on drug metabolism & toxicology.

[17]  Gisbert Schneider De novo Molecular Design , 2013 .

[18]  Gisbert Schneider,et al.  Computer-based de novo design of drug-like molecules , 2005, Nature Reviews Drug Discovery.

[19]  W Patrick Walters,et al.  Going further than Lipinski's rule in drug design , 2012, Expert opinion on drug discovery.

[20]  Gisbert Schneider,et al.  Deep Learning in Drug Discovery , 2016, Molecular informatics.

[21]  Jerzy Leszczynski,et al.  Comprehension of drug toxicity: Software and databases , 2014, Comput. Biol. Medicine.