The Proximal Lilly Collection: Mapping, Exploring and Exploiting Feasible Chemical Space

Venturing into the immensity of the small molecule universe to identify novel chemical structure is a much discussed objective of many methods proposed by the chemoinformatics community. To this end, numerous approaches using techniques from the fields of computational de novo design, virtual screening and reaction informatics, among others, have been proposed. Although in principle this objective is commendable, in practice there are several obstacles to useful exploitation of the chemical space. Prime among them are the sheer number of theoretically feasible compounds and the practical concern regarding the synthesizability of the chemical structures conceived using in silico methods. We present the Proximal Lilly Collection initiative implemented at Eli Lilly and Co. with the aims to (i) define the chemical space of small, drug-like compounds that could be synthesized using in-house resources and (ii) facilitate access to compounds in this large space for the purposes of ongoing drug discovery efforts. The implementation of PLC relies on coupling access to available synthetic knowledge and resources with chemo/reaction informatics techniques and tools developed for this purpose. We describe in detail the computational framework supporting this initiative and elaborate on the characteristics of the PLC virtual collection of compounds. As an example of the opportunities provided to drug discovery researchers by easy access to a large, realistically feasible virtual collection such as the PLC, we describe a recent application of the technology that led to the discovery of selective kinase inhibitors.

[1]  Peter Willett,et al.  Designing focused libraries using MoSELECT. , 2002, Journal of molecular graphics & modelling.

[2]  Nathan Brown,et al.  Multi-objective optimization methods in drug design. , 2013, Drug discovery today. Technologies.

[3]  Ian A. Watson,et al.  Rules for identifying potentially reactive or promiscuous compounds. , 2012, Journal of medicinal chemistry.

[4]  Gisbert Schneider Future De Novo Drug Design , 2014, Molecular informatics.

[5]  Andreas Evers,et al.  CROSS: an efficient workflow for reaction-driven rescaffolding and side-chain optimization using robust chemical reactions and available reagents. , 2013, Journal of medicinal chemistry.

[6]  Christian Lemmen,et al.  Similarity searching and scaffold hopping in synthetically accessible combinatorial chemistry spaces. , 2008, Journal of medicinal chemistry.

[7]  A. Tversky Features of Similarity , 1977 .

[8]  Robert J. Jilek,et al.  AllChem: generating and searching 1020 synthetically accessible structures , 2007, J. Comput. Aided Mol. Des..

[9]  Holger Claussen,et al.  Searching Fragment Spaces with Feature Trees , 2009, J. Chem. Inf. Model..

[10]  Florent Chevillard,et al.  SCUBIDOO: A Large yet Screenable and Easily Searchable Database of Computationally Created Chemical Compounds Optimized toward High Likelihood of Synthetic Tractability , 2015, J. Chem. Inf. Model..

[11]  Markus Hartenfeller,et al.  DOGS: Reaction-Driven de novo Design of Bioactive Compounds , 2012, PLoS Comput. Biol..

[12]  David S. Wishart,et al.  DrugBank: a knowledgebase for drugs, drug actions and drug targets , 2007, Nucleic Acids Res..

[13]  Zhengwei Peng,et al.  LEAP into the Pfizer Global Virtual Library (PGVL) space: creation of readily synthesizable design ideas automatically. , 2011, Methods in molecular biology.

[14]  Ning Yu,et al.  Efficient Exploration of Large Combinatorial Chemistry Spaces by Monomer-Based Similarity Searching , 2009, J. Chem. Inf. Model..

[15]  M. Vieth,et al.  Discovery of selective RIO2 kinase small molecule ligand. , 2015, Biochimica et biophysica acta.

[16]  Jean-Louis Reymond,et al.  MQN-Mapplet: Visualization of Chemical Space with Interactive Maps of DrugBank, ChEMBL, PubChem, GDB-11, and GDB-13 , 2013, J. Chem. Inf. Model..

[17]  Markus Hartenfeller,et al.  A Collection of Robust Organic Synthesis Reactions for In Silico Molecule Design , 2011, J. Chem. Inf. Model..

[18]  Arthur Dalby,et al.  Description of several chemical structure file formats used by computer programs developed at Molecular Design Limited , 1992, J. Chem. Inf. Comput. Sci..

[19]  Alexander G. Godfrey,et al.  A remote-controlled adaptive medchem lab: an innovative approach to enable drug discovery in the 21st Century. , 2013, Drug discovery today.

[20]  Thierry Kogej,et al.  Automated Recycling of Chemistry for Virtual Screening and Library Design , 2012, J. Chem. Inf. Model..

[21]  John Clark,et al.  PGVL Hub: An integrated desktop tool for medicinal chemists to streamline design and synthesis of chemical libraries and singleton compounds. , 2011, Methods in molecular biology.

[22]  Michael J. Keiser,et al.  Complementarity Between a Docking and a High-Throughput Screen in Discovering New Cruzain Inhibitors† , 2010, Journal of medicinal chemistry.

[23]  Matthias Rarey,et al.  Similarity searching in large combinatorial chemistry spaces , 2001, J. Comput. Aided Mol. Des..

[24]  P. Wipf,et al.  Stochastic voyages into uncharted chemical space produce a representative library of all possible drug-like compounds. , 2013, Journal of the American Chemical Society.

[25]  Christos A. Nicolaou,et al.  Ties in Proximity and Clustering Compounds , 2001, J. Chem. Inf. Comput. Sci..

[26]  Lorenz C. Blum,et al.  Chemical space as a source for new drugs , 2010 .

[27]  Uta Lessel,et al.  Identification of new potent GPR119 agonists by combining virtual screening and combinatorial chemistry. , 2012, Journal of medicinal chemistry.

[28]  J. Reymond,et al.  Chemical Space Travel , 2007, ChemMedChem.

[29]  Darren R. Flower,et al.  On the Properties of Bit String-Based Measures of Chemical Similarity , 1998, J. Chem. Inf. Comput. Sci..

[30]  Valerie J. Gillet,et al.  Knowledge-Based Approach to de Novo Design Using Reaction Vectors , 2009, J. Chem. Inf. Model..

[31]  Christos A Nicolaou,et al.  Molecular library design using multi-objective optimization methods. , 2011, Methods in molecular biology.

[32]  Zhengwei Peng,et al.  Very large virtual compound spaces: construction, storage and utility in drug discovery. , 2013, Drug discovery today. Technologies.

[33]  Allan M Jordan,et al.  The medicinal chemist's toolbox: an analysis of reactions used in the pursuit of drug candidates. , 2011, Journal of medicinal chemistry.