Computational Assessment of Chemical Saturation of Analogue Series under Varying Conditions

Assessing the degree to which analogue series are chemically saturated is of major relevance in compound optimization. Decisions to continue or discontinue series are typically made on the basis of subjective judgment. Currently, only very few methods are available to aid in decision making. We further investigate and extend a computational concept to quantitatively assess the progression and chemical saturation of a series. To these ends, existing analogues and virtual candidates are compared in chemical space and compound neighborhoods are systematically analyzed. A large number of analogue series from different sources are studied, and alternative chemical space representations and virtual analogues of different designs are explored. Furthermore, evolving analogue series are distinguished computationally according to different saturation levels. Taken together, our findings provide a basis for practical applications of computational saturation analysis in compound optimization.

[1]  Jürgen Bajorath,et al.  SAR Monitoring of Evolving Compound Data Sets Using Activity Landscapes , 2011, J. Chem. Inf. Model..

[2]  Jürgen Bajorath,et al.  Computational method for estimating progression saturation of analog series , 2018, RSC advances.

[3]  Robert D Clark,et al.  Neighborhood behavior: a useful concept for validation of "molecular diversity" descriptors. , 1996, Journal of medicinal chemistry.

[4]  Jürgen Bajorath,et al.  Computational design of new molecular scaffolds for medicinal chemistry, part II: generalization of analog series-based scaffolds , 2017, Future science OA.

[5]  Jens Sadowski,et al.  Structure Modification in Chemical Databases , 2005 .

[6]  Michael M. Hann,et al.  RECAP-Retrosynthetic Combinatorial Analysis Procedure: A Powerful New Technique for Identifying Privileged Molecular Fragments with Useful Applications in Combinatorial Chemistry , 1998, J. Chem. Inf. Comput. Sci..

[7]  George Papadatos,et al.  The ChEMBL bioactivity database: an update , 2013, Nucleic Acids Res..

[8]  Jameed Hussain,et al.  Computationally Efficient Algorithm to Identify Matched Molecular Pairs (MMPs) in Large Data Sets , 2010, J. Chem. Inf. Model..

[9]  Jürgen Bajorath,et al.  Analog series-based scaffolds: computational design and exploration of a new type of molecular scaffolds for medicinal chemistry , 2016, Future science OA.

[10]  J. Bajorath,et al.  Local structural changes, global data views: graphical substructure-activity relationship trailing. , 2011, Journal of medicinal chemistry.

[11]  Mark Munson,et al.  Lead optimization attrition analysis (LOAA): a novel and general methodology for medicinal chemistry. , 2015, Drug discovery today.

[12]  G. Maggiora,et al.  Molecular similarity in medicinal chemistry. , 2014, Journal of medicinal chemistry.

[13]  Jürgen Bajorath,et al.  Matched molecular pairs derived by retrosynthetic fragmentation , 2014 .

[14]  Anne Mai Wassermann,et al.  SAR Matrices: Automated Extraction of Information-Rich SAR Tables from Large Compound Data Sets , 2012, J. Chem. Inf. Model..

[15]  Yanli Wang,et al.  PubChem BioAssay: 2017 update , 2016, Nucleic Acids Res..

[16]  Jürgen Bajorath,et al.  Computational Method for the Systematic Identification of Analog Series and Key Compounds Representing Series and Their Biological Activity Profiles. , 2016, Journal of medicinal chemistry.

[17]  Jürgen Bajorath,et al.  Monitoring the Progression of Structure-Activity Relationship Information during Lead Optimization. , 2016, Journal of medicinal chemistry.

[18]  Matthew Segall,et al.  Advances in multiparameter optimization methods for de novo drug design , 2014, Expert opinion on drug discovery.

[19]  Jürgen Bajorath,et al.  The 'SAR Matrix' method and its extensions for applications in medicinal chemistry and chemogenomics. , 2014, F1000Research.

[20]  Andrew T Maynard,et al.  Quantifying, Visualizing, and Monitoring Lead Optimization. , 2016, Journal of medicinal chemistry.