Large-Scale Assessment of Activity Landscape Feature Probabilities of Bioactive Compounds

Activity landscape representations integrate pairwise compound similarity and potency relationships and provide direct access to characteristic structure-activity relationship features in compound data sets. Because pairwise compound comparisons provide the foundation of activity landscape design, the assessment of specific landscape features such as activity cliffs has generally been confined to the level of compound pairs. A conditional probability-based approach has been applied herein to assign most probable activity landscape features to individual compounds. For example, for a given data set compound, it was determined if it would preferentially engage in the formation of activity cliffs or other landscape features. In a large-scale effort, we have determined conditional activity landscape feature probabilities for more than 160,000 compounds with well-defined activity annotations contained in 427 different target-based data sets. These landscape feature probabilities provide a detailed view of how different activity landscape features are distributed over currently available bioactive compounds.

[1]  Jürgen Bajorath,et al.  Quantifying the Fingerprint Descriptor Dependence of Structure-Activity Relationship Information on a Large Scale , 2013, J. Chem. Inf. Model..

[2]  John P. Overington,et al.  ChEMBL: a large-scale bioactivity database for drug discovery , 2011, Nucleic Acids Res..

[3]  Jürgen Bajorath,et al.  Conditional Probabilities of Activity Landscape Features for Individual Compounds , 2013, J. Chem. Inf. Model..

[4]  Jürgen Bajorath,et al.  Recent progress in understanding activity cliffs and their utility in medicinal chemistry. , 2014, Journal of medicinal chemistry.

[5]  J. Bajorath,et al.  Activity landscape representations for structure-activity relationship analysis. , 2010, Journal of medicinal chemistry.

[6]  David Rogers,et al.  Extended-Connectivity Fingerprints , 2010, J. Chem. Inf. Model..

[7]  John M. Barnard,et al.  Chemical Similarity Searching , 1998, J. Chem. Inf. Comput. Sci..

[8]  Jürgen Bajorath,et al.  Frequency of Occurrence and Potency Range Distribution of Activity Cliffs in Bioactive Compounds , 2012, J. Chem. Inf. Model..

[9]  Xin Wen,et al.  BindingDB: a web-accessible database of experimentally determined protein–ligand binding affinities , 2006, Nucleic Acids Res..

[10]  Preeti Iyer,et al.  Activity Landscapes, Information Theory, and Structure – Activity Relationships , 2013, Molecular informatics.

[11]  Jürgen Bajorath,et al.  Exploring activity cliffs in medicinal chemistry. , 2012, Journal of medicinal chemistry.