Ligand-Based Virtual Screening Using Bayesian Inference Network and Reweighted Fragments

Many similarity-based virtual screening approaches implicitly assume that molecular fragments unrelated to biological activity carry the same weight as the important ones. This limitation motivated the use of Bayesian networks as an alternative to existing tools for similarity-based virtual screening. In our recent work, the retrieval performance of the Bayesian inference network (BIN) improved significantly when molecular fragments were reweighted using relevance feedback information. In this paper, a set of active reference structures was used to reweight the fragments in the reference structure: higher weights were assigned to fragments that occur more frequently in the set of active reference structures, while the others were penalized. Simulated virtual screening experiments with MDL Drug Data Report datasets showed that the proposed approach significantly improved the retrieval effectiveness of ligand-based virtual screening, especially when the active molecules being sought had a high degree of structural heterogeneity.
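
The reweighting idea can be illustrated with a minimal sketch. The abstract does not give the weighting formula, so everything below is an assumption made for illustration: molecules are represented as sets of fragment identifiers, a fragment's weight is taken to be the fraction of active reference structures that contain it, fragments absent from the active set are penalized with a hypothetical `floor` weight, and a weighted Tanimoto coefficient stands in for the Bayesian inference network, which is not implemented here.

```python
from collections import Counter


def reweight_fragments(reference, active_set, floor=0.1):
    """Weight each fragment of `reference` by how often it occurs in `active_set`.

    Assumed scheme (not taken from the paper): weight = fraction of active
    reference structures containing the fragment, never lower than `floor`.
    """
    n = len(active_set)
    # Each molecule is a set, so a fragment is counted once per molecule.
    freq = Counter(frag for mol in active_set for frag in mol)
    return {frag: max(freq[frag] / n, floor) for frag in reference}


def weighted_tanimoto(weights, candidate, default=0.1):
    """Weighted Tanimoto score between the reweighted reference and a candidate.

    Fragments found only in the candidate get the hypothetical `default` weight.
    """
    shared = sum(w for frag, w in weights.items() if frag in candidate)
    ref_total = sum(weights.values())
    cand_only = sum(default for frag in candidate if frag not in weights)
    return shared / (ref_total + cand_only)


# Toy data: fragment identifiers are arbitrary labels, not real descriptors.
reference = {"a", "b", "c"}
active_set = [{"a", "b"}, {"a", "d"}, {"a", "b", "e"}, {"a"}]
weights = reweight_fragments(reference, active_set)

database = {"mol1": {"a", "x"}, "mol2": {"c", "x"}}
ranking = sorted(database,
                 key=lambda m: weighted_tanimoto(weights, database[m]),
                 reverse=True)
```

In this toy case, fragment `a` occurs in every active reference structure and so dominates the score, whereas `c` occurs in none and contributes only the floor weight. A candidate that shares `a` with the reference therefore ranks above one that shares `c`, even though each shares exactly one fragment.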
