Combination of molecular similarity measures using data fusion

Many different measures of structural similarity have been suggested for matching chemical structures, each such measure focusing upon some particular type of molecular characteristic. The multi-faceted nature of biological activity suggests that an appropriate similarity measure should encompass many different types of characteristic, and this article discusses the use of data fusion methods to combine the results of searches based on multiple similarity measures. Experiments with several different types of dataset and activity suggest that data fusion provides a simple, but effective, approach to the combination of individual similarity measures. The best results were generally obtained with a fusion rule that sums the rank positions achieved by each molecule in searches using individual measures.

[1]  W. W. Daniel Applied Nonparametric Statistics , 1979 .

[2]  P. Willett,et al.  A Comparison of Some Measures for the Determination of Inter‐Molecular Structural Similarity Measures of Inter‐Molecular Structural Similarity , 1986 .

[3]  Peter Willett,et al.  Implementation and use of an atom-mapping procedure for similarity searching in databases of 3-D chemical structures , 1990 .

[4]  P. Jurs,et al.  Development and use of charged partial surface area structural descriptors in computer-assisted quantitative structure-property relationship studies , 1990 .

[5]  D. L. Hall,et al.  Mathematical Techniques in Multisensor Data Fusion , 1992 .

[6]  Peter Willett,et al.  Similarity searching in files of three-dimensional chemical structures: Comparison of fragment-based measures of shape similarity , 1994, J. Chem. Inf. Comput. Sci..

[7]  M. Kokar,et al.  Preface to the special section on data fusion: Architectures and issues , 1994 .

[8]  E. A. Fox,et al.  Combining the Evidence of Multiple Query Representations for Information Retrieval , 1995, Inf. Process. Manag..

[9]  Norbert Fuhr,et al.  Retrieval Effectiveness of Proper Name Search Methods , 1996, Inf. Process. Manag..

[10]  Robert P. Sheridan,et al.  Chemical Similarity Using Geometric Atom Pair Descriptors , 1996, J. Chem. Inf. Comput. Sci..

[11]  Hideyuki Masui,et al.  SPECTRA: A Spectral Information Management System Featuring a Novel Combined Search Function , 1996, J. Chem. Inf. Comput. Sci..

[12]  Robert P. Sheridan,et al.  Chemical Similarity Using Physiochemical Property Descriptors , 1996, J. Chem. Inf. Comput. Sci..

[13]  Peter Willett,et al.  Similarity Searching in Files of Three-Dimensional Chemical Structures: Evaluation of the EVA Descriptor and Combination of Rankings Using Data Fusion , 1997, J. Chem. Inf. Comput. Sci..

[14]  John M. Barnard,et al.  Chemical Similarity Searching , 1998, J. Chem. Inf. Comput. Sci..

[15]  P. Schleyer Encyclopedia of computational chemistry , 1998 .

[16]  Peter Willett,et al.  Designing bioactive molecules : three-dimensional techniques and applications , 1998 .

[17]  Sung-Sau So,et al.  A comparative study of ligand-receptor complex binding affinity prediction methods based on glycogen phosphorylase inhibitors , 1999, J. Comput. Aided Mol. Des..