MOTIVATION
We propose a novel method for scoring the accuracy of protein binding site predictions-the Binding-site Distance Test (BDT) score. Recently, the Matthews Correlation Coefficient (MCC) has been used to evaluate binding site predictions, both by developers of new methods and by the assessors for the community-wide prediction experiment-CASP8. While being a rigorous scoring method, the MCC does not take into account the actual 3D location of the predicted residues from the observed binding site. Thus, an incorrectly predicted site that is nevertheless close to the observed binding site will obtain an identical score to the same number of non-binding residues predicted at random. The MCC is somewhat affected by the subjectivity of determining observed binding residues and the ambiguity of choosing distance cutoffs. By contrast the BDT method produces continuous scores ranging between 0 and 1, relating to the distance between the predicted and observed residues. Residues predicted close to the binding site will score higher than those more distant, providing a better reflection of the true accuracy of predictions. The CASP8 function predictions were evaluated using both the MCC and BDT methods and the scores were compared. The BDT was found to strongly correlate with the MCC scores while also being less susceptible to the subjectivity of defining binding residues. We therefore suggest that this new simple score is a potentially more robust method for future evaluations of protein-ligand binding site predictions.
AVAILABILITY
http://www.reading.ac.uk/bioinf/downloads/.
[1]
Alfonso Valencia,et al.
Assessment of predictions submitted for the CASP7 function prediction category.
,
2007,
Proteins.
[2]
B. Matthews.
Comparison of the predicted and observed secondary structure of T4 phage lysozyme.
,
1975,
Biochimica et biophysica acta.
[3]
Frank Eisenhaber,et al.
Prediction of Protein Function
,
2006
.
[4]
Michael I. Jordan,et al.
Active site prediction using evolutionary and structural information
,
2010,
Bioinform..
[5]
Gonzalo López,et al.
Assessment of ligand binding residue predictions in CASP8
,
2009,
Proteins.
[6]
Anna Tramontano,et al.
The prediction of protein function at CASP6
,
2005,
Proteins.
[7]
Keehyoung Joo,et al.
Protein‐binding site prediction based on three‐dimensional protein modeling
,
2009,
Proteins.
[8]
Michael J E Sternberg,et al.
Prediction of ligand binding sites using homologous structures and conservation at CASP8
,
2009,
Proteins.