Many bioinformatics algorithms can be understood as binary classifiers. They are usually compared using the area under the receiver operating characteristic ( ROC) curve. On the other hand, choosing the best threshold for practical use is a complex task, due to uncertain and context-dependent skews in the abundance of positives in nature and in the yields/costs for correct/incorrect classification. We argue that considering a classifier as a player in a zero-sum game allows us to use the minimax principle from game theory to determine the optimal operating point. The proposed classifier threshold corresponds to the intersection between the ROC curve and the descending diagonal in ROC space and yields a minimax accuracy of 1-FPR. Our proposal can be readily implemented in practice, and reveals that the empirical condition for threshold estimation of “specificity equals sensitivity” maximizes robustness against uncertainties in the abundance of positives in nature and classification costs.
[1]
Tom Fawcett,et al.
An introduction to ROC analysis
,
2006,
Pattern Recognit. Lett..
[2]
William Stafford Noble,et al.
Assessing computational tools for the discovery of transcription factor binding sites
,
2005,
Nature Biotechnology.
[3]
Okeh Um,et al.
Evaluating Measures of Indicators of Diagnostic Test Performance:Fundamental Meanings and Formulars
,
2012
.
[4]
J A Swets,et al.
Better decisions through science.
,
2000,
Scientific American.
[5]
Morten Nielsen,et al.
Towards High-throughput Immunomics for Infectious Diseases: Use of Next-generation Peptide Microarrays for Rapid Discovery and Mapping of Antigenic Determinants*
,
2015,
Molecular & Cellular Proteomics.