Results of the 2003 NFI-TNO forensic speaker recognition evaluation

In this paper we report on the results of the NFI-TNO speaker recognition evaluation held in 2003. The speech material used in this evaluation was obtained from wire-tapped recordings from real police investigations in the Netherlands. In total, six experiments were carried out: one main experiment in Dutch, one experiment in which speech lengths were systematically varied, three language dependence experiments, and one experiment evaluating a proposed forensic procedure for providing evidence in court cases. The lowest equal error rate across all systems was 12.1%, obtained in the condition using 15-second test segments and 60-second training segments.
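
For readers unfamiliar with the metric, the equal error rate (EER) is the operating point at which the miss rate (target trials rejected) equals the false-alarm rate (non-target trials accepted). The following is a minimal sketch of how an EER can be approximated from two sets of trial scores. The function name `equal_error_rate`, the convention that higher scores indicate the target speaker, and the toy scores are assumptions made for illustration; they are not taken from the evaluation or from any participating system.

```python
def equal_error_rate(target_scores, nontarget_scores):
    """Approximate the EER from two score lists (higher = more target-like).

    Illustrative sketch only: sweeps every observed score as a threshold
    and returns the mean of miss and false-alarm rates where they are closest.
    """
    thresholds = sorted(set(target_scores) | set(nontarget_scores))
    best_gap, eer = None, None
    for t in thresholds:
        # Miss: a target trial scoring below the threshold is rejected.
        p_miss = sum(s < t for s in target_scores) / len(target_scores)
        # False alarm: a non-target trial scoring at or above the threshold is accepted.
        p_fa = sum(s >= t for s in nontarget_scores) / len(nontarget_scores)
        gap = abs(p_miss - p_fa)
        if best_gap is None or gap < best_gap:
            best_gap, eer = gap, (p_miss + p_fa) / 2
    return eer


# Toy scores (hypothetical, not data from the evaluation).
toy_targets = [2.0, 1.5, 0.5, 3.0]
toy_nontargets = [0.0, 1.0, -1.0, 0.2]
toy_eer = equal_error_rate(toy_targets, toy_nontargets)
print(f"EER = {toy_eer:.1%}")
```

With few trials the two error rates rarely cross exactly, which is why this sketch averages them at the closest threshold; evaluations with many trials typically interpolate along the error trade-off curve instead.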