Auditory detectability of vocal ageing and its effect on forensic automatic speaker recognition

The comparison of non-contemporary speech samples is common in forensic speaker recognition cases. It has yet to be established however, to what extent the time interval between non-contemporary samples can increase before a problem is created for forensic automatic speaker recognition. This paper presents results of a human listener test designed to evaluate the detectability of vocal ageing over increasing intervals of up to 30 years. Subsequently, a forensic automatic speaker recognition evaluation of 15 ageing males at increasing intervals of up to 60 years is presented. It is shown that at intervals of around 10 years, the average detectability of vocal ageing by humans is just above chance. As the interval rises to 30 years, vocal ageing is detected 90% of the time. In the automatic system, vocal ageing is manifested as a drop in intra-speaker likelihood ratios (LRs) as the time interval between non-contemporary samples increases. At an interval of 30 years, LRs for the vast majority of intra-speaker comparisons fall below a value of 100 commonly interpreted as ‘moderate support’ on a verbal LR scale. Our findings indicate that at a time-lapse of 30 years, vocal ageing creates significant problems for forensic automatic speaker recognition.

[1]  Richard William Rhodes,et al.  Assessing the strength of non-contemporaneous forensic speech evidence , 2012 .

[2]  I. Evett,et al.  The nature of forensic science opinion--a possible framework to guide thinking and practice in investigations and in court proceedings. , 2006, Science & justice : journal of the Forensic Science Society.

[3]  Douglas A. Reynolds,et al.  Speaker Verification Using Adapted Gaussian Mixture Models , 2000, Digit. Signal Process..

[4]  Phil Rose,et al.  Technical forensic speaker recognition: Evaluation, types and testing of evidence , 2006, Comput. Speech Lang..

[5]  Philip Rose Forensic Speaker Identification , 2002 .

[6]  Rahul Shrivastav,et al.  Noise and tremor in the perception of vocal aging in males. , 2010, Journal of voice : official journal of the Voice Foundation.

[7]  M. Dixit,et al.  Speaker Identification , 2011 .

[8]  Joan E Sussman,et al.  Changes in acoustic characteristics of the voice across the life span: measures from individuals 4-93 years of age. , 2011, Journal of speech, language, and hearing research : JSLHR.

[9]  Joaquín González-Rodríguez Forensic automatic speaker recognition: fiction or science? , 2008, INTERSPEECH.

[10]  Didier Meuwly,et al.  The inference of identity in forensic speaker recognition , 2000, Speech Commun..

[11]  W Decoster,et al.  Longitudinal voice changes: facts and interpretation. , 2000, Journal of voice : official journal of the Voice Foundation.

[12]  P. Mueller The Aging Voice , 1997, Seminars in speech and language.

[13]  Haizhou Li,et al.  An overview of text-independent speaker recognition: From features to supervectors , 2010, Speech Commun..

[14]  Michael Apted The up series , 2007 .

[15]  Jessica A Barlow,et al.  Age-related changes in acoustic characteristics of adult speech. , 2009, Journal of communication disorders.

[16]  Didier Meuwly Forensic speaker recognition: an evidence odyssey , 2004, Odyssey.

[17]  Andrzej Drygajlo,et al.  Compensating for Ageing and Quality variation in Speaker Verification , 2012, INTERSPEECH.

[18]  Javier Ortega-Garcia,et al.  Robust estimation, interpretation and assessment of likelihood ratios in forensic speaker recognition , 2006, Comput. Speech Lang..

[19]  H Hollien,et al.  Speaker identification utilizing noncontemporary speech. , 2001, Journal of forensic sciences.

[20]  Didier Meuwly,et al.  Forensic speaker recognition based on a Bayesian framework and Gaussian mixture modelling (GMM) , 2001, Odyssey.

[21]  Jeff A. Bilmes,et al.  A gentle tutorial of the em algorithm and its application to parameter estimation for Gaussian mixture and hidden Markov models , 1998 .

[22]  Peter French,et al.  R v John Samuel Humble: The Yorkshire Ripper Hoaxer Trial , 2007 .

[23]  Didier Meuwly Forensic speaker recognition : An evidence odyssey – Summary , 2004 .

[24]  Susanne Schötz,et al.  Perception, Analysis and Synthesis of Speaker Age , 2006 .

[25]  A. Drygajlo,et al.  Forensic Automatic Speaker Recognition [Exploratory DSP] , 2007, IEEE Signal Processing Magazine.

[26]  Hermann J. Künzel Non-contemporary speech samples: auditory detectability of an 11 year delay and itseffect on automatic speaker identification , 2007 .

[27]  Andrzej Drygajlo,et al.  Speaker verification in score-ageing-quality classification space , 2013, Comput. Speech Lang..