State of the art automatic speaker recognition systems show very good results in the discrimination between different speakers under controlled recording conditions. In a forensic context, the conditions are uncontrolled and voice can be disguised. In cases of terrorism claim, extortion or kidnapping, it is of great interest for offenders to conceal their identity. Voice disguise is an important constraint to speaker discrimination. Some disguises produce a great variation of parameters and change the perception of an identity. The main risk is to confound a disguised voice and a normal voice and accuse an innocent individual. This paper proposes on one hand to present the impact of voice disguise on automatic speaker recognition and, on the other hand a statistical study in order to detect and identify four disguises among the most common. The first step consists in extracting features and the second step to classify them. MFCC (Mel Frequency Cepstral Coefficient) are considered as features and different...
[1]
Niklas Torstensson,et al.
Mimicked accents : Do speakers have similar cognitive prototypes?
,
2004
.
[2]
Vladimir N. Vapnik,et al.
The Nature of Statistical Learning Theory
,
2000,
Statistics for Engineering and Information Science.
[3]
Elizabeth Shriberg,et al.
A Study of Intentional Voice Modifications for Evading Automatic Speaker Recognition
,
2006,
2006 IEEE Odyssey - The Speaker and Language Recognition Workshop.
[4]
Javier Ortega-Garcia,et al.
Effect of voice disguise on the performance of a forensic automatic speaker recognition system
,
2004,
Odyssey.
[5]
Daniel Elenius,et al.
Speaker verification scores and acoustic analysis of a professional impersonator
,
2004
.
[6]
Alvin F. Martin,et al.
The DET curve in assessment of detection task performance
,
1997,
EUROSPEECH.
[7]
Voice disguise and automatic detection
,
2005
.
[8]
Harry Hollien,et al.
Forensic Voice Identification
,
2001
.