Classification of Pitch Disguise Level with Artificial Neural Networks

In audio forensics, voice disguise raises many serious challenges in finding the criminal suspects. Most of the criminals usually disguise their voice just before and/or after committing a crime. Hence, to perform forensic speaker verification, original voice is to be recovered from the disguised voice. Pitch disguise is one type of voice disguise which results in very low speaker recognition rate compared to other types of disguises. Original voice can be easily recovered from the pitch disguised voice if the level of disguise is known. Hence, this paper proposes a novel technique for finding the level of pitch disguise using Artificial Neural Networks (ANN) modelling. The performance of the system is measured in terms of accuracy and the proposed technique gives an accuracy of 96.23%

[1]  B. Morrongiello,et al.  Development of the perception of musical relations: semitone and diatonic structure. , 1986, Journal of experimental psychology. Human perception and performance.

[2]  Tiejun Tan,et al.  The effect of voice disguise on Automatic Speaker Recognition , 2010, 2010 3rd International Congress on Image and Signal Processing.

[3]  Yong Wang,et al.  Identification of Electronic Disguised Voices , 2014, IEEE Transactions on Information Forensics and Security.

[4]  Heaton T. Jeff,et al.  Introduction to Neural Networks with Java , 2005 .

[5]  R. Rodman SPEAKER RECOGNITION OF DISGUISED VOICES : A PROGRAM FOR RESEARCH , 2000 .

[6]  Cuiling Zhang Acoustic analysis of disguised voices with raised and lowered pitch , 2012, 2012 8th International Symposium on Chinese Spoken Language Processing.

[7]  Cuiling Zhang,et al.  Voice disguise and automatic speaker recognition. , 2008, Forensic science international.

[8]  Tan Yee Fan,et al.  A Tutorial on Support Vector Machine , 2009 .

[9]  Steve Renals,et al.  Speaker verification using sequence discriminant support vector machines , 2005, IEEE Transactions on Speech and Audio Processing.

[10]  Douglas A. Reynolds,et al.  Robust text-independent speaker identification using Gaussian mixture speaker models , 1995, IEEE Trans. Speech Audio Process..

[11]  Christopher J. C. Burges,et al.  A Tutorial on Support Vector Machines for Pattern Recognition , 1998, Data Mining and Knowledge Discovery.