Robust Digital Speech Watermarking For Online Speaker Recognition

A robust and blind digital speech watermarking technique has been proposed for online speaker recognition systems based on Discrete Wavelet Packet Transform (DWPT) and multiplication to embed the watermark in the amplitudes of the wavelet’s subbands. In order to minimize the degradation effect of the watermark, these subbands are selected where less speaker-specific information was available (500 Hz–3500 Hz and 6000 Hz–7000 Hz). Experimental results on Texas Instruments Massachusetts Institute of Technology (TIMIT), Massachusetts Institute of Technology (MIT), and Mobile Biometry (MOBIO) show that the degradation for speaker verification and identification is 1.16% and 2.52%, respectively. Furthermore, the proposed watermark technique can provide enough robustness against different signal processing attacks.

[1]  Farrokh Marvasti,et al.  Robust Multiplicative Audio and Speech Watermarking Using Statistical Modeling , 2009, 2009 IEEE International Conference on Communications.

[2]  Zhen Li,et al.  A Robust Audio Watermarking Scheme Based on Lifting Wavelet Transform and Singular Value Decomposition , 2011, IWDW.

[3]  Marcos Faúndez-Zanuy,et al.  Speaker verification security improvement by means of speech watermarking , 2006, Speech Commun..

[4]  Syed Abdul Rahman Al-Haddad,et al.  Digital Audio and Speech Watermarking Based on the Multiple Discrete Wavelets Transform and Singular Value Decomposition , 2012, 2012 Sixth Asia Modelling Symposium.

[5]  Fathi E. Abd El-Samie,et al.  An SVD audio watermarking approach using chaotic encrypted images , 2011, Digit. Signal Process..

[6]  Jianwu Dang,et al.  An investigation of dependencies between frequency components and speaker characteristics for text-independent speaker identification , 2008, Speech Commun..

[7]  Farrokh Marvasti,et al.  Robust audio and speech watermarking using Gaussian and Laplacian modeling , 2010, Signal Process..

[8]  Syed Abdul Rahman Al-Haddad,et al.  Distant Speaker Recognition: An Overview , 2016, Int. J. Humanoid Robotics.

[9]  Faraneh Zarafshan,et al.  Blind digital speech watermarking based on Eigen-value quantization in DWT , 2015, J. King Saud Univ. Comput. Inf. Sci..

[10]  Shyamala Doraisamy,et al.  Speaker Frame Selection for Digital Speech Watermarking , 2016 .

[11]  Syed Abdul Rahman Al-Haddad,et al.  Semi-fragile digital speech watermarking for online speaker recognition , 2015, EURASIP J. Audio Speech Music. Process..

[12]  Indranil Sengupta,et al.  A New Audio Watermarking Scheme Based on Singular Value Decomposition and Quantization , 2011, Circuits Syst. Signal Process..

[13]  Scott Craver,et al.  Additive attacks on speaker recognition , 2014, Electronic Imaging.

[14]  Jianwu Dang,et al.  An investigation of dependencies between frequency components and speaker characteristics based on phoneme mean F-ratio contribution , 2012, Proceedings of The 2012 Asia Pacific Signal and Information Processing Association Annual Summit and Conference.

[15]  Gernot Kubin,et al.  Speaker identification security improvement by means of speech watermarking , 2007, Pattern Recognit..

[16]  Shyamala Doraisamy,et al.  Digital speech watermarking for anti-spoofing attack in speaker recognition , 2014, 2014 IEEE REGION 10 SYMPOSIUM.

[17]  Syed Abdul Rahman Al-Haddad,et al.  An overview of digital speech watermarking , 2013, Int. J. Speech Technol..

[18]  Haizhou Li,et al.  Spoofing and countermeasures for speaker verification: A survey , 2015, Speech Commun..

[19]  Faraneh Zarafshan,et al.  Interacting video information via speech watermarking for mobile second screen in Android smartphone , 2013, 2013 IEEE Student Conference on Research and Developement.

[20]  Indranil Sengupta,et al.  An adaptive audio watermarking based on the singular value decomposition in the wavelet domain , 2010, Digit. Signal Process..

[21]  Andreas Uhl,et al.  Watermarking as a Means to Enhance Biometric Systems: A Critical Survey , 2011, Information Hiding.

[22]  Mohammad Ali Akhaee,et al.  Robust Multiplicative Patchwork Method for Audio Watermarking , 2009, IEEE Transactions on Audio, Speech, and Language Processing.

[23]  Jean-François Bonastre,et al.  Localization and selection of speaker-specific information with statistical modeling , 2000, Speech Commun..