An overview of digital speech watermarking

Digital speech watermarking is a robust way to hide and thus secure data like audio and video from any intentional or unintentional manipulation through transmission. In terms of some signal characteristics including bandwidth, voice/non-voice and production model, digital speech signal is different from audio, music and other signals. Although, various review articles on image, audio and video watermarking are available, there are still few review papers on digital speech watermarking. Therefore this article presents an overview of digital speech watermarking including issues of robustness, capacity and imperceptibility. Other issues discussed are types of digital speech watermarking, application, models and masking methods. This article further highlights the related challenges in the real world, research opportunities and future works in this area, yet to be explored fully.

[1]  Seymour Shlien,et al.  The modulated lapped transform, its time-varying forms, and its applications to audio coding standards , 1997, IEEE Trans. Speech Audio Process..

[2]  Henrique S. Malvar Lapped transforms for efficient transform/subband coding , 1990, IEEE Trans. Acoust. Speech Signal Process..

[3]  Jian Liu,et al.  Quantization Index Modulation audio watermarking system using a psychoacoustic model , 2011, 2011 8th International Conference on Information, Communications & Signal Processing.

[4]  Qiang Cheng,et al.  Spread spectrum signaling for speech watermarking , 2001, 2001 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.01CH37221).

[5]  Sheng-He Sun,et al.  Multipurpose image watermarking algorithm based on multistage vector quantization , 2005, IEEE Transactions on Image Processing.

[6]  Gernot Kubin,et al.  Speech watermarking for air traffic control , 2004, 2004 12th European Signal Processing Conference.

[7]  W. Bastiaan Kleijn,et al.  On phase perception in speech , 1999, 1999 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings. ICASSP99 (Cat. No.99CH36258).

[8]  William Stallings,et al.  Cryptography and network security , 1998 .

[9]  Marcos Faúndez-Zanuy,et al.  Speaker verification security improvement by means of speech watermarking , 2006, Speech Commun..

[10]  Seyed Mohammad Ahadi,et al.  Watermarking of speech signal through phase quantization of sinusoidal model , 2011, 2011 19th Iranian Conference on Electrical Engineering.

[11]  Masashi Unoki,et al.  Embedding Limitations with Digital-audio Watermarking Method Based on Cochlear Delay Characteristics , 2011, J. Inf. Hiding Multim. Signal Process..

[12]  Richard Heusdens,et al.  A Low-Complexity Spectro-Temporal Distortion Measure for Audio Processing Applications , 2012, IEEE Transactions on Audio, Speech, and Language Processing.

[13]  Parul Garg,et al.  A Combined Watermarking and Encryption Algorithm for Secure VoIP , 2009, Inf. Secur. J. A Glob. Perspect..

[14]  Wei Yang,et al.  An Information-Hiding Model for Secure Communication , 2007, ICIC.

[15]  A. Spanias,et al.  Perceptual coding of digital audio , 2000, Proceedings of the IEEE.

[16]  T. Dau,et al.  A quantitative model of the "effective" signal processing in the auditory system. II. Simulations and measurements. , 1996, The Journal of the Acoustical Society of America.

[17]  H. Levitt Transformed up-down methods in psychoacoustics. , 1971, The Journal of the Acoustical Society of America.

[18]  Henrique S. Malvar,et al.  Signal processing with lapped transforms , 1992 .

[19]  Taku Komura,et al.  Automatic Panel Extraction of Color Comic Images , 2007, PCM.

[20]  Harald Pobloth Perceptual and Squared Error Aspects in Speech and Audio Coding , 2004 .

[21]  David Malah,et al.  Bandwidth Extension of Telephone Speech Aided by Data Embedding , 2007, EURASIP J. Adv. Signal Process..

[22]  Ian McLoughlin,et al.  Applied Speech and Audio Processing: With Matlab Examples , 2009 .

[23]  Peter Vary,et al.  High rate data hiding in ACELP speech codecs , 2008, 2008 IEEE International Conference on Acoustics, Speech and Signal Processing.

[24]  Mauro Barni,et al.  Watermarking Systems Engineering: Enabling Digital Assets Security and Other Applications , 2007 .

[25]  Kuldip K. Paliwal,et al.  Usefulness of phase spectrum in human speech perception , 2003, INTERSPEECH.

[26]  Peter Vary,et al.  Digital Speech Transmission: Enhancement, Coding and Error Concealment , 2006 .

[27]  Gernot Kubin,et al.  Performance of noise excitation for unvoiced speech , 1993, Proceedings., IEEE Workshop on Speech Coding for Telecommunications,.

[28]  Alex Acero,et al.  Spoken Language Processing: A Guide to Theory, Algorithm and System Development , 2001 .

[29]  Thomas M. Cover,et al.  Elements of Information Theory , 2005 .

[30]  Douglas D. O'Shaughnessy,et al.  Speech enhancement based conceptually on auditory evidence , 1991, IEEE Trans. Signal Process..

[31]  Rubo Zhang,et al.  Speech Hiding Based on Auditory Wavelet , 2004, ICCSA.

[32]  Gernot Kubin,et al.  Speech watermarking for the VHF radio channel , 2005 .

[33]  P. P. Vaidyanathan,et al.  A Kaiser window approach for the design of prototype filters of cosine modulated filterbanks , 1998, IEEE Signal Processing Letters.

[34]  Marcos Faúndez-Zanuy Digital Watermarking: New Speech and Image Applications , 2009, NOLISP.

[35]  Shi-Huang Chen,et al.  Speech Watermarking Based on Wavelet Transform and BCH Coding , 2008, 2008 IEEE International Conference on Sensor Networks, Ubiquitous, and Trustworthy Computing (sutc 2008).

[36]  Gernot Kubin,et al.  Speaker identification security improvement by means of speech watermarking , 2007, Pattern Recognit..

[37]  Dimitrios Hatzinakos,et al.  Multiresolution digital watermarking: algorithms and implications for multimedia signals , 1999 .

[38]  David W. Tempest,et al.  The Noise Handbook , 1985 .

[39]  Ahmed H. Tewfik,et al.  Robust audio watermarking using perceptual masking , 1998, Signal Process..

[40]  Mark F. Bocko,et al.  Data hiding via phase manipulation of audio signals , 2004, 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[41]  R. Gray,et al.  Distortion measures for speech processing , 1980 .

[42]  Laurent Girin,et al.  Watermarking of speech signals using the sinusoidal model and frequency modulation of the partials , 2004, 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[43]  Oscal T.-C. Chen,et al.  Content-Dependent Watermarking Scheme in Compressed Speech With Identifying Manner and Location of Attacks , 2007, IEEE Transactions on Audio, Speech, and Language Processing.

[44]  P M Seligman,et al.  Preliminary evaluation of a formant enhancement algorithm on the perception of speech in noise for normally hearing listeners. , 1994, Audiology : official organ of the International Society of Audiology.

[45]  Jesper Jensen,et al.  A Perceptual Model for Sinusoidal Audio Coding Based on Spectral Integration , 2005, EURASIP J. Adv. Signal Process..

[46]  B. Atal,et al.  Optimizing digital speech coders by exploiting masking properties of the human ear , 1978 .

[47]  T Dau,et al.  A quantitative model of the "effective" signal processing in the auditory system. I. Model structure. , 1996, The Journal of the Acoustical Society of America.

[48]  Henry Leung,et al.  Speech Bandwidth Extension by Data Hiding and Phonetic Classification , 2007, 2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP '07.

[49]  A. Murat Tekalp,et al.  Pitch and duration modification for speech watermarking , 2005, Proceedings. (ICASSP '05). IEEE International Conference on Acoustics, Speech, and Signal Processing, 2005..

[50]  Marcos Faundez-Zanuy,et al.  Speech Watermarking: An Approach for the Forensic Analysis of Digital Telephonic Recordings * , 2010, Journal of forensic sciences.

[51]  Siba Prasada Panigrahi,et al.  An Efficient Noise Generator for Validation of Channels Equalizers , 2011, J. Signal Inf. Process..

[52]  Gernot Kubin,et al.  Speech Watermarking for Analog Flat-Fading Bandpass Channels , 2009, IEEE Transactions on Audio, Speech, and Language Processing.

[53]  L. F. Turner,et al.  Modelling the detectability of changes in auditory signals , 1993, 1993 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[54]  John R. Deller,et al.  Digital watermarking of speech signals for the National Gallery of the Spoken Word , 2000, 2000 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.00CH37100).

[55]  Ingemar J. Cox,et al.  Digital Watermarking , 2003, Lecture Notes in Computer Science.

[56]  Eric Moulines,et al.  Non-parametric techniques for pitch-scale and time-scale modification of speech , 1995, Speech Commun..

[57]  C. Mosquera,et al.  Rational dither modulation: a high-rate data-hiding method invariant to gain attacks , 2005, IEEE Transactions on Signal Processing.

[58]  Elizabeth Chang,et al.  Secure communication in wireless multimedia sensor networks using watermarking , 2010, 4th IEEE International Conference on Digital Ecosystems and Technologies.

[59]  Wai C. Chu,et al.  Speech Coding Algorithms: Foundation and Evolution of Standardized Coders , 2003 .

[60]  Jeng-Shyang Pan,et al.  Speech Authentication by Semi-fragile Watermarking , 2005, KES.

[61]  Jeng-Shyang Pan,et al.  Tabu search based multi-watermarks embedding algorithm with multiple description coding , 2011, Inf. Sci..

[62]  Jie Zhu,et al.  Robust speech watermarking algorithm , 2007 .

[63]  S. Boll,et al.  Suppression of acoustic noise in speech using spectral subtraction , 1979 .

[64]  Chao Wu,et al.  Design and Implementation of Steganographic Speech Telephone , 2007, PCM.

[65]  Bin Yan,et al.  Speech authentication by semi-fragile speech watermarking utilizing analysis by synthesis and spectral distortion optimization , 2011, Multimedia Tools and Applications.

[66]  Tolga Çiloglu,et al.  An improved all-pass watermarking scheme for speech and audio , 2000, 2000 IEEE International Conference on Multimedia and Expo. ICME2000. Proceedings. Latest Advances in the Fast Changing World of Multimedia (Cat. No.00TH8532).

[67]  Jean-Marie Moureaux,et al.  Hybrid transmission, compression and data hiding: quantisation index modulation as source coding strategy , 2004 .

[68]  Walter Bender,et al.  Techniques for Data Hiding , 1996, IBM Syst. J..

[69]  David H. Shur,et al.  On combining watermarking with perceptual coding , 1998, Proceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '98 (Cat. No.98CH36181).

[70]  A. A. Zaidan,et al.  A review of audio based steganography and digital watermarking , 2011 .

[71]  D. M. Green,et al.  Signal detection theory and psychophysics , 1966 .

[72]  Antonio Laganà,et al.  Computational Science and Its Applications – ICCSA 2004 , 2004, Lecture Notes in Computer Science.

[73]  Mauro Barni,et al.  Watermarking Systems Engineering (Signal Processing and Communications, 21) , 2004 .

[74]  Hsiang-Cheh Huang,et al.  Metadata-based image watermarking for copyright protection , 2010, Simul. Model. Pract. Theory.

[75]  Jean-Marie Moureaux,et al.  Indexing Lattice Vectors in a Joint Watermarking and Compression Scheme , 2006, 2006 IEEE International Conference on Acoustics Speech and Signal Processing Proceedings.

[76]  Henrique S. Malvar Extended lapped transforms: properties, applications, and fast algorithms , 1992, IEEE Trans. Signal Process..

[77]  Zunera Jalil Copyright Protection of Plain Text Using Digital Watermarking , 2010 .

[78]  Farrokh Marvasti,et al.  Robust audio and speech watermarking using Gaussian and Laplacian modeling , 2010, Signal Process..

[79]  Chuen-Ching Wang,et al.  A robust watermarking scheme combined with the FSVQ for images , 2005, Third International Conference on Information Technology and Applications (ICITA'05).

[80]  C.-C. Jay Kuo,et al.  Fragile speech watermarking based on exponential scale quantization for tamper detection , 2002, 2002 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[81]  Doh-Suk Kim,et al.  Perceptual phase quantization of speech , 2003, IEEE Trans. Speech Audio Process..

[82]  Ning Chen,et al.  Multipurpose speech watermarking based on multistage vector quantization of linear prediction coefficients , 2007 .

[83]  Bin Yan,et al.  Watermarking Combined with CELP Speech Coding for Authentication , 2005, IEICE Trans. Inf. Syst..

[84]  Kuldip K. Paliwal,et al.  Speech Coding and Synthesis , 1995 .

[85]  Naohisa Komatsu,et al.  Digital watermarking based on process of speech production , 2004, IS&T/SPIE Electronic Imaging.

[86]  Ian B. Thomas,et al.  The Influence of First and Second Formants on the Intelligibility of Clipped Speech , 1968 .

[87]  Shantanu Chakrabartty,et al.  An Overview of Statistical Pattern Recognition Techniques for Speaker Verification , 2011, IEEE Circuits and Systems Magazine.

[88]  Wai C. Chu,et al.  Speech Coding Algorithms , 2003 .

[89]  Peter Jax,et al.  Artificial bandwidth extension of speech supported by watermark-transmitted side information , 2005, INTERSPEECH.

[90]  G. Clark,et al.  Acoustic parameters measured by a formant-estimating speech processor for a multiple-channel cochlear implant. , 1987, The Journal of the Acoustical Society of America.

[91]  O.T.-C. Chen,et al.  Fragile speech watermarking scheme with recovering speech contents , 2004, The 2004 47th Midwest Symposium on Circuits and Systems, 2004. MWSCAS '04..

[92]  Ingemar J. Cox,et al.  Watermarking as communications with side information , 1999, Proc. IEEE.

[93]  Mohammad S. Alam,et al.  Neural-network-based zero-watermark scheme for digital images , 2006 .