Efficient and robust frame-synchronized blind audio watermarking by featuring multilevel DWT and DCT

This paper presents a self-synchronizing blind audio watermarking method developed based on the distinctive features of multilevel discrete wavelet transform (MDWT) and discrete cosine transform (DCT). In the proposed design, the repetitive pattern hidden in the 11th approximation subband is used to control frame synchronization, and the wideband signal covering the first to ninth detail subbands is used to hide binary information. Tracing the zero-crossings across the extracted synchronous signal makes it possible to recalibrate frame positions and achieve synchronous watermarking on a rational dither modulation (RDM) framework. We also developed a windowed vector modulation (WVM) scheme suited for the time as well as DCT domain to enhance watermark imperceptivity. The proposed watermarking method currently achieves a payload capacity of 86 bits per second. The results of PEAQ testing confirm that the watermarked audio signal retains quality nearly indistinguishable from the original. Compared with two existing synchronized methods designed specifically to cope with playback speed modification, the proposed MDWT–DCT–RDM–WVM method demonstrates superior robustness to commonly encountered attacks. When encountering desynchronization attacks, such as cropping and resampling time-scale modification, the proposed technique is capable of retrieving the watermark bits with a high degree of accuracy.

[1]  Jiwu Huang,et al.  Audio watermarking robust against time-scale modification and MP3 compression , 2008, Signal Process..

[2]  Ling-Yuan Hsu,et al.  Variable-dimensional vector modulation for perceptual-based DWT blind audio watermarking with adjustable payload capacity , 2014, Digit. Signal Process..

[3]  Hong-Xia Wang,et al.  Statistical characteristic-based robust audio watermarking for resolving playback speed modification , 2011, Digit. Signal Process..

[4]  Yong Wang,et al.  A blind audio watermarking algorithm with self-synchronization , 2002, 2002 IEEE International Symposium on Circuits and Systems. Proceedings (Cat. No.02CH37353).

[5]  C. L. Philip Chen,et al.  Robust Mel-Frequency Cepstral coefficients feature detection and dual-tree complex wavelet transform for digital audio watermarking , 2015, Inf. Sci..

[6]  Jiwu Huang,et al.  Histogram-Based Audio Watermarking Against Time-Scale Modification and Cropping Attacks , 2007, IEEE Transactions on Multimedia.

[7]  D. M. Campbell,et al.  Springer Handbook of Acoustics , 2015 .

[8]  Rui Yang,et al.  Geometric Invariant Audio Watermarking Based on an LCM Feature , 2011, IEEE Transactions on Multimedia.

[9]  Zhen Li,et al.  A robust audio watermarking scheme based on lifting wavelet transform and singular value decomposition , 2012, Signal Process..

[10]  Gregory W. Wornell,et al.  Quantization index modulation: A class of provably good methods for digital watermarking and information embedding , 2001, IEEE Trans. Inf. Theory.

[11]  Xiangzhong Fang,et al.  Audio Watermarking Robust against Playback Speed Modification , 2011, IEICE Trans. Fundam. Electron. Commun. Comput. Sci..

[12]  Jiwu Huang,et al.  Efficiently self-synchronized audio watermarking for assured audio data transmission , 2005, IEEE Transactions on Broadcasting.

[13]  Ling-Yuan Hsu,et al.  A DWT-Based Rational Dither Modulation Scheme for Effective Blind Audio Watermarking , 2016, Circuits Syst. Signal Process..

[14]  Xiangyang Xue,et al.  Localized audio watermarking technique robust against time-scale modification , 2006, IEEE Trans. Multim..

[15]  C. Mosquera,et al.  Rational dither modulation: a high-rate data-hiding method invariant to gain attacks , 2005, IEEE Transactions on Signal Processing.

[16]  E. Zwicker,et al.  Analytical expressions for critical‐band rate and critical bandwidth as a function of frequency , 1980 .

[17]  Tapio Seppänen,et al.  Digital Audio Watermarking Techniques and Technologies: Applications and Benchmarks , 2007 .

[18]  Ling-Yuan Hsu,et al.  Robust, transparent and high-capacity audio watermarking in DCT domain , 2015, Signal Process..

[19]  Xiang-Yang Wang,et al.  A Novel Synchronization Invariant Audio Watermarking Scheme Based on DWT and DCT , 2006, IEEE Transactions on Signal Processing.

[20]  Mohamed F. Mansour,et al.  Data embedding in audio using time-scale modification , 2005, IEEE Transactions on Speech and Audio Processing.

[21]  Zhen Li,et al.  A Robust Audio Watermarking Scheme Based on Lifting Wavelet Transform and Singular Value Decomposition , 2011, IWDW.

[22]  Xiangyang Wang,et al.  A robust content based audio watermarking using UDWT and invariant histogram , 2010, Multimedia Tools and Applications.

[23]  Chi-Man Pun,et al.  Robust Segments Detector for De-Synchronization Resilient Audio Watermarking , 2013, IEEE Transactions on Audio, Speech, and Language Processing.