Performance Analysis of Robust Audio Hashing

We present a novel theoretical analysis of the Philips audio fingerprinting method proposed by Haitsma, Kalker, and Oostveen (2001). Although this robust hashing algorithm exhibits very good performance, the method has only been partially analyzed in the literature. Hence, there is a clear need for a more complete analysis which allows both performance prediction and systematic optimization. We examine here the theoretical performance of the method for Gaussian inputs by means of a statistical model. Our analysis relies on formulating the unquantized fingerprint as a quadratic form, which affords a systematic way to compute the model parameters. We provide closed-form analytical upperbounds for the probability of bit error of the hash for two relevant scenarios: noise addition and desynchronization. We show that these results are useful when applied to real audio signals

[1]  J. Imhof Computing the distribution of quadratic forms in normal variables , 1961 .

[2]  P. Welch The use of fast Fourier transform for the estimation of power spectra: A method based on time averaging over short, modified periodograms , 1967 .

[3]  Seungjae Lee,et al.  Audio fingerprinting based on normalized spectral subband moments , 2006, IEEE Signal Processing Letters.

[4]  Jaap A. Haitsma,et al.  Robust Audio Hashing for Content Identification , 2001 .

[5]  Pedro Cano,et al.  A Review of Audio Fingerprinting , 2005, J. VLSI Signal Process..

[6]  D. Sorensen IMPLICITLY RESTARTED ARNOLDI/LANCZOS METHODS FOR LARGE SCALE EIGENVALUE CALCULATIONS , 1996 .

[7]  David G. Long,et al.  The probability density of spectral estimates based on modified periodogram averages , 1999, IEEE Trans. Signal Process..

[8]  Ramarathnam Venkatesan,et al.  A Perceptual Audio Hashing Algorithm: A Tool for Robust Audio Identification and Information Hiding , 2001, Information Hiding.

[9]  J. Magnus,et al.  Matrix Differential Calculus with Applications in Statistics and Econometrics , 1991 .

[10]  Robert M. Gray,et al.  Toeplitz and Circulant Matrices: A Review , 2005, Found. Trends Commun. Inf. Theory.

[11]  Robert M. Gray,et al.  Toeplitz And Circulant Matrices: A Review (Foundations and Trends(R) in Communications and Information Theory) , 2006 .

[12]  Reginald L. Lagendijk,et al.  Stochastic Model of a Robust Audio Fingerprinting System , 2004, ISMIR.

[13]  Ton Kalker,et al.  A Highly Robust Audio Fingerprinting System , 2002, ISMIR.

[14]  E. Ziegel Matrix Differential Calculus With Applications in Statistics and Econometrics , 1989 .

[15]  R. L. Lagendijk,et al.  THEORETICAL MODELING OF A ROBUST AUDIO FINGERPRINTING SYSTEM , 2004 .

[16]  Pieter H. Hartel,et al.  Music2Share - copyright-compliant music sharing in P2P systems , 2004, Proceedings of the IEEE.

[17]  Ton Kalker,et al.  A Highly Robust Audio Fingerprinting System With an Efficient Search Strategy , 2003 .

[18]  Reginald L. Lagendijk,et al.  Extracting Quality Parameters for Compressed Audio from Fingerprints , 2005, ISMIR.

[19]  Thomas M. Cover,et al.  Elements of Information Theory , 2005 .

[20]  Gregory W. Wornell,et al.  Authentication with distortion criteria , 2005, IEEE Transactions on Information Theory.

[21]  Hoirin Kim,et al.  Frequency Filtering for a Highly Robust Audio Fingerprinting Scheme in a Real-Noise Environment , 2006, IEICE Trans. Inf. Syst..