Robust Audio Copy-Move Forgery Detection Using Constant Q Spectral Sketches and GA-SVM

Audio recordings used as evidence have become increasingly important to litigation. Before their admissibility as evidence, an audio forensic expert is often required to help determine whether the submitted audio recordings are altered or authentic. Within this field, the copy-move forgery detection (CMFD), which focuses on finding possible forgeries that are derived from the same audio recording, has been an urgent problem in blind audio forensics. However, most of the existing methods require idealistic pre-segmentation and artificial threshold selection to calculate the similarity between segments, which may result in serious misleading and misjudgment especially on high frequency words. In this work, we present a robust method for detecting and locating an audio copy-move forgery on the basis of constant Q spectral sketches (CQSS) and the integration of a customised genetic algorithm (GA) and support vector machine (SVM). Specifically, the CQSS features are first extracted by averaging the logarithm of the squared-magnitude constant Q transform. Then, the CQSS feature set is automatically optimised by a customised GA combined with SVM to obtain the best feature subset and classification model at the same time. Finally, the integrated method, named CQSS-GA-SVM, is evaluated against the state-of-the-art approaches to blind detection of copy-move forgeries on real-world copy-move datasets with read English and Chinese corpus, respectively. The experimental results demonstrate that the proposed CQSS-GA-SVM exhibits significantly high robustness against post-processing based anti-forensics attacks and adaptability to the changes of the duplicated segment duration, the training set size, the recording length, and the forgery type, which may be beneficial to improving the work efficiency of audio forensic experts.

[1]  Beste Ustubioglu,et al.  Duplicated Audio Segment Detection with Local Binary Pattern , 2020, 2020 43rd International Conference on Telecommunications and Signal Processing (TSP).

[2]  Prabhu R. Bevinamarad,et al.  Audio Forgery Detection Techniques: Present and Past Review , 2020, 2020 4th International Conference on Trends in Electronics and Informatics (ICOEI)(48184).

[3]  Chen Li,et al.  Homologous Audio Copy-move Tampering Detection Method Based on Pitch , 2019, 2019 IEEE 19th International Conference on Communication Technology (ICCT).

[4]  N. V. Lalitha,et al.  Localization of Copy-Move Forgery in Speech Signals Through Watermarking Using DCT-QIM , 2019, International Journal of Electronics and Telecommunications.

[5]  Rohan Kumar Das,et al.  Low frequency frame-wise normalization over constant-Q transform for playback speech detection , 2019, Digit. Signal Process..

[6]  Jiantao Zhou,et al.  Fast and Effective Image Copy-Move Forgery Detection via Hierarchical Feature Point Matching , 2019, IEEE Transactions on Information Forensics and Security.

[7]  Nguyen Tuan Anh,et al.  One approach in the time domain in detecting copy-move of speech recordings with the similar magnitude , 2019, International Journal of Engineering and Applied Sciences (IJEAS).

[8]  Wei Lu,et al.  Fast and Effective Copy-Move Detection of Digital Audio Based on Auto Segment , 2019, Int. J. Digit. Crime Forensics.

[9]  Rui Yang,et al.  Robust Copy–Move Detection of Speech Recording Using Similarities of Pitch and Formant , 2019, IEEE Transactions on Information Forensics and Security.

[10]  Wei Lu,et al.  Copy-move detection of digital audio based on multi-feature decision , 2018, J. Inf. Secur. Appl..

[11]  Haizhou Li,et al.  Extended Constant-Q Cepstral Coefficients for Detection of Spoofing Attacks , 2018, 2018 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC).

[12]  Rangding Wang,et al.  Source Cell-Phone Identification in the Presence of Additive Noise from CQT Domain , 2018, Inf..

[13]  Xin Yao,et al.  SNR-Constrained Heuristics for Optimizing the Scaling Parameter of Robust Audio Watermarking , 2018, IEEE Transactions on Multimedia.

[14]  Chen Li,et al.  An algorithm of detecting audio copy-move forgery based on DCT and SVD , 2017, 2017 IEEE 17th International Conference on Communication Technology (ICCT).

[15]  Nicholas W. D. Evans,et al.  Constant Q cepstral coefficients: A spoofing countermeasure for automatic speaker verification , 2017, Comput. Speech Lang..

[16]  Sheeraz Akram,et al.  Blind Detection of Copy-Move Forgery in Digital Audio Forensics , 2017, IEEE Access.

[17]  Wei Lu,et al.  Fast Copy-Move Detection of Digital Audio , 2017, 2017 IEEE Second International Conference on Data Science in Cyberspace (DSC).

[18]  Ping Zhu,et al.  A priori SNR estimation and noise estimation for speech enhancement , 2016, EURASIP Journal on Advances in Signal Processing.

[19]  Xin Yao,et al.  A Survey on Evolutionary Computation Approaches to Feature Selection , 2016, IEEE Transactions on Evolutionary Computation.

[20]  Hyeontaek Lim,et al.  Formant-Based Robust Voice Activity Detection , 2015, IEEE/ACM Transactions on Audio, Speech, and Language Processing.

[21]  Khairul Anam,et al.  A novel extreme learning machine for dimensionality reduction on finger movement classification using sEMG , 2015, 2015 7th International IEEE/EMBS Conference on Neural Engineering (NER).

[22]  Rui Yang,et al.  Copy-move detection of audio recording with pitch similarity , 2015, 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[23]  Sanjeev Khudanpur,et al.  Librispeech: An ASR corpus based on public domain audio books , 2015, 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[24]  Yan Li,et al.  Audio authenticity: Duplicated audio segment detection in waveform audio file , 2014 .

[25]  Constantine Kotropoulos,et al.  Source phone identification using sketches of features , 2014, IET Biom..

[26]  Tai-Shih Chi,et al.  Voice activity detection based on frequency modulation of harmonics , 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.

[27]  Anssi Klapuri,et al.  Audio Pitch Shifting Using the Constant-Q Transform , 2013 .

[28]  Thomas Grill,et al.  A Framework for Invertible, Real-Time Constant-Q Transforms , 2012, IEEE Transactions on Audio, Speech, and Language Processing.

[29]  Gaël Varoquaux,et al.  Scikit-learn: Machine Learning in Python , 2011, J. Mach. Learn. Res..

[30]  Sanjit K. Mitra,et al.  Voice activity detection based on multiple statistical models , 2006, IEEE Transactions on Signal Processing.

[31]  James V. Stone Independent component analysis: an introduction , 2002, Trends in Cognitive Sciences.

[32]  S T Roweis,et al.  Nonlinear dimensionality reduction by locally linear embedding. , 2000, Science.

[33]  Judith C. Brown,et al.  An efficient algorithm for the calculation of a constant Q transform , 1992 .

[34]  Chi-Man Pun,et al.  An End-to-End Dense-InceptionNet for Image Copy-Move Forgery Detection , 2020, IEEE Transactions on Information Forensics and Security.

[35]  K. R. Krishna,et al.  Copy and Move Detection in Audio Recordings using Dynamic Time Warping Algorithm , 2019 .

[36]  Aboul Ella Hassanien,et al.  Linear discriminant analysis: A detailed tutorial , 2017, AI Commun..

[37]  Muhammad Khurram Khan,et al.  Digital multimedia audio forensics: past, present and future , 2017, Multimedia Tools and Applications.

[38]  Heng Tao Shen,et al.  Principal Component Analysis , 2009, Encyclopedia of Biometrics.

[39]  Geoffrey E. Hinton,et al.  Visualizing Data using t-SNE , 2008 .

[40]  J. Weston,et al.  Support Vector Machine Solvers , 2007 .

[41]  James H. Martin,et al.  Speech and language processing: an introduction to natural language processing , 2000 .

[42]  Judith C. Brown Calculation of a constant Q spectral transform , 1991 .

[43]  David E. Goldberg,et al.  Genetic Algorithms in Search Optimization and Machine Learning , 1988 .