Authenticity identification of speaker digital recording data based on quantum genetic algorithm

This paper addresses the authenticity identification of speaker digital recording data. Because such recordings are an important basis for judicial identification, ensuring their authenticity is crucial. Many solutions have been proposed. However, classical methods usually operate on logical symbols rather than on the physical detection of energy or phase, and they suffer from low identification efficiency, algorithmic instability, and heavy time overhead. We propose to use quantum theory to address the problem. Any tampering with digital recording data changes the charge stored in the memory, and this subtle change can be used for identification. First, we analyze the quantum nature of storage and use quantum tunneling theory to extract the transmittance of the speech signal as the characteristic value. Second, in view of the characteristics of speech signals, we use the transmittance to define the rotation angle step function and propose an improved quantum genetic algorithm to detect the change of phase. The proposed method thus achieves authenticity identification based on phase detection. The results show that the problem can be addressed by a phase detection solution based on the quantum genetic algorithm, and simulation experiments show performance benefits over existing solutions. Authenticity identification of speaker digital recording data is therefore feasible both theoretically and practically.
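The abstract does not give the form of the rotation angle step function, so the following is only a minimal sketch of how a transmittance value could set the step of a quantum rotation gate in a quantum genetic algorithm. The linear mapping in `rotation_step`, its bounds `theta_min` and `theta_max`, and the helper `update_toward` are illustrative assumptions, not the paper's definitions. Only the 2x2 rotation gate itself is the standard one used in quantum genetic algorithms.

```python
import math


def rotate_qubit(alpha, beta, theta):
    """Apply the standard 2x2 rotation gate to one qubit (alpha, beta)."""
    c, s = math.cos(theta), math.sin(theta)
    return c * alpha - s * beta, s * alpha + c * beta


def rotation_step(transmittance, theta_min=0.001 * math.pi, theta_max=0.05 * math.pi):
    """Hypothetical step function (an assumption, not the paper's formula):
    map a transmittance clamped to [0, 1] linearly onto an angle magnitude."""
    t = min(max(transmittance, 0.0), 1.0)
    return theta_min + (theta_max - theta_min) * t


def update_toward(alpha, beta, best_bit, transmittance):
    """Rotate the qubit so that the probability of measuring `best_bit` grows.

    A positive angle raises |beta|^2 when alpha and beta share a sign,
    so the direction is flipped when they differ or when the target bit is 0.
    """
    step = rotation_step(transmittance)
    direction = 1.0 if alpha * beta >= 0 else -1.0
    if best_bit == 0:
        direction = -direction
    return rotate_qubit(alpha, beta, direction * step)


if __name__ == "__main__":
    a, b = 1 / math.sqrt(2), 1 / math.sqrt(2)  # equal superposition
    a, b = update_toward(a, b, best_bit=1, transmittance=0.5)
    print(f"P(1) after one update: {b * b:.4f}")
```

Because the gate is a pure rotation, the normalization |alpha|^2 + |beta|^2 = 1 is preserved after every update, and a larger transmittance produces a larger step under the assumed mapping.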
