论文信息 - Non-intrusive identification of speech codecs in digital audio signals - 字舞流文

Non-intrusive identification of speech codecs in digital audio signals

Non-Intrusive Identification of Speech Codecs in Digital Audio Signals

[1] Priyabrata Sinha. Speech Compression Overview , 2010 .

[2] Ashwin Swaminathan,et al. Multimedia Forensic Analysis via Intrinsic and Extrinsic Fingerprints , 2008 .

[3] W. Bastiaan Kleijn,et al. Internet Low Bit Rate Codec (iLBC) , 2004, RFC.

[4] Alan McCree,et al. Low-Bit-Rate Speech Coding , 2008 .

[5] K. Scholz,et al. Speech-codec detection by spectral harmonic-plus-noise decomposition , 2004, Conference Record of the Thirty-Eighth Asilomar Conference on Signals, Systems and Computers, 2004..

[6] Jae S. Lim,et al. Multiband excitation vocoder , 1988, IEEE Transactions on Acoustics, Speech, and Signal Processing.

[7] Ahmet M. Kondoz,et al. Digital Speech: Coding for Low Bit Rate Communication Systems , 1995 .

[8] Andreas Spanias,et al. Speech coding: a tutorial review , 1994, Proc. IEEE.

[9] Thomas G. Dietterich. What is machine learning? , 2020, Archives of Disease in Childhood.

[10] W. Bastiaan Kleijn. Principles of Speech Coding , 2008 .

[11] Koen Vos,et al. SILK Speech Codec , 2010 .

[12] Lajos Hanzo,et al. Voice and Audio Compression for Wireless Communications , 2007 .

[13] Francesco Camastra,et al. Machine Learning for Audio, Image and Video Analysis - Theory and Applications , 2007, Advanced Information and Knowledge Processing.

[14] Haibin Ling,et al. An Efficient Earth Mover's Distance Algorithm for Robust Histogram Comparison , 2007, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[15] Dominique Vaufreydaz,et al. The effect of speech and audio compression on speech recognition performance , 2001, 2001 IEEE Fourth Workshop on Multimedia Signal Processing (Cat. No.01TH8564).

[16] M.G. Bellanger,et al. Digital processing of speech signals , 1980, Proceedings of the IEEE.

[17] J. Benesty,et al. Linear Predic 7. Linear Prediction , 2008 .

[18] A. Noga. A Short-Segment Fourier Transform Methodology , 2009 .

[19] Patrick Traynor,et al. PinDr0p: using single-ended audio features to determine call provenance , 2010, CCS '10.

[20] John W. Ratcliff. Audio compression , 1992 .

[21] Manfred R. Schroeder,et al. Code-excited linear prediction(CELP): High-quality speech at very low bit rates , 1985, ICASSP '85. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[22] D. M. Alley. Automatic identification of voice band telephony coding schemes using neural networks , 1993 .

[23] Jonathan G. Fiscus,et al. Darpa Timit Acoustic-Phonetic Continuous Speech Corpus CD-ROM {TIMIT} | NIST , 1993 .

[24] LingHaibin,et al. An Efficient Earth Mover's Distance Algorithm for Robust Histogram Comparison , 2007 .

[25] Haibin Ling,et al. Diffusion Distance for Histogram Comparison , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[26] Chi-keung Julian Wong. Coding of speech at 16 kbit/s using low-delay code excited linear prediction (LD-CELP) , 2010 .

[27] Robert X. Gao,et al. From Fourier Transform to Wavelet Transform: A Historical Perspective , 2011 .

[28] Juin-Hwey Chen,et al. Analysis-by-Synthesis Speech Coding , 2008 .