Non-intrusive identification of speech codecs in digital audio signals

Non-Intrusive Identification of Speech Codecs in Digital Audio Signals

[1]  Priyabrata Sinha Speech Compression Overview , 2010 .

[2]  Ashwin Swaminathan,et al.  Multimedia Forensic Analysis via Intrinsic and Extrinsic Fingerprints , 2008 .

[3]  W. Bastiaan Kleijn,et al.  Internet Low Bit Rate Codec (iLBC) , 2004, RFC.

[4]  Alan McCree,et al.  Low-Bit-Rate Speech Coding , 2008 .

[5]  K. Scholz,et al.  Speech-codec detection by spectral harmonic-plus-noise decomposition , 2004, Conference Record of the Thirty-Eighth Asilomar Conference on Signals, Systems and Computers, 2004..

[6]  Jae S. Lim,et al.  Multiband excitation vocoder , 1988, IEEE Transactions on Acoustics, Speech, and Signal Processing.

[7]  Ahmet M. Kondoz,et al.  Digital Speech: Coding for Low Bit Rate Communication Systems , 1995 .

[8]  Andreas Spanias,et al.  Speech coding: a tutorial review , 1994, Proc. IEEE.

[9]  Thomas G. Dietterich What is machine learning? , 2020, Archives of Disease in Childhood.

[10]  W. Bastiaan Kleijn Principles of Speech Coding , 2008 .

[11]  Koen Vos,et al.  SILK Speech Codec , 2010 .

[12]  Lajos Hanzo,et al.  Voice and Audio Compression for Wireless Communications , 2007 .

[13]  Francesco Camastra,et al.  Machine Learning for Audio, Image and Video Analysis - Theory and Applications , 2007, Advanced Information and Knowledge Processing.

[14]  Haibin Ling,et al.  An Efficient Earth Mover's Distance Algorithm for Robust Histogram Comparison , 2007, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[15]  Dominique Vaufreydaz,et al.  The effect of speech and audio compression on speech recognition performance , 2001, 2001 IEEE Fourth Workshop on Multimedia Signal Processing (Cat. No.01TH8564).

[16]  M.G. Bellanger,et al.  Digital processing of speech signals , 1980, Proceedings of the IEEE.

[17]  J. Benesty,et al.  Linear Predic 7. Linear Prediction , 2008 .

[18]  A. Noga A Short-Segment Fourier Transform Methodology , 2009 .

[19]  Patrick Traynor,et al.  PinDr0p: using single-ended audio features to determine call provenance , 2010, CCS '10.

[20]  John W. Ratcliff Audio compression , 1992 .

[21]  Manfred R. Schroeder,et al.  Code-excited linear prediction(CELP): High-quality speech at very low bit rates , 1985, ICASSP '85. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[22]  D. M. Alley Automatic identification of voice band telephony coding schemes using neural networks , 1993 .

[23]  Jonathan G. Fiscus,et al.  Darpa Timit Acoustic-Phonetic Continuous Speech Corpus CD-ROM {TIMIT} | NIST , 1993 .

[24]  LingHaibin,et al.  An Efficient Earth Mover's Distance Algorithm for Robust Histogram Comparison , 2007 .

[25]  Haibin Ling,et al.  Diffusion Distance for Histogram Comparison , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[26]  Chi-keung Julian Wong Coding of speech at 16 kbit/s using low-delay code excited linear prediction (LD-CELP) , 2010 .

[27]  Robert X. Gao,et al.  From Fourier Transform to Wavelet Transform: A Historical Perspective , 2011 .

[28]  Juin-Hwey Chen,et al.  Analysis-by-Synthesis Speech Coding , 2008 .