Voice Coding with Opus

In this paper, we describe the voice mode of the Opus speech and audio codec. As only the decoder is standardized, the details in this paper will help anyone who wants to modify the encoder or gain a better understanding of the codec. We go through the main components that constitute the voice part of the codec, provide an overview, give insights, and discuss the design decisions made during the development. Tests have shown that Opus quality is comparable to or better than several state-of-the-art voice codecs, while covering a much broader application area than competing codecs.

[1]  Jan Skoglund,et al.  Vector quantization based on Gaussian mixture models , 2000, IEEE Trans. Speech Audio Process..

[2]  Koen Vos A Fast Implementation of Burg's Method , 2013 .

[3]  Robert M. Gray,et al.  The Design of Predictive Trellis Waveform Coders Using the Generalized Lloyd Algorithm , 1986, IEEE Trans. Commun..

[4]  Philippe Gournay,et al.  Increasing the robustness of CELP-based coders by constrained optimization , 2005, Proceedings. (ICASSP '05). IEEE International Conference on Acoustics, Speech, and Signal Processing, 2005..

[5]  G. Nigel Martin,et al.  * Range encoding: an algorithm for removing redundancy from a digitised message , 1979 .

[6]  Bishnu S. Atal,et al.  Predictive coding of speech signals and subjective error criteria , 1978, ICASSP.

[7]  Koen Vos,et al.  SILK Speech Codec , 2010 .

[8]  H. Strube Linear prediction on a warped frequency scale , 1980 .

[9]  R.P. Ramachandran,et al.  Joint solutions for formant and pitch predictors in speech processing , 1988, ICASSP-88., International Conference on Acoustics, Speech, and Signal Processing.

[10]  Timothy B. Terriberry,et al.  Definition of the Opus Audio Codec , 2012, RFC.

[11]  J. Cadzow Maximum Entropy Spectral Analysis , 2006 .

[12]  Anssi Rämö,et al.  Voice Quality Characterization of IETF Opus Codec , 2011, INTERSPEECH.

[13]  John B. Anderson,et al.  Trellis source codes based on linear congruential recursions , 2005, IEEE Communications Letters.

[14]  Peter Vary,et al.  A New Approach for Low-Delay Joint-Stereo Coding , 2011 .