Effects of Packet Losses in Waveform Coded Speech and Improvements Due to an Odd-Even Sample-Interpolation Procedure

We have studied the effects of random packet losses in digital speech systems based on 12-bit PCM and 4-bit adaptive DPCM coding. The effects are a function of packet length B and probability of packet loss P L . We have also studied tbe benefits of an odd-even sample-interpolation procedure that mitigates these effects (at the cost of increased decoding delay). The procedure is based on arranging a 2B -block of codewords into two B -sample packets, an odd-sample packet and an even-sample packet. If one of these packets is lost, the odd (or even) samples of the 2B -block are estimated from the even (or odd) samples by means of adaptive interpolation. Perceptual considerations indicate that packet lengths most robust to losses are in the range 16-32 ms, irrespective of whether interpolation is used or not. With these packet lengths, tolerable P L values, which are strictly input-speech-dependent, can be as high as 2 to 5 percent without interpolation and 5 to 10 percent with interpolation. These observations are based on a computer simulation with three sentence-length speech inputs, and on informal listening tests.

[1]  N. Jayant Digital coding of speech waveforms: PCM, DPCM, and DM quantizers , 1974 .

[2]  J. O'Neal,et al.  PCM speech compression via ADPCM/TASI , 1977 .

[3]  R. Cox,et al.  Multiple User Variable Rate Coding for TASI and Packet Transmission Systems , 1980, IEEE Trans. Commun..

[4]  C. J. Harris,et al.  Packet transmission of speech using variable‐quality coding and time‐interval modification , 1977 .

[5]  E. T. Klemmer Subjective evaluation of transmission delay in telephone conversations , 1967 .

[6]  J. W. Emling,et al.  The effects of time delay and echoes on telephone conversations , 1963 .

[7]  David J. Goodman Embedded DPCM for Variable Bit Rate Transmission , 1980, IEEE Trans. Commun..

[8]  T. Bially,et al.  A Technique for Adaptive Voice Flow Control in Integrated Packet Networks , 1980, IEEE Trans. Commun..

[9]  I. Gitman,et al.  Economic analysis of integrated voice and data networks: A case study , 1978, Proceedings of the IEEE.

[10]  N. S. Jayant Step-size transmitting differential coders for mobile telephony , 1975, The Bell System Technical Journal.

[11]  Yohtaro Yatsuzuka,et al.  A high-gain DSI-ADPCM system , 1979, ICASSP.

[12]  R. Crochiere,et al.  Speech Coding , 1979, IEEE Transactions on Communications.

[13]  James W. Forgie,et al.  Speech transmission in packet-switched store-and-forward networks , 1899, AFIPS '75.

[14]  Daniel Minoli,et al.  Optimal Packet Length for Packet Voice Communication , 1979, IEEE Trans. Commun..

[15]  A. Jain Image Coding Via a Nearest Neighbors Image Model , 1975, IEEE Trans. Commun..

[16]  Raymond Steele,et al.  On Soft-Decision Demodulation for PCM- and DPCM- Encoded Speech , 1980, IEEE Trans. Commun..

[17]  B. Gold,et al.  Digital speech networks , 1977, Proceedings of the IEEE.

[18]  K. Bullington,et al.  Engineering aspects of TASI , 1959 .

[19]  G. Coviello,et al.  Comparative Discussion of Circuit- vs. Packet-Switched Voice , 1979, IEEE Trans. Commun..

[20]  C. Weinstein,et al.  Fractional Speech Loss and Talker Activity Model for TASI and for Packet-Switched Speech , 1978, IEEE Trans. Commun..

[21]  Donald L. Schilling,et al.  Delta Modulators in Packet Voice Networks , 1980, IEEE Trans. Commun..