论文信息 - Quality-Aware Loss-Robust Scalable Speech Streaming Based on Speech Quality Estimation

Quality-Aware Loss-Robust Scalable Speech Streaming Based on Speech Quality Estimation

This paper proposes a quality-aware loss-robust scalable speech streaming (QLSSS) method to improve the perceived speech quality (PSQ) of a scalable wideband speech streaming (SWSS) system over IP networks. To this end, the proposed method estimates the PSQ and the packet loss rate (PLR) from the received speech data. Subsequently, it decides the amount of redundant speech data (RSD) that a speech decoder can use to reconstruct lost speech signals for high PLRs. According to this decision, the proposed method optimizes a scalable speech coding mode for current speech data (CSD) and RSD bitstreams in order to prevent speech quality from being degraded under the estimated packet loss condition and maintain the transmission bandwidth. The effectiveness of the proposed method is then demonstrated using the ITU-T Recommendations G.729.1 and P.563 as a scalable wideband speech codec and a PSQ estimator, respectively. It is shown from the experiments that an SWSS system employing the proposed QLSSS method significantly improves speech quality under packet loss conditions.

Hong Kook Kim | Seung Ho Choi | Jin Ah Kang

[1] Wenyu Jiang,et al. Comparison and optimization of packet loss repair methods on VoIP perceived quality under bursty loss , 2002, NOSSDAV '02.

[2] Qian Zhang,et al. Error robust scalable audio streaming over wireless IP networks , 2004, IEEE Trans. Multim..

[3] Donald F. Towsley,et al. Adaptive FEC-based error control for Internet telephony , 1999, IEEE INFOCOM '99. Conference on Computer Communications. Proceedings. Eighteenth Annual Joint Conference of the IEEE Computer and Communications Societies. The Future is Now (Cat. No.99CH36320).

[4] Chi-Ying Tsui,et al. Unequal error protection for wireless transmission of MPEG audio , 1999, ISCAS'99. Proceedings of the 1999 IEEE International Symposium on Circuits and Systems VLSI (Cat. No.99CH36349).

[5] J. Hagenauer,et al. Channel coding and transmission aspects for wireless multimedia , 1999, Proc. IEEE.

[6] Seong Ro Lee,et al. A Packet Loss Concealment Algorithm Robust to Burst Packet Loss Using Multiple Codebooks and Comfort Noise for CELP-Type Speech Coders , 2010, FGIT-FGCN.

[7] Chun-Feng Wu,et al. Perceptual-based playout mechanisms for multi-stream voice over IP networks , 2009, 2009 17th European Signal Processing Conference.

[8] Yang Zhi-ling. 8 kbit/s～32 kbit/s Wideband Codec Bitstream Interoperable with G.729:ITU-T G.729.1 , 2011 .

[9] Roch Lefebvre,et al. The adaptive multirate wideband speech codec (AMR-WB) , 2002, IEEE Trans. Speech Audio Process..

[10] Akinori Ito,et al. Packet Loss Concealment for MDCT-Based Audio Codec Using Correlation-Based Side Information , 2008, 2008 International Conference on Intelligent Information Hiding and Multimedia Signal Processing.

[11] Henning Schulzrinne,et al. RTP: A Transport Protocol for Real-Time Applications , 1996, RFC.