Adaptive Redundant Speech Transmission over Wireless Multimedia Sensor Networks Based on Estimation of Perceived Speech Quality

This paper proposes an adaptive redundant speech transmission (ARST) approach that improves the perceived speech quality (PSQ) of speech streaming applications over wireless multimedia sensor networks (WMSNs). The approach estimates both the PSQ and the packet loss rate (PLR) from the received speech data. It then decides whether redundant speech data (RSD) must be transmitted to help the speech decoder reconstruct lost speech signals at high PLRs. Based on this decision, the ARST approach controls the RSD transmission and optimizes the speech coding bitrate used to encode the current speech data (CSD) and the RSD bitstream, so that speech quality is maintained under packet loss. The effectiveness of the approach is demonstrated using the adaptive multirate-narrowband (AMR-NB) speech codec as the scalable speech codec and ITU-T Recommendation P.563 as the PSQ estimator. Experiments show that a speech streaming application employing the proposed ARST approach significantly improves speech quality under packet loss conditions in WMSNs.
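
The decision step described above can be illustrated with a minimal Python sketch. It is not the authors' algorithm: the thresholds `PLR_THRESHOLD` and `MOS_THRESHOLD`, the bitrate budget `BUDGET_KBPS`, the rule that redundancy is enabled only when the PLR is high and the estimated quality is low, and the names `TxPlan`, `needs_redundancy`, and `plan_transmission` are all assumptions made for illustration. Only the list of AMR-NB codec modes is standard.

```python
from dataclasses import dataclass
from itertools import product
from typing import Optional

# The eight AMR-NB speech codec modes, in kbit/s.
AMR_NB_MODES_KBPS = (4.75, 5.15, 5.90, 6.70, 7.40, 7.95, 10.2, 12.2)

# Hypothetical parameters, chosen for illustration only (not from the paper).
PLR_THRESHOLD = 0.05   # packet loss rate above which loss counts as "high"
MOS_THRESHOLD = 3.0    # estimated quality score below which quality counts as degraded
BUDGET_KBPS = 12.2     # assumed total bitrate available for CSD plus RSD


@dataclass(frozen=True)
class TxPlan:
    csd_kbps: float            # bitrate for the current speech data
    rsd_kbps: Optional[float]  # bitrate for the redundant speech data, None if unused


def needs_redundancy(plr: float, mos: float) -> bool:
    """Assumed rule: send RSD only when loss is high and quality is degraded."""
    return plr >= PLR_THRESHOLD and mos < MOS_THRESHOLD


def plan_transmission(plr: float, mos: float,
                      budget_kbps: float = BUDGET_KBPS) -> TxPlan:
    """Pick AMR-NB modes for CSD (and RSD, if needed) within the budget."""
    single = [m for m in AMR_NB_MODES_KBPS if m <= budget_kbps]
    if not single:
        raise ValueError("budget is below the lowest AMR-NB mode")

    if needs_redundancy(plr, mos):
        # All (CSD, RSD) mode pairs that fit the budget, with CSD >= RSD.
        pairs = [(c, r) for c, r in product(AMR_NB_MODES_KBPS, repeat=2)
                 if c >= r and c + r <= budget_kbps]
        if pairs:
            # Use as much of the budget as possible, then favor the CSD rate.
            csd, rsd = max(pairs, key=lambda p: (p[0] + p[1], p[0]))
            return TxPlan(csd, rsd)

    # No redundancy needed (or no pair fits): spend the budget on CSD alone.
    return TxPlan(max(single), None)
```

With these assumed values, a low-loss channel keeps the highest single mode, while a lossy channel with poor estimated quality splits the same budget between a lower-rate CSD stream and an RSD stream. In a real receiver the `mos` argument would come from a P.563 implementation and `plr` from sequence-number gaps in the received packets.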
