Handoff with DSP Support: Enabling Seamless Voice Communications across Heterogeneous Telephony Systems on Dual-Mode Mobile Devices

In this paper we investigate the problem of voice communications across heterogeneous telephony systems on dual-mode (WiFi and GSM) mobile devices. Since GSM is a circuit-switched telephony system, existing solutions that are based on packet-switched network protocols cannot be used. We show in this paper that an enabling technology for seamless voice communications across circuit-switched and packet-switched telephony systems is the support of digital signal processing (DSP) techniques during handoffs. To substantiate our argument, we start with a framework based on the session initiation protocol (SIP) for vertical handoffs on dual-mode mobile devices. We then identify the key obstacle in achieving seamless handoffs across circuit-switched and packet-switched systems, and explain why DSP support is necessary in this context. We propose a solution that incorporates time alignment and time scaling algorithms during handoffs for supporting seamless voice communications across heterogeneous telephony systems. We conduct testbed experiments using a GSM-WiFi dual-mode notebook and evaluate the quality of speech when the call is migrated from WiFi to GSM networks. Evaluation results show that such a cross-disciplinary solution involving signal processing and networking can effectively support seamless voice communications across heterogeneous telephony systems.

[1]  Kouji Nishimura,et al.  A seamless handoff for dual-interfaced mobile devices in hybrid wireless access networks , 2004, 18th International Conference on Advanced Information Networking and Applications, 2004. AINA 2004..

[2]  Bernd Girod,et al.  Adaptive playout scheduling and loss concealment for voice communication over IP networks , 2003, IEEE Trans. Multim..

[3]  Fang Liu,et al.  Objective quality measurement for audio time-scale modification , 2003, SPIE ITCom.

[4]  Randy H. Katz,et al.  Vertical handoffs in wireless overlay networks , 1998, Mob. Networks Appl..

[5]  Jon Peterson,et al.  Session Initiation Protocol for Telephones (SIP-T): Context and Architectures , 2002, RFC.

[6]  Werner Verhelst,et al.  Efficient non-uniform time-scaling of speech with WSOLA for CALL applications , 2004 .

[7]  Mark Handley,et al.  SIP: Session Initiation Protocol , 1999, RFC.

[8]  Robert J. Sparks,et al.  The Session Initiation Protocol (SIP) Refer Method , 2003, RFC.

[9]  Schuyler Quackenbush,et al.  Objective measures of speech quality , 1995 .

[10]  Nikos I. Passas,et al.  Seamless continuity of real-time video across UMTS and WLAN networks: challenges and performance evaluation , 2005, IEEE Wireless Communications.

[11]  Luigi Atzori,et al.  Speech playout buffering based on a simplified version of the ITU-T E-model , 2004, IEEE Signal Processing Letters.

[12]  A. Wilgus,et al.  High quality time-scale modification for speech , 1985, ICASSP '85. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[13]  Hyun-Ho Choi,et al.  A seamless handoff scheme for UMTS-WLAN interworking , 2004, IEEE Global Telecommunications Conference, 2004. GLOBECOM '04..

[14]  S. Voran Perception of Temporal Discontinuity Impairments in Coded Speech - A Proposal for Objective Estimators and Some Subjective Test Results , 2003 .

[15]  Raghupathy Sivakumar,et al.  A Receiver-Centric Transport Protocol for Mobile Hosts with Heterogeneous Wireless Interfaces , 2003, MobiCom '03.

[16]  Seung-Jae Han,et al.  Design and implementation of a WLAN/cdma2000 interworking architecture , 2003, IEEE Commun. Mag..

[17]  Olivier Boëffard,et al.  Multilingual PSOLA text-to-speech system , 1993, 1993 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[18]  Werner Verhelst Overlap-add methods for time-scaling of speech , 2000, Speech Commun..

[19]  Methods for objective and subjective assessment of quality Perceptual evaluation of speech quality ( PESQ ) : An objective method for end-to-end speech quality assessment of narrow-band telephone networks and speech codecs , 2002 .

[20]  Werner Verhelst,et al.  An overlap-add technique based on waveform similarity (WSOLA) for high quality time-scale modification of speech , 1993, 1993 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[21]  Doh-Suk Kim,et al.  ANIQUE: An Auditory Model for Single-Ended Speech Quality Estimation , 2005, IEEE Trans. Speech Audio Process..

[22]  Hakki Gökhan Ilk,et al.  Adaptive time scale modification of speech for graceful degrading voice quality in congested networks for VoIP applications , 2006, Signal Process..

[23]  Antony William Rix,et al.  Perceptual evaluation of speech quality (PESQ): The new ITU standard for end-to-end speech quality a , 2002 .

[24]  Ralf Steinmetz,et al.  Human Perception of Jitter and Media Synchronization , 1996, IEEE J. Sel. Areas Commun..

[25]  Biing-Hwang Juang,et al.  Fundamentals of speech recognition , 1993, Prentice Hall signal processing series.

[26]  S. Weinstein Echo cancellation in the telephone network , 1977, IEEE Communications Society Magazine.

[27]  Henning Schulzrinne,et al.  Application-layer mobility using SIP , 2000, MOCO.