An Approach to Integrating SIP in Converged Multimodal/Multimedia Communication Services

Abstract: In this paper, we present an approach to integrating SIP (Session Initiation Protocol) into converged multimodal/multimedia communication services. We describe VoIPTeleServer, an extensible server for VoIP in a SIP environment. It is based on the concept of dialogue system and Web convergence, which separates the channel-dependent media resources from the application-dependent service creation and hosting environment. It supports XML-based service applications for multiple channels, including voice, DTMF, IM and chat over IP. The loosely coupled, open architecture of our approach is highly extensible. We describe the concept and structure of VoIPTeleServer in detail: it interfaces to the VoIP world through SIP signaling and works as a broker between the VoIP SIP environment and MTIP to deliver converged communication services. A prototype of VoIPTeleServer was implemented, and services and applications based on SIP and MTIP convergence were constructed. Special attention is given to the adverse effects of delay, jitter and packet loss on voice portal services over IP. In particular, case studies of DTMF service in a voice portal under adverse channel conditions were performed, and the compounding effects of multiple channel impairments on DTMF in voice portal services over IP are characterized. The potentially high error rate of the DTMF service indicates that a data redundancy method, as proposed in RFC 2198, is needed for DTMF in order to achieve reliable voice portal services over IP.
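To make the redundancy argument concrete, the following is a minimal sketch of how a DTMF event could be carried with RFC 2198 redundancy. It is not taken from the VoIPTeleServer prototype. The function names, the dynamic payload type 101, and the independent-loss model in `residual_loss` are assumptions chosen for illustration; real packet loss is often bursty, so the closed-form estimate is optimistic.

```python
import struct


def pack_telephone_event(event, end, volume, duration):
    """Pack one RFC 2833 telephone-event payload (4 bytes).

    Layout: event (8 bits), E bit, R bit (always 0 here),
    volume (6 bits), duration (16 bits, in timestamp units).
    """
    if not (0 <= event <= 255 and 0 <= volume <= 63 and 0 <= duration <= 0xFFFF):
        raise ValueError("field out of range")
    flags = (0x80 if end else 0) | volume
    return struct.pack("!BBH", event, flags, duration)


def pack_redundant(primary, redundant, payload_type):
    """Build an RFC 2198 payload: block headers, redundant blocks, primary block.

    `redundant` is a list of (timestamp_offset, block_bytes), oldest first.
    Each redundant header is 32 bits: F=1, PT (7), offset (14), length (10).
    The primary header is 8 bits: F=0, PT (7).
    """
    if not 0 <= payload_type <= 127:
        raise ValueError("payload type must fit in 7 bits")
    headers = b""
    body = b""
    for ts_offset, block in redundant:
        if not (0 <= ts_offset < (1 << 14) and len(block) < (1 << 10)):
            raise ValueError("offset or block length out of range")
        word = (1 << 31) | (payload_type << 24) | (ts_offset << 10) | len(block)
        headers += struct.pack("!I", word)
        body += block
    headers += struct.pack("!B", payload_type)  # F bit is 0 for the primary block
    return headers + body + primary


def residual_loss(p, copies):
    """Probability that all `copies` of an event are lost.

    ASSUMPTION: packet losses are independent with probability p.
    """
    return p ** copies


# Hypothetical usage: digit "5" (event code 5), sent with one older copy.
PT_EVENT = 101  # assumed dynamic payload type, normally negotiated via SDP
older = pack_telephone_event(5, end=False, volume=10, duration=640)
newest = pack_telephone_event(5, end=True, volume=10, duration=800)
payload = pack_redundant(newest, [(160, older)], PT_EVENT)
```

Under the independence assumption, carrying each event in three packets instead of one reduces the chance of losing it from `p` to `p ** 3`, at the cost of a few extra bytes per packet.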
