Voice-based Human-Machine Interaction Modeling for Automated Information Services

Voice based human-machine dialogs are becoming more and more important part of informative services. The implementation of voice dialogs enables to realize some of the aims of telecommunication services more successfully and efficiently. The main aim is to enable the communication according the principle "anytime-anywhere". The importance of voice dialogs is also caused by the fact that principle "anytime-anywhere" often could be realized only using mobile and portable devices. Those devices typically have small keyboards and screens and hence voice based interface has advantages over traditional keyboard and screen based interface. The paper presents the model of multimodal interface which core element is the recognition of voice commands. The model targets the informative services provided by the Lithuanian medical and social security enterprises. Paper shows that recognition accuracy of Lithuanian voice commands could be increased significantly if the foreign language which has closer to Lithuanian phonetic structure engine is adapted. Ill. 2, bibl. 11, tabl. 1 (in English; abstracts in English and Lithuanian). http://dx.doi.org/10.5755/j01.eee.110.4.300

[1]  Marion Mast,et al.  Multimodal output for a conversational telephony system , 2000, 2000 IEEE International Conference on Multimedia and Expo. ICME2000. Proceedings. Latest Advances in the Fast Changing World of Multimedia (Cat. No.00TH8532).

[2]  Rytis Maskeliunas,et al.  Advances on the Use of the Foreign Language Recognizer , 2009, COST 2102 Training School.

[3]  Jan Kleindienst,et al.  Multi-modal telephony services in hometalk , 2007 .

[4]  R. Dettmer It's good to talk [speech technology for on-line services access] , 2003 .

[5]  R. Duerr Voice recognition in the telecommunications industry , 1996, Professional Program Proceedings. ELECTRO '96.

[6]  Michal Pechoucek,et al.  An Intelligent Telephony Interface of Multiagent Decision Support Systems , 2007, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews).

[7]  Lou Boves,et al.  A multimodal consumer information server with IVR menu , 1994, Proceedings of 2nd IEEE Workshop on Interactive Voice Technology for Telecommunications Applications.

[8]  Biing-Hwang Juang Ubiquitous speech communication interface , 2001, IEEE Workshop on Automatic Speech Recognition and Understanding, 2001. ASRU '01..

[9]  R. Maskeli,et al.  Investigation of Foreign Languages Models for Lithuanian Speech Recognition , 2009 .

[10]  G. Nemeth,et al.  Challenges of creating multimodal interfaces on mobile devices , 2007, ELMAR 2007.