User driven audio content navigation for spoken web

It is a common practice for us to skim textual content on a web page. While skimming, we usually skip words or phrases that are not of interest to us and we slow down our speed when the content seems to be of relevance to us. But when we listen to audio content, which is not persistent and is sequential, such skimming is not possible. In developing countries, cell-phone penetration is much higher than Internet penetration. Moreover, due to low literacy, voice is a convenient modality to access information. The skimming techniques are therefore more critical in the audio domain. In this paper, we describe the technique for navigating audio content while interacting with information systems in a client server environment, where a dumb phone is the client device. The paper presents techniques for skimming audio content and for placing markers in audio. The user studies conducted with 18 users for more than 1 month, in a live setting substantiates the usability and usefulness of the navigation techniques.

[1]  Rajarshi Das,et al.  Social navigation for the spoken web , 2010, RecSys '10.

[2]  Tobias Lauer,et al.  An elastic audio slider for interactive speech skimming , 2004, NordiCHI '04.

[3]  Arun Kumar,et al.  Content creation and dissemination by-and-for users in rural areas , 2009, 2009 International Conference on Information and Communication Technologies and Development (ICTD).

[4]  Dipanjan Chakraborty,et al.  HSTP: hyperspeech transfer protocol , 2007, HT '07.

[5]  Chris Schmandt,et al.  NewsComm: a hand-held interface for interactive access to structured audio , 1996, CHI.

[6]  Dipanjan Chakraborty,et al.  VOISERV: Creation and Delivery of Converged Services through Voice for Emerging Economies , 2007, 2007 IEEE International Symposium on a World of Wireless, Mobile and Multimedia Networks.

[7]  Dipanjan Chakraborty,et al.  WWTW: the world wide telecom web , 2007, NSDR '07.

[8]  Guodong Guo,et al.  Content-based audio classification and retrieval by support vector machines , 2003, IEEE Trans. Neural Networks.

[9]  Barry Arons,et al.  SpeechSkimmer: a system for interactively skimming recorded speech , 1997, TCHI.