WIRE3: Driving Around the Information Super-Highway

Abstract: Interactive voice browsers offer an alternative paradigm that affords ubiquitous mobile access to the WWW using a wide range of consumer devices. This technology can facilitate a safe, “hands-free” browsing environment that is of importance both to car drivers and various mobile and technical professionals. This paper describes the challenges of architecting an interactive voice browser that combines digital audio with the features of a speech synthesizer to make structural elements of the document explicit to the listener. The aesthetics of the audio rendition can simultaneously help reduce the monotony factor and enhance comprehension. The evolution of the voice browser gave rise to a new conceptual model of the HTML document structure and its mapping to a 3D audio space. A number of novel features are discussed for improving both the user’s comprehension of the HTML document structure and their orientation within it. These factors, in turn, can improve the effectiveness of the browsing experience.

[1]  Stuart Goose,et al.  Enhancing Web accessibility via the Vox Portal and a Web-hosted dynamic HTMLVoxML converter , 2000, Comput. Networks.

[2]  Dan Benson,et al.  Browsing the world wide web in a non-visual environment , 1997 .

[3]  Allen Newell,et al.  The psychology of human-computer interaction , 1983 .

[4]  Justinian P. Rosca,et al.  Broadband Direction-Of-Arrival Estimation Based on Second Order Statistics , 1999, NIPS.

[5]  Frankie James Presenting HTML Structure in Audio: User Satisfaction with Audio Hypertext , 1998 .

[6]  Helen Petrie,et al.  Initial design and evaluation of an interface to hypermedia systems for blind users , 1997, HYPERTEXT '97.

[7]  Andrew F. Monk,et al.  Mode Errors: A User-Centered Analysis and Some Preventative Measures Using Keying-Contingent Sound , 1986, Int. J. Man Mach. Stud..

[8]  Özgür Yilmaz,et al.  Blind separation of disjoint orthogonal signals: demixing N sources from 2 mixtures , 2000, 2000 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.00CH37100).

[9]  Takashi Itoh,et al.  User interface of a Home Page Reader , 1998, Assets '98.

[10]  George P. Landow,et al.  The rhetoric of hypermedia: Some rules for authors , 1989, J. Comput. High. Educ..

[11]  Barry Arons Hyperspeech: navigating in speech-only hypermedia , 1991, HYPERTEXT '91.

[12]  Chris Schmandt,et al.  AudioStreamer: exploiting simultaneity for listening , 1995, CHI 95 Conference Companion.

[13]  Murray Crease,et al.  Making progress with sounds - the design & evaluation of an audio progress bar , 1998 .

[14]  William W. Gaver Auditory Icons: Using Sound in Computer Interfaces , 1986, Hum. Comput. Interact..

[15]  Meera Blattner,et al.  Earcons and Icons: Their Structure and Common Design Principles , 1989, Hum. Comput. Interact..

[16]  Bill Gardner,et al.  HRTF Measurements of a KEMAR Dummy-Head Microphone , 1994 .

[17]  Elizabeth D. Mynatt,et al.  The Mercator Environment: A Nonvisual Interface to X Windows and Unix Workstations , 1992 .

[18]  Chris Schmandt,et al.  Dynamic Soundscape: mapping time to space for audio browsing , 1997, CHI.

[19]  Victor Zue,et al.  WebGALAXY: Beyond Point and Click - a Conversational Interface to a Browser , 1997, Comput. Networks.

[20]  Barry Arons,et al.  A Review of The Cocktail Party Effect , 1992 .

[21]  Michael T. Turvey,et al.  An auditory analogue of the sperling partial report procedure: Evidence for brief auditory storage , 1972 .

[22]  Simon R. Oldfield,et al.  Acuity of Sound Localisation: A Topography of Auditory Space. I. Normal Hearing Conditions , 1984, Perception.

[23]  Stuart Goose,et al.  1-800-hypertext: browsing hypertext with a telephone , 1998, HYPERTEXT '98.

[24]  Stephen A. Brewster,et al.  Parallel earcons: reducing the length of audio messages , 1995, Int. J. Hum. Comput. Stud..