Speech Technology for Information Access: a South African Case Study

Telephone-based information access has the potential to deliver a significant positive impact in the developing world. We discuss some of the most important issues that must be addressed in order to realize this potential, including matters related to resource development, automatic speech recognition, text-to-speech systems, and user-interface design. Although our main focus has been on the eleven official languages of South Africa, we believe that many of these same issues will be relevant for the application of speech technology throughout the developing world.

[1]  Joyojeet Pal,et al.  Speech Recognition for Illiterate Access to Information and Technology , 2006, 2006 International Conference on Information and Communication Technologies and Development.

[2]  G Botha,et al.  Two approaches to gathering text corpora from the WorldWideWeb , 2005 .

[3]  Joakim Nivre,et al.  Proceedings of the 12th Conference of the European Chapter of the Association for Computational Linguistics , 2009 .

[4]  Alta de Waal,et al.  Morphological analysis: a method for selecting ICT applications in South African government service delivery , 2010 .

[5]  Marelie H. Davel,et al.  Pronunciation dictionary development in resource-scarce environments , 2009, INTERSPEECH.

[6]  E. Barnard,et al.  Phonetics of intonation in South African Bantu languages , 2008 .

[7]  Etienne Barnard,et al.  Vowel variation in Southern Sotho: an acoustic investigation , 2008 .

[8]  Ronald Rosenfeld,et al.  Speech vs. touch-tone: Telephony interfaces for information access by low literate users , 2009, 2009 International Conference on Information and Communication Technologies and Development (ICTD).

[9]  Ronald Rosenfeld,et al.  HealthLine: Speech-based access to health information by low-literate users , 2007, 2007 International Conference on Information and Communication Technologies and Development.

[10]  Etienne Barnard,et al.  The utility of spoken dialog systems , 2008, 2008 IEEE Spoken Language Technology Workshop.

[11]  Tanja Schultz,et al.  Multilingual Speech Processing , 2006 .

[12]  Etienne Barnard,et al.  ASR corpus design for resource-scarce languages , 2009, INTERSPEECH.

[13]  Madelaine Plauché,et al.  Initial Fieldwork for LWAZI: A Telephone-Based Spoken Dialog System for Rural South Africa , 2009 .

[14]  Jennifer Balogh,et al.  Voice User Interface Design , 2004 .

[15]  Etienne Barnard,et al.  Other Challenges: Non-native Speech, Dialects, Accents, and Local Interfaces , 2006 .

[16]  Etienne Barnard,et al.  Language and Technology Literacy Barriers to Accessing Government Services , 2003, EGOV.

[17]  John M. Carroll,et al.  Human-computer interaction: psychology as a science of design , 1997, Int. J. Hum. Comput. Stud..

[18]  Etienne Barnard,et al.  Phonetic alignment for speech synthesis in under-resourced languages , 2009, INTERSPEECH.

[19]  Etienne Barnard,et al.  HIV health information access using spoken dialogue systems: Touchtone vs. speech , 2009, 2009 International Conference on Information and Communication Technologies and Development (ICTD).

[20]  Etienne Barnard,et al.  Influences on tone in Sepedi, a southern Bantu language , 2008, INTERSPEECH.

[21]  Etienne Barnard,et al.  Pronunciation prediction with Default&Refine , 2008, Comput. Speech Lang..

[22]  Etienne Barnard,et al.  Basic speech recognition for spoken dialogues , 2009, INTERSPEECH.