How to Build a Spoken Dialog System with Limited ( or no ) Language Resources

This paper evaluates low cost, rapidly deployable speech technologies for new languages as a means to improve equitable, affordable access to information technology (IT). We describe our field work in Tamil Nadu, recording speech from within a multi-modal (speech and touch) dialog system. The performance of a speech recognizer built using cross-language adaptation is evaluated on our field recordings, with implications for an iterative, learning approach for spoken dialog systems in conditions of limited language resources.

[1]  Kentaro Toyama,et al.  Text-Free User Interfaces for Illiterate and Semi-Literate Users , 2006, 2006 International Conference on Information and Communication Technologies and Development.

[2]  P. Harris,et al.  Reasoning From Unfamiliar Premises , 2005, Psychology Science.

[3]  Jean-Luc Gauvain,et al.  Lightly Supervised Acoustic Model Training , 2000 .

[4]  C. S. Kumar,et al.  A bilingual speech recognition system for English and Tamil , 2003, Fourth International Conference on Information, Communications and Signal Processing, 2003 and the Fourth Pacific Rim Conference on Multimedia. Proceedings of the 2003 Joint.

[5]  Karl Magnus Petersson,et al.  Language Processing Modulated by Literacy: A Network Analysis of Verbal Repetition in Literate and Illiterate Subjects , 2000, Journal of Cognitive Neuroscience.

[6]  Masahiro Araki,et al.  Spoken, Multilingual and Multimodal Dialogue Systems: Development and Assessment , 2005 .

[7]  Rosalie A. Zobel-Pocock International user interfaces , 1990 .

[8]  A. Waibel,et al.  Multilinguality in speech and spoken language systems , 2000, Proceedings of the IEEE.

[9]  D. Crystal What is language death , 2002 .

[10]  Dilek Z. Hakkani-Tür,et al.  Active and unsupervised learning for automatic speech recognition , 2003, INTERSPEECH.

[11]  Alexander H. Waibel,et al.  Unsupervised training of a speech recognizer using TV broadcasts , 1998, ICSLP.

[12]  Joyojeet Pal,et al.  Speech Recognition for Illiterate Access to Information and Technology , 2006, 2006 International Conference on Information and Communication Technologies and Development.

[13]  Ben Shneiderman,et al.  Designing the User Interface: Strategies for Effective Human-Computer Interaction , 1998 .

[14]  Frederick Jelinek,et al.  Five speculations (and a divertimento) on the themes of H. Bourlard, H. Hermansky, and N. Morgan , 1996, Speech Commun..

[15]  V. Borooah,et al.  Gender bias among children in India in their diet and immunisation against disease. , 2004, Social science & medicine.

[16]  Thomas C. Ormerod,et al.  Understanding interfaces - a handbook of human-computer dialogue , 1994, Computers and people series.

[17]  Hermann Ney,et al.  Multilingual acoustic modeling using graphemes , 2003, INTERSPEECH.

[18]  Etienne Barnard,et al.  The efficient generation of pronunciation dictionaries: machine learning factors during bootstrapping , 2004, INTERSPEECH.

[19]  Etienne Barnard,et al.  Language and Technology Literacy Barriers to Accessing Government Services , 2003, EGOV.

[20]  J. Moor What Is Computer Ethics?* , 1985, The Ethics of Information Technologies.

[21]  Joyojeet Pal,et al.  The challenges of technology research for developing regions , 2006, IEEE Pervasive Computing.

[22]  E. Soola Agricultural communication and the African non-literate farmer: the Nigerian experience. , 1988, Africa media review.

[23]  J. Donner Microentrepreneurs and Mobiles: An Exploration of the Uses of Mobile Phones by Small Business Owners in Rwanda , 2004 .

[24]  G. Psacharopoulos Returns to investment in education: A global update , 1994 .

[25]  S. Deo,et al.  DIGITAL LIBRARY ACCESS FOR ILLITERATE USERS , 2004 .

[26]  R. Swaminathan,et al.  Multilingual Speech Recognition for Information Retrieval in Indian Context , 2004, HLT-NAACL.

[27]  Tanja Schultz,et al.  Multilingual and Crosslingual Speech Recognition , 1998 .

[28]  E. F. Schumacher Small Is Beautiful: Economics as if People Mattered , 1973 .

[29]  Justin Chisenga Global Information Infrastructure and the Question of African Content , 2002 .

[30]  Taylor C. Boas,et al.  Will the Digital Revolution Revolutionize Development? Drawing Together the Debate , 2005 .