An Honest Conversation: Transparently Combining Machine and Human Speech Assistance in Public Spaces

There is widespread concern over the ways speech assistant providers currently use humans to listen to users' queries without their knowledge. We report two iterations of the TalkBack smart speaker, which transparently combines machine and human assistance. In the first, we created a prototype to investigate whether people would choose to forward their questions to a human answerer if the machine was unable to help. Longitudinal deployment revealed that most users would do so when given the explicit choice. In the second iteration we extended the prototype to draw upon spoken answers from previous deployments, combining machine efficiency with human richness. Deployment of this second iteration shows that this corpus can help provide relevant, human-created instant responses. We distil lessons learned for those developing conversational agents or other AI-infused systems about how to appropriately enlist human-in-the-loop information services to benefit users, task workers and system performance.

[1]  T. Ingold Making: Anthropology, Archaeology, Art and Architecture , 2013 .

[2]  Kentaro Toyama,et al.  Intermediated technology use in developing communities , 2010, CHI.

[3]  Abigail Sellen,et al.  "Like Having a Really Bad PA": The Gulf between User Expectation and Experience of Conversational Agents , 2016, CHI.

[4]  Anirudha Joshi,et al.  Challenges In Supporting The Emergent User , 2018, IndiaHCI.

[5]  Apoorva Bhalla,et al.  An exploratory study understanding the appropriated use of voice-based Search and Assistants , 2018, IndiaHCI.

[6]  T. Ingold,et al.  Creativity and Cultural Improvisation , 2007 .

[7]  Edward Cutrell,et al.  "Yours is better!": participant response bias in HCI , 2012, CHI.

[8]  K. Taylor,et al.  In Defense of Ugliness: The Role of Technical Presence in Critical Infrastructure System Endurance , 2007, 2007 IEEE International Symposium on Technology and Society.

[9]  Paul N. Bennett,et al.  Guidelines for Human-AI Interaction , 2019, CHI.

[10]  Bhiksha Raj,et al.  Viral Spread via Entertainment and Voice-Messaging Among Telephone Users in India , 2016, ICTD.

[11]  Gary Marsden,et al.  'Visual literacy' as challenge to the internationalisation of interfaces: a study of South African student web users , 2002, CHI Extended Abstracts.

[12]  Paul Dourish,et al.  Beyond the user: use and non-use in HCI , 2009, OZCHI.

[13]  Gaetano Borriello,et al.  Sangeet Swara: A Community-Moderated Voice Forum in Rural India , 2015, CHI.

[14]  Ingold Tim,et al.  Creativity and Cultural Improvisation. An Introduction [w:] ciż, eds , 2007 .

[15]  Anirudha Joshi,et al.  Technology adoption by 'emergent' users: the user-usage model , 2013, APCHI.

[16]  Sundar Burra,et al.  Getting the information base for Dharavi's redevelopment , 2009 .

[17]  Eli Blevis,et al.  Regarding Software as a Material of Design , 2006 .

[18]  Anirudha Joshi,et al.  StreetWise: Smart Speakers vs Human Help in Public Slum Settings , 2019, CHI.

[19]  Sarah Sharples,et al.  Voice Interfaces in Everyday Life , 2018, CHI.

[20]  Richard Harper,et al.  The Role of HCI in the Age of AI , 2019, Int. J. Hum. Comput. Interact..

[21]  Anirudha Joshi,et al.  Diversifying Future-Making Through Itinerative Design , 2019, ACM Trans. Comput. Hum. Interact..

[22]  Mary L. Gray,et al.  Ghost Work: How to Stop Silicon Valley from Building a New Global Underclass , 2019 .

[23]  Ece Kamar,et al.  Directions in Hybrid Intelligence: Complementing AI Systems with Human Intelligence , 2016, IJCAI.

[24]  Jennifer Pearson,et al.  Revisiting “Hole in the Wall” Computing: Private Smart Speakers and Public Slum Settings , 2018, CHI.