Winder: Linking Speech and Visual Objects to Support Communication in Asynchronous Collaboration

Team members commonly collaborate on visual documents remotely and asynchronously. Particularly, students are frequently restricted to this setting as they often do not share work schedules or physical workspaces. As communication in this setting has delays and limits the main modality to text, members exert more effort to reference document objects and understand others’ intentions. We propose Winder, a Figma plugin that addresses these challenges through linked tapes—multimodal comments of clicks and voice. Bidirectional links between the clicked-on objects and voice recordings facilitate understanding tapes: selecting objects retrieves relevant recordings, and playing recordings highlights related objects. By periodically prompting users to produce tapes, Winder preemptively obtains information to satisfy potential communication needs. Through a five-day study with eight teams of three, we evaluated the system’s impact on teams asynchronously designing graphical user interfaces. Our findings revealed that producing linked tapes could be as lightweight as face-to-face (F2F) interactions while transmitting intentions more precisely than text. Furthermore, with preempted tapes, teammates coordinated tasks and invited members to build on each others’ work.

[1]  Nicholas Chen,et al.  RichReview: blending ink, speech, and gesture to support collaborative document review , 2014, UIST.

[2]  Jonathan Grudin,et al.  Why CSCW applications fail: problems in the design and evaluationof organizational interfaces , 1988, CSCW '88.

[3]  Jonathan Trevor,et al.  Anchored conversations: chatting in the context of a document , 2000, CHI.

[4]  Dan Cosley,et al.  What's mine is mine: territoriality in collaborative authoring , 2009, CHI.

[5]  Robert E. Kraut,et al.  Expressive richness: a comparison of speech and text as media for revision , 1991, CHI.

[6]  Tovi Grossman,et al.  MicroMentor: Peer-to-Peer Software Help Sessions in Three Minutes or Less , 2020, CHI.

[7]  Melissa C. Thomas-Hunt,et al.  When timeliness matters: the effect of status on reactions to perceived time delay within distributed collaboration. , 2006, The Journal of applied psychology.

[8]  Martin Pielot,et al.  Didn't you see my message?: predicting attentiveness to mobile instant messages , 2014, CHI.

[9]  Tovi Grossman,et al.  Chronicle: capture, exploration, and playback of document workflow histories , 2010, UIST.

[10]  Jacob O. Wobbrock,et al.  LemonAid: selection-based crowdsourced contextual help for web applications , 2012, CHI.

[11]  Nicholas Chen,et al.  RichReview++: Deployment of a Collaborative Multi-modal Annotation System for Instructor Feedback and Peer Discussion , 2016, CSCW.

[12]  Saul Greenberg,et al.  A Framework for Asynchronous Change Awareness in Collaboratively-Constructed Documents , 2004, CRIWG.

[13]  Jonas Voss,et al.  Figma: collaborative interface design tool , 2019 .

[14]  Paul Johns,et al.  Video threads: asynchronous video sharing for temporally distributed teams , 2012, CSCW.

[15]  Steve Oney,et al.  Callisto: Capturing the "Why" by Connecting Conversations with Computational Narratives , 2020, CHI.

[16]  James D. Thompson Organizations in Action: Social Science Bases of Administrative Theory , 1967 .

[17]  Amy X. Zhang,et al.  Making Sense of Group Chat through Collaborative Tagging and Summarization , 2018, Proc. ACM Hum. Comput. Interact..

[18]  Barry M. Kroll Cognitive Egocentrism and the Problem of Audience Awareness in Written Discourse. , 1978 .

[19]  Hideaki Kuzuoka,et al.  Handling documents and discriminating objects in hybrid spaces , 2006, CHI.

[20]  Stina Matthiesen,et al.  Does Distance Still Matter? Revisiting the CSCW Fundamentals on Distributed Collaboration , 2014, TCHI.

[21]  Jan Pries-Heje,et al.  Why Scrum Works: A Case Study from an Agile Distributed Project in Denmark and India , 2011, 2011 AGILE Conference.

[22]  Juho Kim,et al.  Korero: Facilitating Complex Referencing of Visual Materials in Asynchronous Discussion Interface , 2017, Proc. ACM Hum. Comput. Interact..

[23]  Hao-Chuan Wang,et al.  AlphaRead: Support Unambiguous Referencing in Remote Collaboration with Readable Object Annotation , 2017, CSCW.

[24]  Guang Li,et al.  SketchComm: a tool to support rich and flexible asynchronous communication of early design ideas , 2012, CSCW.

[25]  A. Girardi,et al.  A Systematic Literature Review of Teamwork Pedagogy in Higher Education , 2016 .

[26]  Carl Gutwin,et al.  A Descriptive Framework of Workspace Awareness for Real-Time Groupware , 2002, Computer Supported Cooperative Work (CSCW).

[27]  Bernt Schiele,et al.  Learning Video Object Segmentation from Static Images , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[28]  Mark S. Ackerman,et al.  Successful classroom deployment of a social document annotation system , 2012, CHI.

[29]  Juho Kim,et al.  Snapstream: Snapshot-based Interaction in Live Streaming for Visual Art , 2020, CHI.

[30]  M. W. van Someren,et al.  The think aloud method: a practical approach to modelling cognitive processes , 1994 .

[31]  James W. Pennebaker,et al.  Improving teamwork using real-time language feedback , 2013, CHI.

[32]  Tovi Grossman,et al.  Workflow Graphs: A Computational Model of Collective Task Strategies for 3D Design Software , 2020, Graphics Interface.

[33]  Nickolas A. Jordan,et al.  Integrating Social Media into the Classroom Curriculum , 2013 .

[34]  Paul Resnick,et al.  Creating Guided Code Explanations with chat.codes , 2018, Proc. ACM Hum. Comput. Interact..

[35]  James A. Landay,et al.  Interactive sketching for the early stages of user interface design , 1995, CHI '95.

[36]  Chiara Rossitto,et al.  Understanding Constellations of Technologies in Use in a Collaborative Nomadic Setting , 2014, Computer Supported Cooperative Work (CSCW).

[37]  Carman Neustaedter,et al.  FamilyStories: Asynchronous Audio Storytelling for Family Members Across Time Zones , 2020, CHI.

[38]  Stephanie Bell Project-Based Learning for the 21st Century: Skills for the Future , 2010 .

[39]  Stanley Peters,et al.  Meeting adjourned: off-line learning interfaces for automatic meeting understanding , 2008, IUI '08.

[40]  Geraldine Fitzpatrick,et al.  Reflecting on reflection: framing a design landscape , 2010, OZCHI '10.

[41]  Gökhan Tür,et al.  This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final publication. 1 The CALO Meeting Assistant System , 2022 .

[42]  Heng-Yu Ku,et al.  Collaboration factors, teamwork satisfaction, and student attitudes toward online collaborative learning , 2013, Comput. Hum. Behav..

[43]  Gayle S. Christensen,et al.  The MOOC Phenomenon: Who Takes Massive Open Online Courses and Why? , 2013 .

[44]  Sang Won Lee,et al.  Codeon: On-Demand Software Development Assistance , 2017, CHI.

[45]  Haiyi Zhu,et al.  Understanding Social Costs in Online Question Asking , 2019, CHI Extended Abstracts.

[46]  Kwan-Liu Ma,et al.  TalkTraces: Real-Time Capture and Visualization of Verbal Content in Meetings , 2019, CHI.

[47]  Thomas Kvan,et al.  Expertise, collaboration and bandwidth , 1998, CHI.

[48]  Scott E. Hudson,et al.  QnA: augmenting an instant messaging client to balance user responsiveness and performance , 2004, CSCW.

[49]  François Guimbretière,et al.  TypeTalker: A Speech Synthesis-Based Multi-Modal Commenting System , 2017, CSCW.

[50]  Judith S. Olson,et al.  Distance Matters , 2000, Hum. Comput. Interact..

[51]  Sharon L. Oviatt,et al.  Ten myths of multimodal interaction , 1999, Commun. ACM.