A Task-Centered Framework for Computationally-Grounded Science Collaborations

Collaboration is ubiquitous in today's science, yet there is limited support for coordinating scientific work. The general-purpose tools that are typically used (e.g., email, shared document editing, social coding sites), have still not replaced in-person meetings, phone calls, and extensive emails needed to coordinate and track collaborative activities. Scientists with diverse knowledge and skills around the globe could collaborate by opening scientific processes that expose all tasks and activities publicly to achieve a shared scientific question. This paper describes the Organic Data Science framework to support scientific collaborations that revolve around complex science questions that require significant coordination, entice contributors to remain engaged for extended periods of time, and enable continuous growth to accommodate new contributors as the work evolves over time. We discuss how the design of this framework incorporates principles followed by successful on-line communities. We present initial results to date of several communities that are collaborating using this framework.

[1]  Judith S. Olson,et al.  From Shared Databases to Communities of Practice: A Taxonomy of Collaboratories , 2007, J. Comput. Mediat. Commun..

[2]  Yolanda Gil,et al.  A semantic framework for automatic generation of computational workflows using distributed data and component catalogues , 2011, J. Exp. Theor. Artif. Intell..

[3]  Yolanda Gil,et al.  Supporting Open Collaboration in Science Through Explicit and Linked Semantic Description of Processes , 2015, ESWC.

[4]  Quentin Jones,et al.  An empirical study of critical mass and online community survival , 2010, CSCW '10.

[5]  Diomidis Spinellis,et al.  The collaborative organization of knowledge , 2008, CACM.

[6]  E. Birney The making of ENCODE: Lessons for big-data projects , 2012, Nature.

[7]  P. Resnick,et al.  Building Successful Online Communities: Evidence-Based Social Design , 2012 .

[8]  Deborah L. McGuinness,et al.  Investigations into Trust for Collaborative Information Repositories: A Wikipedia Case Study , 2006, MTW.

[9]  Jeroen J. G. van Merriënboer,et al.  Training Complex Cognitive Skills: A Four-Component Instructional Design Model for Technical Training , 1997 .

[10]  Cláudio T. Silva,et al.  Towards Enabling Social Analysis of Scientific Data , 2008 .

[11]  Markus Krötzsch,et al.  Semantic MediaWiki , 2006, International Semantic Web Conference.

[12]  Thomas A. Finholt,et al.  The Long Now of Infrastructure: Articulating Tensions in Development , 2009 .

[13]  Paolo Traverso,et al.  Automated Planning: Theory & Practice , 2004 .

[14]  Paul T. Groth,et al.  Capturing Common Knowledge about Tasks: Intelligent Assistance for To-Do Lists , 2012, TIIS.

[15]  Eric Horvitz,et al.  Volunteering Versus Work for Pay: Incentives and Tradeoffs in Crowdsourcing , 2013, HCOMP.

[16]  Yolanda Gil,et al.  Knowledge capture in the wild: a perspective from semantic wiki communities , 2013, K-CAP.

[17]  M. A. Britt,et al.  Constructing representations of arguments , 2003 .

[18]  Bruce G. Coury,et al.  The development of cognitive models of planning for use in the design of project management systems , 1994, Int. J. Hum. Comput. Stud..

[19]  Edward A. Lee,et al.  Scientific workflow management and the Kepler system , 2006, Concurr. Comput. Pract. Exp..

[20]  Paul T. Groth,et al.  Wings: Intelligent Workflow-Based Design of Computational Experiments , 2011, IEEE Intelligent Systems.

[21]  Markus Krötzsch,et al.  Semantic MediaWiki , 2006, Foundations for the Web of Information and Services.

[22]  Yolanda Gil,et al.  A Task-Centered Interface for On-Line Collaboration in Science , 2015, IUI Companion.

[23]  Jure Leskovec,et al.  Governance in Social Media: A Case Study of the Wikipedia Promotion Process , 2010, ICWSM.

[24]  Aniket Kittur,et al.  Coordination in collective intelligence: the role of team structure and task interdependence , 2009, CHI.

[25]  Aniket Kittur,et al.  Beyond Wikipedia: coordination and conflict in online production groups , 2010, CSCW '10.

[26]  Cláudio T. Silva,et al.  VisTrails: visualization meets data management , 2006, SIGMOD Conference.

[27]  Yolanda Gil,et al.  A Virtual Crowdsourcing Community for Open Collaboration in Science Processes , 2015, AMCIS.

[28]  James Fogarty,et al.  Amplifying community content creation with mixed initiative information extraction , 2009, CHI.

[29]  John Riedl,et al.  The effects of group composition on decision quality in a social production community , 2010, GROUP '10.

[30]  Daniel S. Katz,et al.  Pegasus: A framework for mapping complex scientific workflows onto distributed systems , 2005, Sci. Program..