Internet Collaboration on Extremely Difficult Problems: Research versus Olympiad Questions on the Polymath Site

Despite the existence of highly successful Internet collaborations on complex projects, including open-source software, little is known about how Internet collaborations work for solving "extremely" difficult problems, such as open-ended research questions. We quantitatively investigate a series of efforts known as the Polymath projects, which tackle mathematical research problems through open online discussion. A key analytical insight is that we can contrast the polymath projects with mini-polymaths -- spinoffs that were conducted in the same manner as the polymaths but aimed at addressing math Olympiad questions, which, while quite difficult, are known to be feasible. Our comparative analysis shifts between three elements of the projects: the roles and relationships of the authors, the temporal dynamics of how the projects evolved, and the linguistic properties of the discussions themselves. We find interesting differences between the two domains through each of these analyses, and present these analyses as a template to facilitate comparison between Polymath and other domains for collaboration and communication. We also develop models that have strong performance in distinguishing research-level comments based on any of our groups of features. Finally, we examine whether comments representing research breakthroughs can be recognized more effectively based on their intrinsic features, or by the (re-)actions of others, and find good predictive power in linguistic features.

[1]  Aniket Kittur,et al.  The polymath project: lessons from a successful online collaboration in mathematics , 2011, CHI.

[2]  Yutaka Yamauchi,et al.  Collaboration with Lean Media: how open-source software succeeds , 2000, CSCW '00.

[3]  Terence Tao,et al.  The Erdos discrepancy problem , 2015, 1509.05363.

[4]  Jon M. Kleinberg,et al.  You Had Me at Hello: How Phrasing Affects Memorability , 2012, ACL.

[5]  James W. Pennebaker,et al.  Linguistic Inquiry and Word Count (LIWC2007) , 2007 .

[6]  T. Gowers,et al.  Massively collaborative mathematics , 2009, Nature.

[7]  Alison Pease,et al.  Seventy four minutes of mathematics: An analysis of the third Mini−Polymath project , 2012 .

[8]  W. Glänzel,et al.  Analysing Scientific Networks Through Co-Authorship , 2004 .

[9]  Jure Leskovec,et al.  What's in a Name? Understanding the Interplay between Titles, Content, and Communities in Social Media , 2013, ICWSM.

[10]  Bo Pang,et al.  The effect of wording on message propagation: Topic- and author-controlled natural experiments on Twitter , 2014, ACL.

[11]  Henk F. Moed,et al.  Handbook of Quantitative Science and Technology Research , 2005 .

[12]  Michael J. Barany,et al.  '[B]ut this is blog maths and we're free to make up conventions as we go along': Polymath1 and the modalities of 'massively collaborative mathematics' , 2010, Int. Sym. Wikis.

[13]  J. Dinardo,et al.  Natural Experiments and Quasi-Natural Experiments , 2010 .

[14]  Jure Leskovec,et al.  A computational approach to politeness with application to social factors , 2013, ACL.

[15]  George Lakoff,et al.  Hedges: A study in meaning criteria and the logic of fuzzy concepts , 1973, J. Philos. Log..

[16]  Aniket Kittur,et al.  Harnessing the wisdom of crowds in wikipedia: quality through coordination , 2008, CSCW.

[17]  Benjamin F. Jones,et al.  Multi-University Research Teams: Shifting Impact, Geography, and Stratification in Science , 2008, Science.

[18]  H. Kucera,et al.  Computational analysis of present-day American English , 1967 .

[19]  Daniel B. Horn,et al.  Six degrees of jonathan grudin: a social network analysis of the evolution and impact of CSCW research , 2004, CSCW.