Mapping communities in large virtual social networks: Using Twitter data to find the Indie Mac community

This paper describes a multi-method approach to delineate a “real world” community of practice from a large N dataset derived from the social networking site Twitter. The starting point is previous qualitative research of a virtual community of independent (“indie”) developers who create software for Apple's Macintosh and iPhone platforms. Indie developers have been active on Twitter from an early stage on and they use Twitter to sustain interactions between peers, exchange technical information and for viral “echo chamber” marketing. The publicly available Twitter API is used to mine a network consisting of several million edges, which is sized down to a large network containing roughly 1 million edges through several pruning methods. The fast greedy algorithm is then used to detect subgraphs within this large network. Triangulation with qualitative data proves that the fast greedy algorithm is able to distill meaningful communities from a large, noisy and ill-delineated network. The accuracy of this approach gives rise to the discussion of the value for businesses and market research, since it offers opportunities to identify and monitor target audiences at a finely grained level. However, we should be wary of the serious consequences with regard to privacy and ethics. The proposed multi-method approach allows micro level inferences from a macro dataset of which the individual Twitter user might be completely unaware. The results could have consequences for the anonymity of key persons behind the scenes of social and political movements or any other communities whose members are active on Twitter or other social networks.

[1]  R. Guimerà,et al.  Modularity from fluctuations in random graphs and complex networks. , 2004, Physical review. E, Statistical, nonlinear, and soft matter physics.

[2]  M. Newman,et al.  Finding community structure in very large networks. , 2004, Physical review. E, Statistical, nonlinear, and soft matter physics.

[3]  G. G. Stokes "J." , 1890, The New Yale Book of Quotations.

[4]  L. Freeman Centrality in social networks conceptual clarification , 1978 .

[5]  John Seely Brown,et al.  Book Reviews : The Social Life of Information By John Seely Brown and Paul Duguid. Boston: Harvard Business School Press, 2000. 320 pages , 2000 .

[6]  P. Bourdieu,et al.  The Field of Cultural Production , 1993 .

[7]  Tatiana Kern Bertschinger,et al.  Quality and Quantity , 1966, Nature.

[8]  Michiel van Meeteren,et al.  Indie Fever The genesis, culture and economy of a community of independent software developers on the Macintosh OS X platform , 2008 .

[9]  R. Rosenfeld Nature , 2009, Otolaryngology--head and neck surgery : official journal of American Academy of Otolaryngology-Head and Neck Surgery.

[10]  R. Luce,et al.  Connectivity and generalized cliques in sociometric group structure , 1950, Psychometrika.

[11]  M. Gladwell,et al.  The Tipping Point , 2011 .

[12]  Ko de Ruyter,et al.  Beyond the Call of Duty: Why Customers Contribute to Firm-hosted Commercial Online Communities , 2007 .

[13]  P. Bonacich Power and Centrality: A Family of Measures , 1987, American Journal of Sociology.

[14]  R. Guimerà,et al.  Functional cartography of complex metabolic networks , 2005, Nature.

[15]  Mark Newman,et al.  Detecting community structure in networks , 2004 .

[16]  S. C. Johnson Hierarchical clustering schemes , 1967, Psychometrika.

[17]  A. Arenas,et al.  Community detection in complex networks using extremal optimization. , 2005, Physical review. E, Statistical, nonlinear, and soft matter physics.

[18]  Norman,et al.  Structural Models: An Introduction to the Theory of Directed Graphs. , 1966 .

[19]  Claudio Castellano,et al.  Defining and identifying communities in networks. , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[20]  F. Luccio,et al.  On the Decomposition of Networks in Minimally Interconnected Subnetworks , 1969 .

[21]  R. Hanneman Introduction to Social Network Methods , 2001 .

[22]  R. J. Mokken,et al.  Cliques, clubs and clans , 1979 .

[23]  M E J Newman,et al.  Modularity and community structure in networks. , 2006, Proceedings of the National Academy of Sciences of the United States of America.

[24]  Stephen B. Seidman,et al.  A graph‐theoretic generalization of the clique concept* , 1978 .

[25]  W. Martin,et al.  The Rules of The Art , 1971 .

[26]  D. Schweiger,et al.  Organization Studies , 2003 .

[27]  Richard E. Caves,et al.  Book Review , 2001 .

[28]  T. Dublin,et al.  Research policy , 2021, The Routledge Handbook of Gender and EU Politics.

[29]  S. Borgatti,et al.  LS sets, lambda sets and other cohesive subsets , 1990 .

[30]  R. Luce,et al.  A method of matrix analysis of group structure , 1949, Psychometrika.

[31]  G. Simmel The Number of Members as Determining the Sociological Form of the Group. I , 1902, American Journal of Sociology.

[32]  T. C. Hu,et al.  Synthesis of a Communication Network , 1964 .

[33]  A. Amin,et al.  Knowing in action: beyond communities of practice , 2008 .

[34]  R. Alba A graph‐theoretic definition of a sociometric clique† , 1973 .

[35]  F. Chilton,et al.  道路交通振動(Transportation Research Record,541) , 1976 .

[36]  Stephen B. Seidman,et al.  Network structure and minimum degree , 1983 .

[37]  J. Avery,et al.  The long tail. , 1995, Journal of the Tennessee Medical Association.

[38]  Weixiong Zhang,et al.  Identifying network communities with a high resolution. , 2007, Physical review. E, Statistical, nonlinear, and soft matter physics.

[39]  M. Newman,et al.  The structure of scientific collaboration networks. , 2000, Proceedings of the National Academy of Sciences of the United States of America.

[40]  M E J Newman,et al.  Community structure in social and biological networks , 2001, Proceedings of the National Academy of Sciences of the United States of America.