A genetic search of patterns of behaviour in OSS communities

This paper proposes the identification of patterns of behaviour of open source software (OSS) communities using factor analysis and their social network analysis (SNA) features. OSS communities can be modelled as a social network in which nodes represent the community members and arcs represent the social interactions among them, and factor analysis is able to provide the factors that explain the latent patterns of behaviour. Due to the complexity of the problem and the high number of SNA features that can be extracted, this paper proposes a genetic search of an optimum subset of indicators leading to a group of latent patterns of behaviour maximizing the explained data variance and the interpretation of factors. Obtained results illustrate the feasibility of the proposed framework to extract relevant information from a large set of data.

[1]  Sergio L. Toral Marín,et al.  Strategic group identification using evolutionary computation , 2010, Expert Syst. Appl..

[2]  K. Dejong,et al.  An analysis of the behavior of a class of genetic adaptive systems , 1975 .

[3]  Jennifer Preece,et al.  Sociability and usability in online communities: Determining and measuring success , 2001, Behav. Inf. Technol..

[4]  Chitu Okoli,et al.  Investigating recognition-based performance in an open content community: A social capital perspective , 2007, Inf. Manag..

[5]  Gregorio Robles,et al.  Evolution of Volunteer Participation in Libre Software Projects: Evidence from Debian , 2005 .

[6]  Daniel Gutiérrez-Reina,et al.  An evolutionary computation approach for designing mobile ad hoc networks , 2012, Expert Syst. Appl..

[7]  J.T. Alander,et al.  On optimal population size of genetic algorithms , 1992, CompEuro 1992 Proceedings Computer Systems and Software Engineering.

[8]  Guy Theraulaz,et al.  m s Self-Organization Patterns in Wasp and Open Source Communities , 2006 .

[9]  J. Herbsleb,et al.  Two case studies of open source software development: Apache and Mozilla , 2002, TSEM.

[10]  António Lucas Soares,et al.  Improving the quality of collaboration requirements for information management through social networks analysis , 2007, Int. J. Inf. Manag..

[11]  D HerbslebJames,et al.  Two case studies of open source software development , 2002 .

[12]  W. Edward Steinmueller,et al.  The institutions of open source software: Examining the Debian community , 2008, Inf. Econ. Policy.

[13]  Françoise Détienne,et al.  Participation in online interaction spaces: Design-use mediation in an Open Source Software community , 2009 .

[14]  Sergio Toral,et al.  International comparison of R&D investment by European, US and Japanese companies , 2010 .

[15]  Jitesh H. Panchal,et al.  Co-Evolution of Products & Communities in Mass Collaborative Product Development: a Computational Exploration , 2009 .

[16]  Joel West,et al.  The Role of Participation Architecture in Growing Sponsored Open Source Communities , 2008 .

[17]  John H. Holland,et al.  Adaptation in Natural and Artificial Systems: An Introductory Analysis with Applications to Biology, Control, and Artificial Intelligence , 1992 .

[18]  Ajay Mehra The Development of Social Network Analysis: A Study in the Sociology of Science , 2005 .

[19]  George Kuk,et al.  Strategic Interaction and Knowledge Sharing in the KDE Developer Mailing List , 2006, Manag. Sci..

[20]  Gregory R. Madey,et al.  Network Analysis of the SourceForge.net Community , 2007, OSS.

[21]  Yuan Long,et al.  Social Network Structures in Open Source Software Development Teams , 2007, J. Database Manag..

[22]  Sergio L. Toral Marín,et al.  Analysis of virtual communities supporting OSS projects using social network analysis , 2010, Inf. Softw. Technol..

[23]  Vladimir Batagelj,et al.  Exploratory Social Network Analysis with Pajek: Cohesion , 2005 .

[24]  Kenneth Alan De Jong,et al.  An analysis of the behavior of a class of genetic adaptive systems. , 1975 .

[25]  David E. Goldberg,et al.  Genetic Algorithms in Search Optimization and Machine Learning , 1988 .

[26]  Connie M. Borror,et al.  Methods of Multivariate Analysis, 2nd Ed. , 2004 .

[27]  A. Balaban,et al.  Topological Indices and Related Descriptors in QSAR and QSPR , 2003 .

[28]  Wonseok Oh,et al.  Broken Ties: The Impact of Organizational Restructuring on the Stability of Information-Processing Networks , 2007, J. Manag. Inf. Syst..

[29]  Kevin Crowston,et al.  The social structure of free and open source software development , 2005, First Monday.

[30]  Brian Fitzgerald,et al.  Understanding open source software development , 2002 .

[31]  Kouichi Kishida,et al.  The Co-Evolution of Systems and Communities in Free and Open Source Software Development , 2005 .

[32]  Kon Shing Kenneth Chung,et al.  Actor centrality correlates to project based coordination , 2006, CSCW '06.

[33]  Sven Apel,et al.  Types and modularity for implicit invocation with implicit announcement , 2010, TSEM.

[34]  Sergio L. Toral Marín,et al.  Virtual communities as a resource for the development of OSS projects: the case of Linux ports to embedded processors , 2009, Behav. Inf. Technol..

[35]  David Hinds,et al.  Social Network Structure as a Critical Success Condition for Virtual Communities , 2008, Proceedings of the 41st Annual Hawaii International Conference on System Sciences (HICSS 2008).

[36]  Param Vir Singh,et al.  The small-world effect , 2010, ACM Trans. Softw. Eng. Methodol..

[37]  Yeonhee Lee,et al.  Application of factor analysis for service R&D classification: A case study on the Korean ICT industry , 2011, Expert Syst. Appl..

[38]  Zbigniew Michalewicz,et al.  Genetic Algorithms + Data Structures = Evolution Programs , 1996, Springer Berlin Heidelberg.

[39]  Liaquat Hossain,et al.  Exploring computer supported collaborative coordination through social networks , 2011 .

[40]  Zbigniew Michalewicz,et al.  Genetic algorithms + data structures = evolution programs (3rd ed.) , 1996 .

[41]  Sergio L. Toral Marín,et al.  An empirical study of the driving forces behind online communities , 2009, Internet Res..

[42]  Jin Xu,et al.  A Topological Analysis of the Open Souce Software Development Community , 2005, Proceedings of the 38th Annual Hawaii International Conference on System Sciences.

[43]  Liaquat Hossain,et al.  Social networks and coordination performance of distributed software development teams , 2009 .

[44]  Jin Xu,et al.  Application of Social Network Analysis to the Study of Open Source Software , 2006 .

[45]  Ioannis Stamelos,et al.  Identifying knowledge brokers that yield software engineering knowledge in OSS projects , 2006, Inf. Softw. Technol..

[46]  Kevin Crowston,et al.  Hierarchy and centralization in free and open source software team communications , 2006 .

[47]  Sergio L. Toral Marín,et al.  International comparison of R&D investment by European, US and Japanese companies , 2010, Int. J. Technol. Manag..

[48]  Christopher Durugbo,et al.  Modelling user participation in organisations as networks , 2012, Expert Syst. Appl..

[49]  Walt Scacchi Free/open source software development , 2007, ESEC-FSE '07.

[50]  Fadi P. Deek,et al.  Open Source: Technology and Policy , 2007 .