A Hidden Markov Model of Developer Learning Dynamics in Open Source Software Projects

This study develops a stochastic model of developer learning dynamics in open source software (OSS) projects. We propose a hidden Markov model (HMM) that allows us to investigate (1) the extent to which individuals learn from their own experience and from interactions with peers, (2) whether an individual's ability to learn from these activities varies as she learns over time, and (3) to what extent individual learning persists over time. We calibrate the model on six years of detailed data collected from 251 developers working on 25 OSS projects hosted at SourceForge. Using the HMM, we identify three latent learning states (high, medium, and low) and estimate the marginal impact of learning activities on moving a developer between these states. Our findings reveal that learning patterns differ across the learning states. Learning from peers appears to be the most important source of learning for developers in all three states. Developers in the medium learning state benefit the most from discussions that they initiate, whereas developers in the low and high states benefit the most from participating in discussions started by others. In the low state, developers depend entirely upon their peers to learn; in the medium or high state, they can also draw upon their own experience. We provide explanations for these varying impacts of learning activities on developers' transitions between the three learning states. The HMM is shown to outperform the classical learning curve model. The HMM developed in this study contributes to a theoretically grounded understanding of the learning behavior of individuals. Such a theory and the associated findings have important managerial and operational implications for devising interventions to promote learning in a variety of settings.
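To make the structure described above concrete, the following is a minimal sketch of a three-state HMM whose transition probabilities depend on learning activities. It is not the specification estimated in this study. The multinomial-logit form of the transitions, the Poisson emission for per-period output, and every name and parameter (`transition_matrix`, `peer_coef`, `own_coef`, `rates`, and the numeric values) are assumptions chosen only for illustration. The sketch evaluates the likelihood of one developer's observed sequence with the scaled forward algorithm.

```python
import math

STATES = ["low", "medium", "high"]  # latent learning states


def softmax(row):
    """Turn a list of logits into probabilities that sum to 1."""
    m = max(row)
    exps = [math.exp(v - m) for v in row]
    total = sum(exps)
    return [e / total for e in exps]


def transition_matrix(base_logits, peer_coef, own_coef, peer, own):
    """Row i holds P(next state | current state i) for one period.

    Assumed form: activity in state i shifts the logit of moving to
    state j by (peer_coef[i] * peer + own_coef[i] * own) * (j - i),
    so positive coefficients favor upward moves.
    """
    matrix = []
    for i, logits in enumerate(base_logits):
        push = peer_coef[i] * peer + own_coef[i] * own
        matrix.append(softmax([v + push * (j - i) for j, v in enumerate(logits)]))
    return matrix


def poisson_pmf(k, lam):
    """Assumed emission: per-period output count is Poisson given the state."""
    return math.exp(-lam) * lam ** k / math.factorial(k)


def forward_log_likelihood(obs, covariates, initial, base_logits,
                           peer_coef, own_coef, rates):
    """Scaled forward algorithm.

    obs[t] is the output count in period t; covariates[t - 1] is the
    (peer, own) activity that drives the transition into period t.
    Returns the log-likelihood and the filtered state probabilities
    for the final period.
    """
    n = len(STATES)
    alpha = [initial[s] * poisson_pmf(obs[0], rates[s]) for s in range(n)]
    scale = sum(alpha)
    log_lik = math.log(scale)
    alpha = [a / scale for a in alpha]
    for t in range(1, len(obs)):
        peer, own = covariates[t - 1]
        A = transition_matrix(base_logits, peer_coef, own_coef, peer, own)
        alpha = [
            sum(alpha[i] * A[i][j] for i in range(n)) * poisson_pmf(obs[t], rates[j])
            for j in range(n)
        ]
        scale = sum(alpha)
        log_lik += math.log(scale)
        alpha = [a / scale for a in alpha]
    return log_lik, alpha


if __name__ == "__main__":
    # Hypothetical parameter values, not estimates from the study.
    base = [[2.0, 0.0, -2.0], [0.0, 2.0, 0.0], [-2.0, 0.0, 2.0]]
    peer_coef = [0.8, 0.5, 0.6]
    own_coef = [0.0, 0.4, 0.3]  # zero in the low state: peers only
    ll, filtered = forward_log_likelihood(
        [1, 0, 3, 5], [(1, 0), (2, 1), (2, 2)], [0.6, 0.3, 0.1],
        base, peer_coef, own_coef, [0.5, 2.0, 5.0],
    )
    print(ll, dict(zip(STATES, filtered)))
```

Setting `own_coef[0]` to zero mirrors the finding that developers in the low state rely entirely on peers. In an estimation setting, the log-likelihood returned here would be summed over developers and maximized over the parameters.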
