A Methodology for Analyzing Uptake of Software Technologies Among Developers

Motivation: The question of what combination of attributes drives the adoption of a particular software technology is critical to developers. It determines both those technologies that receive wide support from the community and those which may be abandoned, thus rendering developers' investments worthless. Aim and Context: We model software technology adoption by developers and provide insights on specific technology attributes that are associated with better visibility among alternative technologies. Approach: We leverage social contagion theory and statistical modeling to identify, define, and test empirically measures that are likely to affect software adoption. More specifically, we leverage a large collection of open source version control repositories to construct a software dependency chain for a specific set of R language source-code files. We formulate logistic regression models, to investigate the combination of technological attributes that drive adoption among competing data frame implementations in the R language: tidy and data.table. We quantify key project attributes that might affect adoption and also characteristics of developers making the selection. Results: We find that a quick response to raised issues, a larger number of overall deployments, and a larger number of high-quality StackExchange questions are associated with higher adoption. Decision makers tend to adopt the technology that is closer to them in the technical dependency network and in author collaborations networks while meeting their performance needs. Future work: We hope that our methodology encompassing social contagion that captures both rational and irrational preferences and the elucidation of key measures from large collections of version control data provides a general path toward increasing visibility, driving better informed decisions, and producing more sustainable and widely adopted software

[1]  Xavier Blanc,et al.  Mining Library Migration Graphs , 2012, 2012 19th Working Conference on Reverse Engineering.

[2]  Steven T. Berry Estimating Discrete-Choice Models of Product Differentiation , 1994 .

[3]  Ralf Lämmel,et al.  Large-scale, AST-based API-usage analysis of open-source Java projects , 2011, SAC.

[4]  Vallabh Sambamurthy,et al.  Social Contagion and Information Technology Diffusion: The Adoption of Electronic Medical Records in U.S. Hospitals , 2010, Manag. Sci..

[5]  James D. Herbsleb,et al.  Social coding in GitHub: transparency and collaboration in an open software repository , 2012, CSCW.

[6]  S. Chopra,et al.  Supply Chain Management: Strategy, Planning & Operation , 2007 .

[7]  K. Small,et al.  Applied Welfare Economics with Discrete Choice Models , 1979 .

[8]  Martin P. Robillard,et al.  SemDiff: Analysis and recommendation support for API evolution , 2009, 2009 IEEE 31st International Conference on Software Engineering.

[9]  M. Christopher Logistics and supply chain management , 2011 .

[10]  Yves Croissant,et al.  multinomial logit model , 2013 .

[11]  Cor-Paul Bezemer,et al.  Logging Library Migrations: A Case Study for the Apache Software Foundation Projects , 2016, 2016 IEEE/ACM 13th Working Conference on Mining Software Repositories (MSR).

[12]  A.A. Chhajed,et al.  Software focused supply chains: challenges and issues , 2005, INDIN '05. 2005 3rd IEEE International Conference on Industrial Informatics, 2005..

[13]  Anthony Finkelstein,et al.  Exploiting software supply chain business architecture: a research agenda , 1999, ICSE 1999.

[14]  Audris Mockus,et al.  Software Engineering for Big Data Systems , 2016, IEEE Softw..

[15]  Ralf,et al.  Swing to SWT and back: Patterns for API migration by wrapping , 2010, ICSM 2010.

[16]  Richard L. Daft,et al.  Innovation in Organizations: Innovation Adoption in School Organizations. , 1979 .

[17]  Audris Mockus,et al.  Software Support Tools and Experimental Work , 2006, Empirical Software Engineering Issues.

[18]  Vineet Padmanabhan,et al.  Comments on "Information Distortion in a Supply Chain: The Bullwhip Effect" , 1997, Manag. Sci..

[19]  Sarah Nadi,et al.  Which Library Should I Use?: A Metric-Based Comparison of Software Libraries , 2018, 2018 IEEE/ACM 40th International Conference on Software Engineering: New Ideas and Emerging Technologies Results (ICSE-NIER).

[20]  James D. Herbsleb,et al.  Influence of social and technical factors for evaluating contribution in GitHub , 2014, ICSE.

[21]  Carol Woody,et al.  Supply-Chain Risk Management: Incorporating Security into Software Development , 2010, 2010 43rd Hawaii International Conference on System Sciences.

[22]  Greg M. Allenby,et al.  A Choice Model with Conjunctive, Disjunctive, and Compensatory Screening Rules , 2004 .

[23]  Marco Tulio Valente,et al.  Apiwave: Keeping track of API popularity and migration , 2015, 2015 IEEE International Conference on Software Maintenance and Evolution (ICSME).

[24]  Samuel H. Huang,et al.  A product driven approach to manufacturing supply chain selection , 2002 .

[25]  Audris Mockus,et al.  Amassing and indexing a large sample of version control systems: Towards the census of public source code history , 2009, 2009 6th IEEE International Working Conference on Mining Software Repositories.

[26]  R. Burt Social Contagion and Innovation: Cohesion versus Structural Equivalence , 1987, American Journal of Sociology.

[27]  Robert J. Walker,et al.  Seeking the ground truth: a retroactive study on the evolution and migration of software libraries , 2012, SIGSOFT FSE.

[28]  Audris Mockus,et al.  Crowdsourcing the discovery of software repositories in an educational environment , 2016, PeerJ Prepr..

[29]  Evan E. Anderson,et al.  Choice Models for the Evaluation and Selection of Software Packages , 1990, J. Manag. Inf. Syst..

[30]  Christopher D. Chambers,et al.  Redefine statistical significance , 2017, Nature Human Behaviour.

[31]  Donald J. Bowersox,et al.  Supply Chain Logistics Management -3/E. , 2010 .

[32]  Michael Gertz,et al.  Mining email social networks , 2006, MSR '06.

[33]  Frank M. Bass,et al.  A New Product Growth for Model Consumer Durables , 2004, Manag. Sci..

[34]  Garrett J. van Ryzin,et al.  Revenue Management Under a General Discrete Choice Model of Consumer Behavior , 2004, Manag. Sci..

[35]  Karim R. Lakhani,et al.  Community, Joining, and Specialization in Open Source Software Innovation: A Case Study , 2003 .

[36]  Audris Mockus,et al.  World of Code: An Infrastructure for Mining the Universe of Open Source VCS Data , 2019, 2019 IEEE/ACM 16th International Conference on Mining Software Repositories (MSR).

[37]  Xavier Blanc,et al.  A study of library migrations in Java , 2014, J. Softw. Evol. Process..

[38]  D. Russell,et al.  People and information technology in the supply chain: Social and organizational influences on adoption , 2004 .

[39]  Martin Burger,et al.  Mining trends of library usage , 2009, IWPSE-Evol '09.

[40]  Rakesh Nagi,et al.  A subjective evidence model for influence maximization in social networks , 2016 .

[41]  Miryung Kim,et al.  A graph-based approach to API usage adaptation , 2010, OOPSLA.

[42]  Ralf Lämmel,et al.  Study of an API Migration for Two XML APIs , 2009, SLE.

[43]  Elias Levy Poisoning the Software Supply Chain , 2003, IEEE Secur. Priv..

[44]  Hau L. Lee,et al.  Mitigating supply chain risk through improved confidence , 2004 .

[45]  Jack Greenfield Software Factories: Assembling Applications with Patterns, Models, Frameworks and Tools , 2004, GPCE.

[46]  W. Powell,et al.  The iron cage revisited institutional isomorphism and collective rationality in organizational fields , 1983 .

[47]  Andreas Zeller,et al.  Mining API Popularity , 2010, TAIC PART.

[48]  B. Kogut,et al.  Open-source Software Development and Distributed Innovation , 2001 .

[49]  Gregory K. Leonard,et al.  A utility-consistent, combined discrete choice and count data model Assessing recreational use losses due to natural resource damage , 1995 .

[50]  E. Hippel Innovation by User Communities: Learning From Open-Source Software , 2001 .

[51]  Emerson R. Murphy-Hill,et al.  Social influences on secure development tool adoption: why security tools spread , 2014, CSCW.

[52]  Joel West,et al.  Patterns of Open Innovation in Open Source Software , 2006 .

[53]  D. McFadden,et al.  MIXED MNL MODELS FOR DISCRETE RESPONSE , 2000 .

[54]  Robert G. Fichman,et al.  Going Beyond the Dominant Paradigm for Information Technology Innovation Research: Emerging Concepts and Methods , 2004, J. Assoc. Inf. Syst..

[55]  Donald J. Bowersox,et al.  Supply Chain Logistics Management , 2002 .

[56]  Jacqueline Holdsworth Software Process Design , 1995 .

[57]  D. McFadden Conditional logit analysis of qualitative choice behavior , 1972 .

[58]  Gary J. Russell,et al.  A Probabilistic Choice Model for Market Segmentation and Elasticity Structure , 1989 .

[59]  Audris Mockus,et al.  Engineering big data solutions , 2014, FOSE.

[60]  Zach G. Zacharia,et al.  DEFINING SUPPLY CHAIN MANAGEMENT , 2001 .

[61]  S. Kalish A New Product Adoption Model with Price, Advertising, and Uncertainty , 1985 .

[62]  Meeta Dasgupta,et al.  Innovation in Organizations , 2009 .