Evolution of the R software ecosystem: Metrics, relationships, and their impact on qualities

Abstract: Software ecosystems are an important new concept for collaborative software development, and empirical studies of how they develop are essential to understanding their underlying dynamics and modelling their behaviour. We conducted an exploratory analysis of the R ecosystem as an exemplar of high-level, ecosystem-wide assessment. Based principally on the documentation metadata of the R packages, we generated a variety of metrics that quantify the R ecosystem. We also categorized the ecosystem participants, both in the software marketplace and in the developer community, by characteristics that measure their activity and impact. By viewing our metrics across the ecosystem’s lifecycle for the various participant categories, we discovered interrelationships between them and determined the contribution of each category to the ecosystem as a whole.
