Mining Software Repositories to Support OSS Developers: A Recommender Systems Approach

To facilitate the development activities, software developers frequently look up external sources for related information. Consulting data available at open source software (OSS) repositories can be considered as their daily routine. Nonetheless, the heterogeneity of resources and their corresponding dependencies are the main obstacles to the effective mining and exploitation of the data. Given the context, the manual search for every single resource to find the most suitable ones is a daunting and inefficient task. Thus, equipping developers with techniques and tools to accelerate the search process as well as to improve the search results will help them enhance the work efficiency. Within the scope of the EU funded CROSSMINER project, advanced techniques and tools are being conceived for providing open software developers with innovative features aiming at obtaining improvements in terms of development effort, cost savings, developer productivity, etc. To this end, cutting-edge technologies are applied, such as information retrieval and recommender systems to solve the problem of mining the rich metadata available at OSS repositories to support software developers. In this paper, we present the main research problems as well the proposed approach together with some preliminary results.

[1]  Ying Zou,et al.  API usage pattern recommendation for software development , 2017, J. Syst. Softw..

[2]  Saul Vargas,et al.  Improving sales diversity by recommending users to items , 2014, RecSys '14.

[3]  Florian Deißenböck,et al.  A structured approach to assess third-party library usage , 2012, 2012 28th IEEE International Conference on Software Maintenance (ICSM).

[4]  Gabriele Bavota,et al.  How Can I Use This Method? , 2015, 2015 IEEE/ACM 37th IEEE International Conference on Software Engineering.

[5]  Paolo Tomeo,et al.  Schema-summarization in linked-data-based feature selection for recommender systems , 2017, SAC.

[6]  Juri Di Rocco,et al.  CrossSim: Exploiting Mutual Relationships to Detect Similar OSS Projects , 2018, 2018 44th Euromicro Conference on Software Engineering and Advanced Applications (SEAA).

[7]  Collin McMillan,et al.  Detecting similar software applications , 2012, 2012 34th International Conference on Software Engineering (ICSE).

[8]  Martin P. Robillard,et al.  Recommendation Systems in Software Engineering , 2014, Springer Berlin Heidelberg.

[9]  Jian Pei,et al.  MAPO: Mining and Recommending API Usage Patterns , 2009, ECOOP.

[10]  Diomidis Spinellis,et al.  Developer-Centric Knowledge Mining from Large Open-Source Software Repositories (CROSSMINER) , 2017, STAF Workshops.

[11]  Taher H. Haveliwala Topic-sensitive PageRank , 2002, IEEE Trans. Knowl. Data Eng..

[12]  Mark Goadrich,et al.  The relationship between Precision-Recall and ROC curves , 2006, ICML.

[13]  John Riedl,et al.  Item-based collaborative filtering recommendation algorithms , 2001, WWW '01.

[14]  Paolo Tomeo,et al.  Content-Based Recommendations via DBpedia and Freebase: A Case Study in the Music Domain , 2015, International Semantic Web Conference.

[15]  David Lo,et al.  Automated library recommendation , 2013, 2013 20th Working Conference on Reverse Engineering (WCRE).

[16]  Saul Vargas,et al.  Rank and relevance in novelty and diversity metrics for recommender systems , 2011, RecSys '11.

[17]  Greg Linden,et al.  Amazon . com Recommendations Item-to-Item Collaborative Filtering , 2001 .

[18]  Ioannis Korkontzelos,et al.  OSSMETER: a software measurement platform for automatically analysing open source software projects , 2015, ESEC/SIGSOFT FSE.

[19]  Ning Chen,et al.  SimApp: A Framework for Detecting Similar Mobile Applications by Online Kernel Learning , 2015, WSDM.

[20]  Neil Yorke-Smith,et al.  A Novel Bayesian Similarity Measure for Recommender Systems , 2013, IJCAI.

[21]  Evangelos E. Milios,et al.  Node similarity in the citation graph , 2006, Knowledge and Information Systems.

[22]  Jennifer Widom,et al.  SimRank: a measure of structural-context similarity , 2002, KDD.

[23]  Paul Van Dooren,et al.  A MEASURE OF SIMILARITY BETWEEN GRAPH VERTICES . WITH APPLICATIONS TO SYNONYM EXTRACTION AND WEB SEARCHING , 2002 .

[24]  Gabriele Bavota,et al.  Mining StackOverflow to turn the IDE into a self-confident programming prompter , 2014, MSR 2014.

[25]  Mira Mezini,et al.  Ieee Transactions on Software Engineering 1 Automated Api Property Inference Techniques , 2022 .

[26]  M. Matteucci,et al.  An Evaluation Methodology for Collaborative Recommender Systems , 2008, 2008 International Conference on Automated Solutions for Cross Media Content and Multi-Channel Distribution.

[27]  Alejandro Bellogín,et al.  A comparative study of heterogeneous item recommendations in social systems , 2013, Inf. Sci..

[28]  Collin McMillan,et al.  Recommending source code examples via API call usages and documentation , 2010, RSSE '10.

[29]  Markus Zanker,et al.  Linked open data to support content-based recommender systems , 2012, I-SEMANTICS '12.