Who should review my code? A file location-based code-reviewer recommendation approach for Modern Code Review

Software code review is an inspection of a code change by an independent third-party developer, performed to identify and fix defects before integration. Effective code review can improve overall software quality. In recent years, Modern Code Review (MCR), a lightweight, tool-based code inspection process, has been widely adopted in both proprietary and open-source software systems. Finding appropriate code-reviewers in MCR is a necessary step in reviewing a code change. However, little is known about how difficult it is to find code-reviewers in distributed software development, or about the impact of this difficulty on reviewing time. In this paper, we investigate the impact that the code-reviewer assignment problem has on reviewing time. We find that reviews affected by the code-reviewer assignment problem take 12 days longer to approve a code change. To help developers find appropriate code-reviewers, we propose RevFinder, a file location-based code-reviewer recommendation approach. RevFinder leverages the similarity of previously reviewed file paths to recommend appropriate code-reviewers, based on the intuition that files located in similar file paths are managed and reviewed by code-reviewers with similar experience. Through an empirical evaluation on 42,045 reviews from the Android Open Source Project (AOSP), OpenStack, Qt, and LibreOffice projects, we find that RevFinder correctly recommends code-reviewers for 79% of reviews within its top-10 recommendations, and that the correct code-reviewers have a median rank of 4. The overall ranking produced by RevFinder is three times better than that of a baseline approach. We believe that RevFinder can be applied in MCR to help developers find appropriate code-reviewers and speed up the overall code review process.
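
As a rough illustration of the file path similarity intuition described above, the Python sketch below scores candidate code-reviewers by how similar the paths of files they previously reviewed are to the paths of files in a new review, and ranks them. This is a minimal sketch assuming a shared-leading-path-components measure; RevFinder itself combines several string comparison techniques, and the names used here (path_similarity, recommend_reviewers, past) are illustrative only.

from collections import defaultdict

def path_similarity(path_a, path_b):
    # Fraction of shared leading path components, a simple proxy for file path similarity.
    a, b = path_a.split("/"), path_b.split("/")
    common = 0
    for x, y in zip(a, b):
        if x != y:
            break
        common += 1
    return common / max(len(a), len(b))

def recommend_reviewers(new_review_files, past_reviews, top_k=10):
    # Score each past code-reviewer by the average similarity between the files
    # they reviewed before and the files of the new review, then rank them.
    scores = defaultdict(float)
    for new_file in new_review_files:
        for review in past_reviews:  # each past review: {"files": [...], "reviewers": [...]}
            sim = sum(path_similarity(new_file, f) for f in review["files"]) / len(review["files"])
            for reviewer in review["reviewers"]:
                scores[reviewer] += sim
    return sorted(scores, key=scores.get, reverse=True)[:top_k]

# Example usage with hypothetical review history:
past = [
    {"files": ["src/ui/widgets/button.cpp"], "reviewers": ["alice"]},
    {"files": ["src/net/http/client.cpp"], "reviewers": ["bob"]},
]
print(recommend_reviewers(["src/ui/widgets/slider.cpp"], past))  # ['alice', 'bob']

In this toy example, alice is ranked first because the new file shares most of its path components with the file she previously reviewed, matching the intuition that nearby files tend to be reviewed by the same experienced code-reviewers.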
