CORRECT is a code reviewer recommendation tool that:
- Recommends appropriate code reviewers automatically by mining developers' contributions across projects
- Provides recommendation rationales that fit within developers' workflows
- Achieves over 90% accuracy in recommending reviewers based on library and technology experience
- Outperforms an existing technique (RevFinder), achieving 92.15% top-5 accuracy, 85.93% mean precision, and 81.39% mean recall
- Performs similarly on open source projects with 85.20% top-5 accuracy, demonstrating effectiveness for public and private codebases
1. CORRECT: CODE REVIEWER RECOMMENDATION AT GITHUB FOR VENDASTA TECHNOLOGIES
Mohammad Masudur Rahman, Chanchal K. Roy, Jesse Redl$ and Jason A. Collins*
Department of Computer Science, University of Saskatchewan, Canada
Vendasta Technologies$, Canada; Google Inc.*, USA
31st IEEE/ACM International Conference on Automated Software Engineering (ASE 2016), Singapore
2. PEER CODE REVIEW
Code review is a systematic examination of source code for detecting bugs or defects and coding rule violations.
Early bug detection
Prevention of coding rule violations
Enhanced developer skills
3. PULL REQUEST (CODE CHANGES) SUBMISSION AT GITHUB
[Screenshot: GitHub pull request UI, with callouts for the change title, the change description, and the member mention feature]
Whom should I choose?
Well, where there is a will, there is a way!
6. WHAT DO WE NEED?
Recommendation Tool
Recommends appropriate code reviewers
Recommends automatically
Does all heavy lifting (i.e., mining) for the developers.
Provides recommendation rationale
Fits within developer's workflow
Advanced Features
Provides personalized recommendation
Provides optimized performance
Architecture
Platform-independent & scalable
14. LIBRARY EXPERIENCE & TECHNOLOGY EXPERIENCE (ANSWERED: RQ1)
Metric     Library Similarity    Technology Similarity    Combined Similarity
           Top-3      Top-5      Top-3      Top-5          Top-3      Top-5
Accuracy   83.57%     92.02%     82.18%     91.83%         83.75%     92.15%
MRR        0.66       0.67       0.62       0.64           0.65       0.67
MP         65.93%     85.28%     62.99%     83.93%         65.98%     85.93%
MR         58.34%     80.77%     55.77%     79.50%         58.43%     81.39%
[ MP = Mean Precision, MR = Mean Recall, MRR = Mean Reciprocal Rank ]
Both library experience and technology experience are found to be good proxies, each providing over 90% accuracy.
Combined experience provides the maximum performance: 92.15% recommendation accuracy with 85.93% precision and 81.39% recall.
Evaluation results align with exploratory study findings.
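To make these metrics concrete, here is a minimal Python sketch of how Top-K accuracy, MRR, mean precision, and mean recall can be computed over ranked recommendations; the function names and example data are hypothetical, not taken from CORRECT's implementation.

    # Illustrative metric computation for reviewer recommendation.
    # All names and data are hypothetical, not from the CORRECT tool.
    def evaluate(recommendations, ground_truth, k=5):
        # recommendations: ranked reviewer lists, one per pull request
        # ground_truth: sets of actual reviewers, one per pull request
        hits, rr_sum, prec_sum, rec_sum = 0, 0.0, 0.0, 0.0
        for ranked, actual in zip(recommendations, ground_truth):
            top_k = ranked[:k]
            matched = [r for r in top_k if r in actual]
            if matched:
                hits += 1                          # Top-K accuracy: at least one correct reviewer
                first = min(top_k.index(r) for r in matched)
                rr_sum += 1.0 / (first + 1)        # reciprocal rank of the first correct reviewer
            prec_sum += len(matched) / len(top_k)  # precision@K
            rec_sum += len(matched) / len(actual)  # recall@K
        n = len(recommendations)
        return hits / n, rr_sum / n, prec_sum / n, rec_sum / n

    # Example with two hypothetical pull requests:
    recs = [["alice", "bob", "carol", "dan", "eve"],
            ["frank", "grace", "heidi", "ivan", "judy"]]
    truth = [{"bob", "dan"}, {"mallory"}]
    print(evaluate(recs, truth))  # (0.5, 0.25, 0.2, 0.5)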
15. COMPARATIVE STUDY FINDINGS (ANSWERED: RQ2)
CORRECT performs better than the competing technique in all metrics (p-value = 0.003 < 0.05 for Top-5 accuracy).
It performs better both on average and on individual projects.
RevFinder computes PR similarity using source file name and directory path matching.
Metric     RevFinder [18]   CORRECT
           Top-5            Top-5
Accuracy   80.72%           92.15%
MRR        0.65             0.67
MP         77.24%           85.93%
MR         73.27%           81.39%
[ MP = Mean Precision, MR = Mean Recall, MRR = Mean Reciprocal Rank ]
16. COMPARISON ON OPEN SOURCE PROJECTS (ANSWERED: RQ3)
In OSS projects, CORRECT also performs better than the baseline technique.
It achieves 85.20% accuracy with 84.76% precision and 78.73% recall, not significantly different from the earlier results (p-value = 0.239 > 0.05 for precision).
Results for the private and public codebases are quite close.
Metric     RevFinder [18]   CORRECT (OSS)   CORRECT (VA)
           Top-5            Top-5           Top-5
Accuracy   62.90%           85.20%          92.15%
MRR        0.55             0.69            0.67
MP         62.57%           84.76%          85.93%
MR         58.63%           78.73%          81.39%
[ MP = Mean Precision, MR = Mean Recall, MRR = Mean Reciprocal Rank ]
17. SUMMARY
CORRECT: A Recommendation Tool
Recommends appropriate code reviewers
Recommends automatically
Does all heavy lifting (i.e., mining) for the developers.
Provides recommendation rationale
Fits within developer's workflow
Advanced Features
Provides personalized recommendation
Provides optimized performance
Architecture
Platform-independent & scalable
19. THANK YOU!! QUESTIONS?
Masud Rahman (masud.rahman@usask.ca)
CORRECT site (http://www.usask.ca/~masud.rahman/correct)
Acknowledgement: This work is supported by NSERC
20. THREATS TO VALIDITY
Threats to Internal Validity
Skewed dataset: Each of the 10 selected projects is medium-sized (i.e., about 1.1K PRs) except CS.
Threats to External Validity
Limited OSS dataset: Only 6 OSS projects were considered, which is not sufficient for generalization.
Issue of heavy PRs: PRs containing hundreds of files can
make the recommendation slower.
Threats to Construct Validity
Top-K Accuracy: Does the metric represent the effectiveness of the technique? It is widely used in the relevant literature (Thongtanunam et al., SANER 2015).
Editor's Notes
Hello everyone.
My name is Mohammad Masudur Rahman
I am a PhD student from University of Saskatchewan, Canada.
Today, I am going to talk about code reviewer recommendation for Vendasta Technologies.
I work with Dr. Chanchal Roy. The other co-authors of the paper are Jesse Redl from Vendasta, Canada, and Jason Collins from Google, USA.
The focus of my talk is code review.
It is a systematic examination of source code
that identifies defects and coding standard violations in the code.
It helps in early bug detection, and thus reduces cost.
It also ensures code quality by maintaining the coding standards.
And finally, it helps in knowledge dissemination among the developers.
However, in this work, we attempt to identify appropriate code reviewers for a given pull request.
And this is a significant challenge for the developers, as we found from working with the industry.
In GitHub, code changes are submitted as a pull request (PR).
A developer needs to create a pull request to submit the changes, and has to choose appropriate code reviewers there.
Now, this is the UI GitHub provides for submitting a pull request.
Here goes the title, here goes the description. It even allows you to mention a peer.
But the question is, whom should I choose as a code reviewer?
Well, where there is a will, there is an ad-hoc way.
One can directly go to the file system to check for the previous authors who changed a file.
But, here is the reality.
The first file was changed by 9 developers.
The second one was changed by 6 developers.
Now, one can look at those developers, and try to guess their appropriateness.
But, this is NOT a productive idea.
This gets out of hand, and becomes nearly impossible, when multiple changed files and multiple commits are involved.
Code reviewer selection is even more challenging for
novice developers, who are not aware of the skill matrix of other developers, and for
distributed development, where the developers do not meet face to face, let alone know each other's skill sets.
A study also showed that inappropriate assignment of reviewers costs an extra 12 days for bug fixing, on average.
Now, why is it so challenging? Because
this skill is not well defined, and cannot be easily estimated.
Estimating it requires significant mining activities.
So, what do we need to handle this challenge?
We need a recommendation tool
that can recommend appropriate code reviewers automatically
It will do all the heavy lifting for the developers, that is, all the required mining.
It should provide a rationale why a developer is chosen.
It should fit within the existing workflow.
It should provide personalization and optimization features, such as result caching.
The architecture also has to be seamless and scalable.
So, we propose our tool called CORRECT.
It suggests appropriate reviewers based on external libraries included
and specialized technologies used in a pull request submitted for code review.
Now, let's walk through our tool.
Once our tool is installed, it will show as an icon in Google Chrome.
Now, in Vendasta, developers generally create a branch, for example AA-2453, to work on an issue such as a bug fix or a feature request.
Once the work is done, they compare the branch with the develop/master branch
For example this URL is a compare URL, and it shows 1 commit is added where 6 files are changed.
Now, if requested, the tool suggests a ranked list of 5 code reviewers.
It also shows the rationale why a particular developer was suggested as code reviewer.
Once convinced, one can copy them using the copy button and paste them into the pull request body.
This mention will notify the corresponding developers.
Then one can submit the pull request for the review.
Now, let's check our recommendation accuracy against an existing PR.
For example, for PR# 1745 of SR system, our tool suggests 5 code reviewers
And the first two reviewers matched the original reviewers for this PR.
Again, it shows the rationale why the particular developer was suggested.
One can also clear the result, and try another PR.
Our tool also provides several advanced features.
1. Open authentication: The tool can make API requests on behalf of the requesting user. This solves the API invocation limit issue.
For example, 5,000 calls/hour for a developer. This is especially necessary for a company where several people are using the tool at the same time.
It also facilitates recommendation customization. Currently, we provide limited customization.
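As a hedged illustration of the rate-limit point, the sketch below makes an authenticated call to GitHub's real /rate_limit endpoint; the token value is a placeholder, and this is not code from the tool itself.

    # Sketch: an authenticated GitHub API call (token is a placeholder).
    # Authenticated requests get 5,000 calls/hour instead of 60.
    import requests

    token = "ghp_..."  # hypothetical personal access token
    resp = requests.get("https://api.github.com/rate_limit",
                        headers={"Authorization": "token " + token})
    print(resp.json()["resources"]["core"])  # e.g., {'limit': 5000, 'remaining': ...}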
2. Parallel/Optimized processing: We use Java multi-threading to optimize computation and memory consumption.
We also use browser storage and server storage to provide caching facilities.
3. Client-server architecture: We also adopt a scalable and platform-independent architecture. Not only Google Chrome, but any client capable of HTTP call will be able to get the recommendation service.
This is our recommendation algorithm.
Once a new pull request R3 is created, we analyze its commits, then source files, and look for the libraries referred and the specialized technologies used. Thus, we get a library token list and a technology token list.
We combine both lists, and the combined list can be considered a summary of the libraries and technologies for the new pull request.
Now, we consider the latest 10 closed pull requests, and collect their library and technology tokens.
It should be noted that the past requests contain their code reviewers.
Now, we estimate the similarity between the new request and each of the past requests. We use the cosine similarity score between their token lists.
We add that score to the corresponding code reviewers.
This way, finally, we get a list of reviewers, each with a score accumulated over different past reviews.
Then they are ranked, and the top reviewers are recommended.
Thus, we use the pull request similarity score to estimate the relevant expertise of candidate code reviewers.
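The following is a minimal Python sketch of this scoring loop under simplifying assumptions: token lists are modeled as frequency vectors, cosine similarity is computed directly, and the past-PR data structure is a hypothetical stand-in for what the tool actually mines.

    # Illustrative CORRECT-style reviewer scoring (not the tool's actual code).
    from collections import Counter
    from math import sqrt

    def cosine(a, b):
        # cosine similarity between two token-frequency Counters
        dot = sum(a[t] * b[t] for t in a if t in b)
        norm = sqrt(sum(v * v for v in a.values())) * sqrt(sum(v * v for v in b.values()))
        return dot / norm if norm else 0.0

    def recommend(new_tokens, past_prs, k=5):
        # new_tokens: library + technology tokens of the new pull request
        # past_prs: (tokens, reviewers) pairs for recently closed pull requests
        new_vec = Counter(new_tokens)
        scores = Counter()
        for tokens, reviewers in past_prs:
            sim = cosine(new_vec, Counter(tokens))
            for reviewer in reviewers:  # credit every reviewer of the past PR
                scores[reviewer] += sim
        return [r for r, _ in scores.most_common(k)]

    # Hypothetical example: one new PR compared against two closed PRs
    past = [(["numpy", "flask", "redis"], ["alice", "bob"]),
            (["django", "redis"], ["carol"])]
    print(recommend(["flask", "redis", "celery"], past))  # ['alice', 'bob', 'carol']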
Earlier studies analyzed the line change history of source code, the file path similarity of source files, and review comments.
In short, they mostly considered the work experience of a candidate code reviewer within a single project only.
However, some skills span across multiple projects, such as working experience with specific API libraries or specialized technologies.
Also, in an industrial setting, a developer's contributions scatter across different projects within the company codebase.
We thus consider external libraries and APIs included in the changed code and suggest more appropriate code reviewers.
Now, to be technically specific
The state-of-the-art considers two pull requests relevant/similar if they share source code files or directories.
On the other hand, we suggest that two pull requests are relevant/similar if they share the same external libraries and specialized technologies.
That’s the major difference in methodology and our core technical contribution.
This is how we answer the first RQ.
We see that both library similarity and technology similarity are pretty good proxies for code review skills.
Each of them provides over 90% top-5 accuracy.
However, when we combine them, we get the maximum: 92% top-5 accuracy.
The precision and recall are also greater than 80%, which is highly promising according to the relevant literature.
We then compare with the state-of-the-art, RevFinder.
We found that our performance is significantly better than theirs. We get a p-value of 0.003 for top-5 accuracy with the Mann-Whitney U test.
The median accuracy is 95%. The median precision and median recall are between 85% and 90%.
In the case of individual projects, our technique also outperformed the state-of-the-art.
We also experimented with 6 open source projects, and found 85% Top-5 accuracy.
For precision and recall, the results are not significantly different from those on the Vendasta projects.
For example, with precision, we get a p-value of 0.239 which is greater than 0.05.
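For readers unfamiliar with the test, here is a minimal SciPy sketch of a Mann-Whitney U comparison; the per-project accuracy samples below are made up, since the deck reports only the p-values.

    # Illustrative Mann-Whitney U test (accuracy samples are made up).
    from scipy.stats import mannwhitneyu

    correct_acc = [0.95, 0.93, 0.91, 0.94, 0.90]    # hypothetical per-project Top-5 accuracies
    revfinder_acc = [0.82, 0.79, 0.81, 0.80, 0.78]  # hypothetical per-project Top-5 accuracies

    stat, p = mannwhitneyu(correct_acc, revfinder_acc, alternative="two-sided")
    print(stat, p)  # p < 0.05 indicates a statistically significant difference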
To summarize, we propose a code reviewer recommendation tool
Just read them out…
The hands-on session is tomorrow.
You are cordially invited to the hands-on session.
That’s all I have to say today.
Thanks for your time. Now, I am ready to take questions.
There are a few threats to the validity of our findings.
-- The dataset from the VA codebase is a bit skewed. Most of the projects are medium-sized and only one project is big.
-- Also, the number of projects considered from the open source domain is limited.
-- Also, the technique could be slower for big pull requests.