Fri 27 Oct 2023 10:40 - 11:00 at Rhythms 2 - 5A - Code review Chair(s): Eray Tüzün

Background: The existence of toxic conversations in open-source platforms can degrade relationships among software developers and may negatively impact software product quality. To help mitigate this, some initial work has been done to detect toxic comments in the Software Engineering (SE) domain.

Aims: Since automatically classifying an entire text as toxic or non-toxic does not help human moderators to understand the specific reason(s) for toxicity, we worked to develop an explainable toxicity detector for the SE domain.

Method: Our explainable toxicity detector can detect specific spans of toxic content from SE texts, which can help human moderators by automatically highlighting those spans. This toxic span detection model, ToxiSpanSE, is trained with the 19,651 code review (CR) comments with labeled toxic spans. Our annotators labeled the toxic spans within 3,757 toxic CR samples. We explored several types of models, including one lexicon-based approach and five different transformer-based encoders.

Results: After an extensive evaluation of all models, we found that our fine-tuned RoBERTa model achieved the best score with 0.88 $F1$, 0.87 precision, and 0.93 recall for toxic class tokens, providing an explainable toxicity classifier for the SE domain.

Conclusion: Since ToxiSpanSE is the first tool to detect toxic spans in the SE domain, this tool will pave a path to combat toxicity in the SE community.

Fri 27 Oct

Displayed time zone: Central Time (US & Canada) change

10:40 - 12:15
10:40
20m
Full-paper
ToxiSpanSE: An Explainable Toxicity Detection in Code Review Comments
ESEM Technical Papers
Jaydeb Sarker Department of Computer Science, Wayne State University, Sayma Sultana Wayne State University, Steven Wilson , Amiangshu Bosu Wayne State University
Pre-print Media Attached
11:00
20m
Full-paper
Towards Automated Classification of Code Review Feedback to Support Analytics
ESEM Technical Papers
Asif Kamal Turzo Wayne State University, Fahim Faysal , Ovi Poddar , Jaydeb Sarker Department of Computer Science, Wayne State University, Anindya Iqbal Bangladesh University of Engineering and Technology Dhaka, Bangladesh, Amiangshu Bosu Wayne State University
Pre-print Media Attached
11:20
20m
Full-paper
Security Defect Detection via Code Review: A Study of the OpenStack and Qt Communities
ESEM Technical Papers
Jiaxin Yu , Liming Fu Wuhan University, Peng Liang Wuhan University, China, Amjed Tahir Massey University, Mojtaba Shahin RMIT University, Australia
Link to publication Pre-print Media Attached
11:40
15m
Vision and Emerging Results
Exploring the Advances in Identifying Useful Code Review Comments
Emerging Results, Vision and Reflection Papers Track
Sharif Ahmed Boise State University, USA, Nasir Eisty Boise State University
11:55
10m
Journal Early-Feedback
Using a Balanced Scorecard to Identify Opportunities to Improve Code Review Effectiveness: An Industrial Experience Report
ESEM Journal-First Papers
Masum Hasan , Anindya Iqbal Bangladesh University of Engineering and Technology Dhaka, Bangladesh, Mohammad Rafid Ul Islam , Ajm Imtiajur Rahman , Amiangshu Bosu Wayne State University
12:05
10m
Journal Early-Feedback
A Critical Comparison on Six Static Analysis Tools: Detection, Agreement, and Precision
ESEM Journal-First Papers
Valentina Lenarduzzi University of Oulu, Fabiano Pecorelli Jheronimus Academy of Data Science, Nyyti Saarimäki Tampere University, Savanna Lujan Tampere University, Fabio Palomba University of Salerno