Data Augmentation for Improving Emotion Recognition in Software Engineering Communication (ASE 2022 - Research Papers)

Write a Blog >>

Mon 10 - Fri 14 October 2022 Oakland Center, Michigan, United States

Who

Mia Mohammad Imran, Yashasvi Jain, Preetha Chatterjee, Kostadin Damevski

Track

ASE 2022 Research Papers

Time Zone

The program is currently displayed in (GMT-04:00) Eastern Time (US & Canada).

Use conference time zone: (GMT-04:00) Eastern Time (US & Canada)Select other time zone

The GMT offsets shown reflect the offsets at the moment of the conference.

Time Band

By setting a time band, the program will dim events that are outside this time window. This is useful for (virtual) conferences with a continuous program (with repeated sessions).
The time band will also limit the events that are included in the personal iCalendar subscription service.

Display full programSpecify a time band

Save

When

Thu 13 Oct 2022 10:20 - 10:40 at Gold A - Technical Session 24 - Human Aspects Chair(s): Silvia Abrahão

Abstract

Emotions (e.g., Joy, Anger) are prevalent in daily software engineering (SE) activities, and are known to be significant indicators of work productivity (e.g., bug fixing efficiency). Recent studies have shown that directly applying general purpose emotion classification tools to SE corpora is not effective. Even within the SE domain, tool performance degrades significantly when trained on one communication channel and evaluated on another (e.g, StackOverflow vs. GitHub comments). Retraining a tool with channel-specific data takes significant effort since manually annotating large datasets of ground truth data is expensive.

In this paper, we address this data scarcity problem by automatically creating new training data using a data augmentation technique. Based on an analysis of the types of errors made by popular SE-specific emotion recognition tools, we specifically target our data augmentation strategy in order to improve the performance of emotion recognition. Our results show an average improvement of 9.3% in micro F1-Score for three existing emotion classification tools (ESEM-E, EMTk, SEntiMoji) when trained with our best augmentation strategy.

Link to Preprint

https://preethac.github.io/files/ASE_2022.pdf

Mia Mohammad Imran

Virginia Commonwealth University

Yashasvi Jain

Drexel University

Preetha Chatterjee

Drexel University, USA

United States

Kostadin Damevski