Fri 23 Feb 2024 12:30 - 13:00 at Room 2 - R102 - SEIP Session 1

Natural language processing (NLP) models are increasingly built on transformer-based language models initialized with pre-trained parameters. These parameters are learned from the vast amount of text available on the internet and provide a strong initialization through transfer learning, but they also encode harmful prejudices. These prejudices can lead to bias against certain demographics when the models are deployed in production. In this paper we study bias transfer through transfer learning into downstream tasks, analyzing how much bias language models absorb during pre-training and how it transfers into task-specific behavior after fine-tuning. We find that minimizing inherent bias with controlled interventions prior to fine-tuning has little effect on lowering the biased behavior of the resulting classifier. Biases present in the domain-specific dataset appear to be a more plausible explanation for the subsequent biased behavior. However, we also observe that pre-training matters: once the model has been pre-trained, even small changes to co-occurrence rates in the fine-tuning dataset have a significant effect on the performance of the model. The outcomes of our study motivate practitioners to concentrate more on context-specific hazards and dataset quality.
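
To make the notion of co-occurrence rates in a fine-tuning dataset concrete, the sketch below estimates how often simple demographic term groups co-occur with each class label. It is a minimal illustration, not the authors' actual measurement protocol; the term lists, label names, and (text, label) data format are assumptions made for the example.

```python
# Hypothetical sketch: estimating co-occurrence rates between demographic term
# groups and labels in a fine-tuning dataset. Term lists and data format are
# assumptions for illustration only.
from collections import Counter, defaultdict

DEMOGRAPHIC_TERMS = {
    "gender_female": {"she", "her", "woman", "women"},
    "gender_male": {"he", "his", "man", "men"},
}

def cooccurrence_rates(examples):
    """examples: iterable of (text, label) pairs from the fine-tuning set."""
    label_counts = defaultdict(Counter)  # group -> Counter over labels
    group_totals = Counter()             # group -> number of matching examples
    for text, label in examples:
        tokens = set(text.lower().split())
        for group, terms in DEMOGRAPHIC_TERMS.items():
            if tokens & terms:
                label_counts[group][label] += 1
                group_totals[group] += 1
    # Normalize counts to per-group label rates.
    return {
        group: {label: n / group_totals[group] for label, n in counts.items()}
        for group, counts in label_counts.items()
    }

if __name__ == "__main__":
    toy_data = [
        ("she is angry about the delay", "toxic"),
        ("he is happy with the result", "non_toxic"),
        ("the women praised the update", "non_toxic"),
    ]
    print(cooccurrence_rates(toy_data))
```

Comparing these rates before and after a controlled intervention on the fine-tuning data is one plausible way to probe the sensitivity the abstract describes.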
