On the Robustness of Fairness Practices: A Causal Framework for Systematic Evaluation
Machine learning (ML) algorithms are increasingly deployed to make critical decisions in socioeconomic applications such as finance, criminal justice, and autonomous driving. However, due to their data-driven and pattern-seeking nature, ML algorithms may develop decision logic that disproportionately distributes opportunities, benefits, resources, or information among different population groups, potentially harming marginalized communities. In response to such fairness concerns, the software engineering and ML communities have made significant efforts to establish the best practices for creating fair ML software. These include fairness interventions for training ML models such as including sensitive features, selecting non-sensitive attributes, and applying bias mitigators. But how reliably can software professionals tasked with developing data-driven systems depend on these recommendations? And how well do these practices generalize in the presence of faulty labels, missing data, or distribution shifts? These questions form the core theme of this paper.
We present a testing tool and technique based on causality theory to assess the robustness of best practices in fair ML software development. Given a practice—specified as a first-order logic property—and a socio-critical dataset that satisfies the property, our goal is to search for neighborhood datasets to determine whether the property continues to hold. This process is akin to testing the robustness of a neural network for image classification, except that the “image" is an entire dataset, and its “neighbors" are datasets in which certain causal hypotheses are altered. Since computing neighborhood datasets while accounting for various factors—such as noise, faulty labeling, and demographic shifts—is challenging, we utilize causal graph representations of the dataset and leverage a search algorithm to explore equivalent causal graphs to generate datasets. Our results across various fairness-sensitive tasks, derived from prevalent fairness-sensitive applications, identify best practices that preserve robustness under the varying factors.
Fri 17 AprDisplayed time zone: Brasilia, Distrito Federal, Brazil change
14:00 - 15:30 | Testing and Analysis 19Research Track at Oceania II Chair(s): Nasir Eisty University of Tennessee-Knoxville | ||
14:00 15mTalk | E-Test: E'er-Improving Test Suites Research Track Ketai Qiu USI Università della Svizzera Italiana, Luca Di Grazia University of St. Gallen, Leonardo Mariani University of Milano-Bicocca, Mauro Pezze Università della Svizzera italiana (USI) and Università degli Studi di Milano Bicocca Pre-print | ||
14:15 15mTalk | AssertFlip: Reproducing Bugs via Inversion of LLM-Generated Passing Tests Research Track Lara Khatib University of Waterloo, Noble Saji Mathews University of Waterloo, Canada, Mei Nagappan University of Waterloo | ||
14:30 15mTalk | Boosting Gas Revenues of Ethereum Miners Research Track Togzhan Barakbayeva HKUST, Soroush Farokhnia Hong Kong University of Science and Technology, Amir K. Goharshady University of Oxford, Sergei Novozhilov The Hong Kong University of Science and Technology | ||
14:45 15mTalk | LLM4Perf: Large Language Models Are Effective Samplers for Multi-Objective Performance Modeling Research Track Xin Wang The Hong Kong University of Science and Technology (Guangzhou), Zhenhao Li York University, Zishuo Ding The Hong Kong University of Science and Technology (Guangzhou) Pre-print | ||
15:00 15mTalk | On the Robustness of Fairness Practices: A Causal Framework for Systematic Evaluation Research Track Verya Monjezi University of Illinois Chicago, Ashish Kumar Pennsylvania State University, Ashutosh Trivedi University of Colorado Boulder, Gang (Gary) Tan Pennsylvania State University, Saeid Tizpaz-Niari University of Illinois Chicago | ||
15:15 15mTalk | Characterizing Regression Bug‑Inducing Changes and Improving LLM‑Based Regression Bug Detection Research Track Xuezhi Song Fudan University, Yijian Wu Fudan University, Bihuan Chen Fudan University, Zhengjie Lu Fudan University, Shuning Liu Fudan University, Xin Peng Fudan University Pre-print Media Attached | ||