ICSME 2025
Sun 7 - Fri 12 September 2025 Auckland, New Zealand
Thu 11 Sep 2025 14:10 - 14:20 at Case Room 3 260-055 - Session 9 - Testing 3 Chair(s): Sigrid Eldh

Test smells, suboptimal patterns in test code, impair software maintainability and reliability, especially in resource-constrained open-source Python projects. While detection tools such as PyNose identify python-specific test smells, prioritizing them for refactoring remains a challenge due to the lack of test-specific frameworks. This study proposes a metric-driven approach that integrates Change Proneness (CP) and Fault Proneness (FP) metrics, computed via Spearman’s rank correlation, to quantify maintenance and reliability risks across 15 test smells in 52 open-source Python projects. Complementing this, a survey of 45 developers captures subjective severity perceptions. By applying Martin Fowler’s Technical Debt Quadrant, we classify smells based on empirical risk and developer insights into four categories, enabling better prioritization. Out of the 15 analyzed smells, Conditional Test Logic, Duplicate Assert, Obscure In-Line Setup, and Redundant Assertion belong to the highest-priority category for refactoring. These smells are characterized by both high empirical risk and strong developer agreement. This integrated framework advances test smell prioritization by combining data-driven analysis with practitioner perspectives, facilitating efficient refactoring decisions and improved test suite quality.

Thu 11 Sep

Displayed time zone: Auckland, Wellington change

13:30 - 15:00
Session 9 - Testing 3Journal First Track / NIER Track / Tool Demonstration Track / Research Papers Track / Registered Reports at Case Room 3 260-055
Chair(s): Sigrid Eldh Ericsson AB, Mälardalen University, Carleton University
13:30
15m
Full-paper
Metamorphic Testing of Large Language Models for Natural Language Processing
Research Papers Track
Steven Cho The University of Auckland, New Zealand, Stefano Ruberto JRC European Commission, Valerio Terragni University of Auckland
Pre-print
13:45
15m
Onweer: Automated Resilience Testing through Fuzzing
Research Papers Track
Gilles Coremans Vrije Universiteit Brussel, Coen De Roover Vrije Universiteit Brussel
Pre-print
14:00
10m
Generating Highly Structured Test Inputs Leveraging Constraint-Guided Graph Refinement
Registered Reports
Zhaorui Yang University of California, Riverside, Yuxin Qiu University of California at Riverside, Haichao Zhu Meta, Qian Zhang University of California at Riverside
14:10
10m
Prioritizing Test Smells: An Empirical Evaluation of Quality Metrics and Developer Perceptions
NIER Track
Md Arif Hasan University of Dhaka, Bangladesh, Toukir Ahammed Institute of Information Technology, University of Dhaka
14:20
10m
LLMShot: Reducing snapshot testing maintanence via LLMs
NIER Track
Ergün Batuhan Kaynak Bilkent University, Mayasah Lami Bilkent University, Sahand Moslemi Yengejeh Bilkent University, Anil Koyuncu Bilkent University
Pre-print
14:30
15m
Combinatorial Transition Testing in Dynamically Adaptive Systems: Implementation and Test Oracle
Journal First Track
Pierre Martou UCLouvain / ICTEAM, Benoît Duhoux Université catholique de Louvain, Belgium, Kim Mens Université catholique de Louvain, ICTEAM institute, Belgium, Axel Legay Université Catholique de Louvain, Belgium
14:45
10m
LLMLOOP: Improving LLM-Generated Code and Tests through Automated Iterative Feedback Loops
Tool Demonstration Track
Ravin Ravi University of Auckland, Dylan Bradshaw University of Auckland, Stefano Ruberto JRC European Commission, Gunel Jahangirova King's College London, Valerio Terragni University of Auckland
Pre-print