The Research Papers track of ICST 2025 invites high-quality submissions in all areas of software testing, verification, and validation. Papers for the research track should present novel and original work that advances the state-of-the-art. Case studies and empirical research papers are also welcome. For more information, check out the Call for Papers.
Wed 2 AprDisplayed time zone: Amsterdam, Berlin, Bern, Rome, Stockholm, Vienna change
10:30 - 11:00 | |||
10:30 30mCoffee break | Break Social |
11:00 - 12:30 | LLMs in TestingResearch Papers / Industry / Journal-First Papers at Aula Magna (AM) Chair(s): Phil McMinn University of Sheffield | ||
11:00 15mTalk | AugmenTest: Enhancing Tests with LLM-driven Oracles Research Papers Shaker Mahmud Khandaker Fondazione Bruno Kessler, Fitsum Kifetew Fondazione Bruno Kessler, Davide Prandi Fondazione Bruno Kessler, Angelo Susi Fondazione Bruno Kessler Pre-print | ||
11:15 15mTalk | Impact of Large Language Models of Code on Fault Localization Research Papers Suhwan Ji Yonsei University, Sanghwa Lee Kangwon National University, Changsup Lee Kangwon National University, Yo-Sub Han Yonsei University, Hyeonseung Im Kangwon National University, South Korea | ||
11:30 15mTalk | An Analysis of LLM Fine-Tuning and Few-Shot Learning for Flaky Test Detection and Classification Research Papers | ||
11:45 15mTalk | Evaluating the Effectiveness of LLMs in Detecting Security Vulnerabilities Research Papers Avishree Khare , Saikat Dutta Cornell University, Ziyang Li University of Pennsylvania, Alaia Solko-Breslin University of Pennsylvania, Mayur Naik UPenn, Rajeev Alur University of Pennsylvania | ||
12:00 15mTalk | FlakyFix: Using Large Language Models for Predicting Flaky Test Fix Categories and Test Code Repair Journal-First Papers Sakina Fatima University of Ottawa, Hadi Hemmati York University, Lionel Briand University of Ottawa, Canada; Lero centre, University of Limerick, Ireland | ||
12:15 15mTalk | Integrating LLM-based Text Generation with Dynamic Context Retrieval for GUI Testing Industry Juyeon Yoon Korea Advanced Institute of Science and Technology, Seah Kim Samsung Research, Somin Kim Korea Advanced Institute of Science and Technology, Sukchul Jung Samsung Research, Shin Yoo KAIST |
14:00 - 15:30 | Test OracleJournal-First Papers / Research Papers / Industry at Aula Magna (AM) Chair(s): Mike Papadakis University of Luxembourg | ||
14:00 15mTalk | Differential Testing of Concurrent Classes Research Papers Valerio Terragni University of Auckland, Shing-Chi Cheung Hong Kong University of Science and Technology | ||
14:15 15mTalk | Hybrid Equivalence/Non-Equivalence Testing Research Papers Laboni Sarker University of California at Santa Barbara, Tevfik Bultan University of California at Santa Barbara | ||
14:30 15mTalk | Metamorphic Testing for Pose Estimation Systems Research Papers Matias Duran Trinity College Dublin, Ireland, Thomas Laurent Lero@Trinity College Dublin, Ellen Rushe School of Computing Dublin City University, Anthony Ventresque Trinity College Dublin & Lero, Ireland | ||
14:45 15mTalk | Mutation-Guided Metamorphic Testing of Optimality in AI Planning Journal-First Papers Quentin Mazouni Simula Research Laboratory, Arnaud Gotlieb Simula Research Laboratory, Helge Spieker Simula Research Laboratory, Mathieu Acher University of Rennes - Inria - CNRS - IRISA, Benoit Combemale University of Rennes, Inria, CNRS, IRISA Link to publication DOI | ||
15:00 15mTalk | LLMs in the Heart of Differential Testing: A Case Study on a Medical Rule Engine Industry Erblin Isaku Simula Research Laboratory, and University of Oslo (UiO), Christoph Laaber Simula Research Laboratory, Hassan Sartaj Simula Research Laboratory, Shaukat Ali Simula Research Laboratory and Oslo Metropolitan University, Thomas Schwitalla Cancer Registry of Norway, Jan F. Nygård Cancer Registry of Norway |
15:30 - 16:00 | |||
15:30 30mCoffee break | Break Social |
16:00 - 17:00 | Test Case Selection, Prioritisation, ReductionResearch Papers / Industry at Aula Magna (AM) Chair(s): Andrea Stocco Technical University of Munich, fortiss | ||
16:00 15mTalk | The Impact of List Reduction for Language Agnostic Test Case Reducers Research Papers Tobias Heineken Friedrich-Alexander-Universität Erlangen-Nürnberg, Michael Philippsen Friedrich-Alexander Universität Erlangen-Nürnberg (FAU), Programming Systems Group | ||
16:15 15mTalk | RustyRTS: Regression Test Selection for Rust Research Papers Simon Hundsdorfer Technical University of Munich, Roland Würsching Technical University of Munich, Alexander Pretschner TU Munich | ||
16:30 15mTalk | ML-Based Test Case Prioritization: A Research and Production Perspective in CI Environments Industry Md Asif Khan Ontario Tech University, Akramul Azim Ontario Tech University, Ramiro Liscano Ontario Tech University, Kevin Smith International Business Machines Corporation (IBM), Yee-Kang Chang International Business Machines Corporation (IBM), Gkerta Seferi International Business Machines Corporation (IBM), Qasim Tauseef International Business Machines Corporation (IBM) | ||
16:45 15mTalk | Evaluating Machine Learning-Based Test Case Prioritization in the Real World: An Experiment with SAP HANA Industry |
17:00 - 18:00 | |||
17:00 60mSocial Event | Open Steering Committee Meeting Social |
Thu 3 AprDisplayed time zone: Amsterdam, Berlin, Bern, Rome, Stockholm, Vienna change
10:30 - 11:00 | |||
10:30 30mCoffee break | Break Social |
11:00 - 12:30 | Testing ML Systems and Fault LocalisationIndustry / Research Papers at Aula Magna (AM) Chair(s): Atif Memon Apple | ||
11:00 15mTalk | On Accelerating Deep Neural Network Mutation Analysis by Neuron and Mutant Clustering Research Papers Pre-print | ||
11:15 15mTalk | Benchmarking Image Perturbations for Testing Automated Driving Assistance Systems Research Papers Stefano Carlo Lambertenghi Technische Universität München, fortiss GmbH, Hannes Leonhard Technical University of Munich, Andrea Stocco Technical University of Munich, fortiss Pre-print | ||
11:30 15mTalk | Turbulence: Systematically and Automatically Testing Instruction-Tuned Large Language Models for Code Research Papers Shahin Honarvar Imperial College London, Mark van der Wilk University of Oxford, Alastair F. Donaldson Imperial College London | ||
11:45 15mTalk | Taming Uncertainty for Critical Scenario Generation in Automated Driving Industry Selma Grosse DENSO Automotive GmbH, Dejan Nickovic Austrian Institute of Technology, Cristinel Mateis AIT Austrian Institute of Technology GmbH, Alessio Gambi Austrian Institute of Technology (AIT), Adam Molin DENSO AUTOMOTIVE | ||
12:00 15mTalk | Multi-Project Just-in-Time Software Defect Prediction Based on Multi-Task Learning for Mobile Applications Research Papers Feng Chen Chongqing University of Posts and Telecommunications, Ke Yuxin Chongqing University of Posts and Telecommunications, Liu Xin Chongqing University of Posts and Telecommunications, Wei Qingjie Chongqing University of Posts and Telecommunications | ||
12:15 15mTalk | Fault Localization via Fine-tuning Large Language Models with Mutation Generated Stack Traces Industry Neetha Jambigi University of Cologne, Bartosz Bogacz SAP SE, Moritz Mueller SAP SE, Thomas Bach SAP, Michael Felderer German Aerospace Center (DLR) & University of Cologne |
11:00 - 12:30 | Testing Tools and Data ShowcaseTesting Tools and Data Showcase at Room A1 Chair(s): Domenico Bianculli University of Luxembourg | ||
11:00 15mTalk | Rocket: A System-Level Fuzz-Testing Framework for the XRPL Consensus Algorithm Testing Tools and Data Showcase Wishaal Kanhai , Ivar van Loon , Yuraj Mangalgi , Thijs van der Valk , Lucas Witte Delft University of Technology, Annibale Panichella Delft University of Technology, Mitchell Olsthoorn Delft University of Technology, Burcu Kulahcioglu Ozkan Delft University of Technology | ||
11:15 15mTalk | RESTgym: A Flexible Infrastructure for Empirical Assessment of Automated REST API Testing Tools Testing Tools and Data Showcase Davide Corradini University of Luxembourg, Michele Pasqua University of Verona, Mariano Ceccato University of Verona | ||
11:30 15mTalk | ViMoTest: A Tool to Specify ViewModel-based GUI Test Scenarios using Projectional Editing Testing Tools and Data Showcase Mario Fuksa University of Stuttgart, Sandro Speth Institute of Software Engineering, University of Stuttgart, Steffen Becker University of Stuttgart | ||
11:45 15mTalk | E2E-Loader: A Tool to Generate Performance Tests from End-to-End GUI-level Tests Testing Tools and Data Showcase Sergio Di Meglio Università degli Studi di Napoli Federico II, Luigi Libero Lucio Starace Università degli Studi di Napoli Federico II, Sergio Di Martino Università degli Studi di Napoli Federico II | ||
12:00 15mTalk | AMBER: AI-enabled Java Microbenchmark Harness Testing Tools and Data Showcase Antonio Trovato University of Salerno, Luca Traini University of L'Aquila, Federico Di Menna University of L'Aquila, Dario Di Nucci University of Salerno Pre-print | ||
12:15 15mTalk | Codehacks: A Dataset of Adversarial Tests For Competitive Programming Problems Obtained From Codeforces Testing Tools and Data Showcase |
Fri 4 AprDisplayed time zone: Amsterdam, Berlin, Bern, Rome, Stockholm, Vienna change
09:00 - 10:30 | Awards & Most Influential PaperSocial at Aula Magna (AM) & A1 (streaming) Chair(s): Aldeida Aleti Monash University, Ali Mesbah University of British Columbia | ||
09:00 30mAwards | Awards Social | ||
09:30 60mAwards | Most Influential Paper Social |
10:30 - 11:00 | |||
10:30 30mCoffee break | Break Social |
11:00 - 12:30 | Automated TestingIndustry / Research Papers / Journal-First Papers / Education at Aula Magna (AM) Chair(s): Cristian Cadar Imperial College London | ||
11:00 15mTalk | Testing Practices, Challenges, and Developer Perspectives in Open-Source IoT Platforms Research Papers Daniel Rodriguez-Cardenas William & Mary, Safwat Ali Khan George Mason University, Prianka Mandal William & Mary, Adwait Nadkarni William & Mary, Kevin Moran University of Central Florida, Denys Poshyvanyk William & Mary Pre-print | ||
11:15 15mTalk | Many-Objective Neuroevolution for Testing Games Research Papers Patric Feldmeier University of Passau, Katrin Schmelz University of Passau, Gordon Fraser University of Passau Pre-print | ||
11:30 15mTalk | Black-Box Testing for Practitioners Education Matthias Hamburg IEEE Computer Society; International Software Testing Qualifications Board, Adam Roman Jagiellonian University, Faculty of Mathematics and Computer Science; International Software Testing Qualifications Board | ||
11:45 15mTalk | CUBETESTERAI: Automated JUnit Test Generation using the LLaMA Model Industry Daniele Gorla Department of Computer Science, Sapienza University of Rome, Shivam Kumar , Pietro Nicolaus Roselli Lorenzini , Alireza Alipourfaz | ||
12:00 15mTalk | Can Search-Based Testing with Pareto Optimization Effectively Cover Failure-Revealing Test Inputs? Journal-First Papers Lev Sorokin Technische Universität München, Germany, Damir Safin fortiss, Shiva Nejati University of Ottawa | ||
12:15 15mTalk | [prerecorded] ADGE: Automated Directed GUI Explorer for Android Applications Research Papers Yue Jiang Institute of Information Engineering, Chinese Academy of Sciences, Beijing, China, Xiaobo Xiang Singular Security Lab, Beijing, China, Qingli Guo Institute of Information Engineering, Chinese Academy of Sciences, Qi Gong Key Laboratory of Network Assessment Technology, Institute of Information Engineering, Chinese Academy of Sciences, China, Xiaorui Gong Institute of Information Engineering, Chinese Academy of Science |
14:00 - 15:28 | Advances in Software TestingResearch Papers / Short Papers, Vision and Emerging Results / Industry / Education at Room A1 Chair(s): Annibale Panichella Delft University of Technology | ||
14:00 15mTalk | Code, Test and Coverage Evolution in Mature Software Systems: Changes over the Past Decade Research Papers | ||
14:15 15mTalk | Coverage Metrics for T-Wise Feature Interactions Research Papers Sabrina Böhm University of Ulm, Germany, Tim Jannik Schmidt University of Ulm, TU Braunschweig, Germany, Sebastian Krieter TU Braunschweig, Germany, Tobias Pett Karlsruhe Institute for Technology (KIT), Germany, Thomas Thüm TU Braunschweig, Germany, Malte Lochau University of Siegen | ||
14:30 15mTalk | LLM-based Labelling of Recorded Automated GUI-based Test cases Industry Diogo Buarque Franzosi Blekinge Institute of Technology, Emil Alégroth Blekinge Institute of Technology, Maycel Isaac Synteda | ||
14:45 15mTalk | A Tool-assisted Training Approach for Empowering Localization and Internationalization Testing Proficiency Education Maria Couto Federal University of Pernambuco, Breno Miranda Centro de Informática at Universidade Federal de Pernambuco, Kiev Gama Universidade Federal de Pernambuco | ||
15:00 7mShort-paper | Pre-trained Models for Bytecode Instructions Short Papers, Vision and Emerging Results Donggyu Kim Handong Global University, Taemin Kim Handong Global University, Jiho Shin York University, Song Wang York University, Heeyoul Choi , Jaechang Nam Handong Global University Pre-print | ||
15:07 7mShort-paper | EnCus: Customizing Search Space for Automated Program Repair Short Papers, Vision and Emerging Results Seongbin Kim , Sechang Jang , Jindae Kim Seoul National University of Science and Technology, Jaechang Nam Handong Global University Pre-print | ||
15:14 7mTalk | Test Generation from Use Case Specifications for IoT Systems: Custom, LLM-Based, and Hybrid Approaches Short Papers, Vision and Emerging Results Zacharie Chenail-Larcher École de technologie supérieure (ÉTS), Jean Baptiste Minani Concordia University, Montreal, Canada, Naouel Moha École de Technologie Supérieure (ETS) |
15:30 - 16:00 | |||
15:30 30mCoffee break | Break Social |
16:00 - 16:30 | |||
16:00 30mSocial Event | Closing Social |
Accepted Papers
Call for Papers
ICST 2025 invites high-quality submissions in all software testing, verification, and validation areas. Topics of interest include, but are not limited to, the following:
- AI-assisted software testing
- Automated test generation
- Formal verification
- Manual and social aspects of the software testing process
- Replications, empirical studies, case studies, experience reports
- Software reliability
- Static and dynamic analysis
- Test automation
- Test-driven development and Agile testing
- Testability, test design, and adequacy criteria
- Testing AI-based systems
- Testing and development processes
- Testing, debugging, and repair tools
- Testing education
- Testing in specific domains, such as mobile, web, embedded/cyber-physical systems, concurrent, distributed, cloud, GUI, context-aware, IoT, real-time systems, video games, augmented/virtual reality applications
- Testing of non-functional properties such as security, accessibility, and performance
- Theory of software testing
- Validation and user acceptance techniques
Papers for the research track should present novel and original work that advances the state-of-the-art. Case studies, replication studies, and empirical research papers are also welcome.
Papers with a strong industrial/practical component and focus more on impact than (technical) novelty are encouraged to consider the industry track instead.
Submission Format
Full Research papers must conform to the two-column IEEE conference publication format, and not exceed 10 pages, including all text, figures, tables, and appendices; two additional pages containing only references are permitted. It must conform to the IEEE Conference Proceedings Formatting Guidelines (use the letter format template and conference option). The ICST 2025 research track only accepts full research papers. Short papers are not accepted to the research track.
Authors must comply with the IEEE Plagiarism Guidelines. In particular, a submission must not have been published elsewhere and must not be under review elsewhere while under review for ICST. The submission must also comply with the IEEE Policy on Authorship.
Double-blind Reviewing
Each submission will be reviewed by at least three members of the ICST 2025 Program Committee. We employ a double-blind reviewing process. No submission may reveal its authors’ identities. Authors must make every effort to honour the double-anonymous review process. In particular, the authors’ names must be omitted from the submission, and references to their prior work should be in the third person. All artifacts, such as replication packages and tools, associated with the submission must also be anonymized. Any submission that does not comply with the above requirements may be rejected by the PC Chairs without further review. Authors with further questions on double-blind reviewing are encouraged to contact the Program Chairs by email.
Submissions
Submissions to the Research Papers Track that meet the above requirements can be made via HotCRP at https://icst2025.hotcrp.com/. If a submission is accepted, at least one author of the paper is required to register for ICST 2025 and present the paper. We expect that the conference will be in-person.
Replication Material
Submissions must supply all information needed to replicate the results and therefore are expected to include or point to an anonymized replication package with the necessary software, data, and instructions. The program committee may consult these packages to resolve open issues. If a paper does not come with a replication package, authors should comment on its absence.