AST 2026 Program
This program is tentative and subject to change.
Mon 13 AprDisplayed time zone: Brasilia, Distrito Federal, Brazil change
09:00 - 10:30 | Session 1: Opening & Keynote 1AST 2026 at Oceania VI Chair(s): Markus Borg CodeScene, Breno Miranda Federal University of Pernambuco, Ana Paiva INESC TEC, Faculty of Engineering, University of Porto, Andy Zaidman TU Delft 9:00 — Opening of AST 2026 by Organizers9:20 — Keynote by Gunel Jahangirova, King’s College LondonTitle: Deep Learning Fault Localisation and Repair: Benchmarks, Limitations, and the Role of LLMs Abstract: As Deep Learning (DL) systems become increasingly pervasive in safety-critical and high-impact domains, the need for effective techniques to test, localise, and repair faults in Deep Neural Networks (DNNs) has never been greater. Over the past few years, numerous fault localisation (FL) and repair approaches have been proposed, leveraging both static and dynamic analyses, as well as rule-based heuristics. However, a fundamental question remains: how effective and reliable are these techniques in practice? In this talk, I will present a comprehensive empirical investigation into the current state of fault localisation and repair for DL systems. First, I will discuss a large-scale comparative evaluation of state-of-the-art FL techniques, conducted on a benchmark comprising both real-world faults collected from bug reporting platforms and faults generated via mutation testing. Our findings reveal that current techniques struggle to achieve strong and consistent performance when evaluated against a single human-defined ground truth, raising concerns about how effectiveness is currently assessed. Next, I will examine the broader ecosystem of DL fault localisation and repair techniques, highlighting their strengths and limitations. I will then present an empirical study investigating whether Large Language Models (LLMs) can effectively localise and repair faults in DL systems. Our evaluation shows that LLMs demonstrate strong performance compared to existing approaches, suggesting that they may offer a promising direction for advancing automated DL debugging. Finally, I will address a critical but often overlooked issue: the realism and reproducibility of existing DL fault benchmarks that are used to evaluate DL faults localisation and repair approaches. Through a manual analysis of hundreds of reported faults across widely used benchmarks, we find that only a limited subset satisfies strong realism criteria, and reproducibility remains a significant challenge. These findings raise important concerns about current evaluation practices and underscore the need for more rigorous assessment methodologies. Bio: Gunel Jahangirova is a Lecturer (Assistant Professor) at King’s College London (KCL), United Kingdom. Prior to joining KCL, she was a Postdoctoral Researcher at Università della Svizzera Italiana (USI) in Lugano, Switzerland. She obtained her PhD through a joint programme between Fondazione Bruno Kessler (FBK) in Trento, Italy, and University College London (UCL), UK. Her research focuses on the automatic generation and evaluation of test oracles, error propagation in software systems, testing of deep learning systems, oracle design and quality metrics for autonomous vehicles, and the application of artificial intelligence to software engineering tasks. | ||
09:00 45mTalk | Session 1: Opening & Keynote AST 2026 Markus Borg CodeScene, Breno Miranda Federal University of Pernambuco, Ana Paiva INESC TEC, Faculty of Engineering, University of Porto, Andy Zaidman TU Delft, Gunel Jahangirova King's College London | ||
11:00 - 12:30 | Session 2: AI for Automated Software TestingAST 2026 at Oceania VI Chair(s): Ana Paiva INESC TEC, Faculty of Engineering, University of Porto | ||
11:00 30mTalk | LLAMAFUZZ: Large Language Model Enhanced Greybox Fuzzing AST 2026 Hongxiang Zhang University of California, Davis, Yuyang Rong UC Davis, Yifeng He University of California at Davis, USA, Hao Chen University of California at Davis Pre-print Media Attached | ||
11:30 30mTalk | REST-at: An LLM-Based Tool for Automating Traceability between Requirements and Test Cases AST 2026 Nicole Leon-Quinstedt University of Gothenburg, Bao Lindgren University of Gothenburg, Mert Yurdakul Test Scouts Sweden AB, Francisco Gomes de Oliveira Neto Chalmers | University of Gothenburg | ||
12:00 30mTalk | Testing Framework Migration with Large Language Models AST 2026 Altino Alves Júnior UFMG, João Eduardo Montandon Universidade Federal de Minas Gerais (UFMG), Andre Hora UFMG Pre-print | ||
14:00 - 15:30 | Session 3: Test Case Generation and FuzzingAST 2026 at Oceania VI Chair(s): Cristian Augusto University of Oviedo | ||
14:00 30mTalk | Improving Deep Learning Library Testing with Machine Learning AST 2026 Facundo Molina Complutense University of Madrid, M M Abid Naziri North Carolina State University, Feiran Qin North Carolina State University, Alessandra Gorla IMDEA Software Institute, Marcelo d'Amorim North Carolina State University | ||
14:30 30mTalk | Understanding on the Edge: LLM-generated Boundary Test Explanations AST 2026 Sabina Akbarova Chalmers University of Technology, Felix Dobslaw Mid Sweden University, Robert Feldt Chalmers | University of Gothenburg Pre-print | ||
15:00 30mTalk | Search-Based Fuzzing For RESTful APIs That Use MongoDB AST 2026 Hernan Ghianni University of Buenos Aires, Man Zhang Beihang University, China, Juan Pablo Galeotti University of Buenos Aires, Andrea Arcuri Kristiania University College and Oslo Metropolitan University | ||
16:00 - 17:30 | Session 4: Test Automation and the Software ProcessAST 2026 at Oceania VI Chair(s): Phil McMinn University of Sheffield | ||
16:00 30mTalk | A Framework for Similarity-based and Resource-aware Orchestration of End-to-End Test Cases AST 2026 Cristian Augusto University of Oviedo, Antonia Bertolino Gran Sasso Science Institute, Guglielmo De Angelis CNR-IASI, Claudio de la Riva University of Oviedo, Francesca Lonetti CNR-ISTI, Jesús Morán University of Oviedo | ||
16:30 30mTalk | Understanding Bug-Reproducing Tests: A First Empirical Study AST 2026 Pre-print Media Attached | ||
17:00 30mTalk | Exploring Mocking Techniques for Managing External Dependencies in Service-Based Systems: A Mapping Study AST 2026 Benedito Fernando Albuquerque de Oliveira Federal University of Pernambuco, Fernando Castor University of Twente, Leo Fernandes Federal Institute of Alagoas (IFAL), Samuel Amorim IFAL/Brazil | ||
Tue 14 AprDisplayed time zone: Brasilia, Distrito Federal, Brazil change
09:00 - 10:30 | Session 5: Keynote 2AST 2026 at Oceania VI Chair(s): Breno Miranda Federal University of Pernambuco 9:00 — Keynote by Marcelo José Ruv Lemes, EMBRAERTitle: Verification of Embedded Software for Aeronautical Applications Abstract: The development of embedded software for aeronautical applications is carried out in a highly regulated environment, with one of the main activities being the verification. Software verification in this context encompasses reviews, analyses, and testing. This presentation aims to provide an overview of verification activities for embedded software for aeronautical applications, placing them within the broader context of aircraft verification. Bio: Marcelo Lemes holds a degree in Data Processing Technology and Mathematics from the University of Taubaté. He also holds a Master degree in Software Engineering from the Aeronautics Institute of Technology and a Doctorate in Digital Systems from the Polytechnic School of the University of São Paulo. He worked for 12 years at the Institute of Aeronautics and Space (IAE) of the Aerospace Technical Center (CTA), spending most of the time involved in the development of embedded software for the Brazilian Satellite Launch Vehicle (VLS). Since 1997, he has worked at EMBRAER involved with development and certification of embedded software applications. He currently works alongside the company’s Chief Engineer, coordinating embedded software activities for the company. | ||
09:00 90mKeynote | Session 5: Keynote 2 AST 2026 Breno Miranda Federal University of Pernambuco | ||
11:00 - 12:30 | Session 6: Testing Around the WorldAST 2026 at Oceania VI Chair(s): Hokeun Kim Arizona State University | ||
11:00 30mTalk | Understanding and Detecting Platform-Specific Violations in Android Auto Apps AST 2026 Pre-print Media Attached | ||
11:30 30mTalk | A Unified Benchmark for Out-of-Distribution Detection for Autonomous Driving Systems AST 2026 Xiangyu Li SeysoAI, Jingyu ZHANG Hong Kong Metropolitan University, Jacky Keung City University of Hong Kong, Xiaoxue Ma Hong Kong Metropolitan University, Yihan Liao City University of Hong Kong Pre-print Media Attached | ||
12:00 30mTalk | HYDRA: A Hybrid Heuristic-Guided Deep Representation Architecture for Predicting Latent Zero-Day Vulnerabilities in Patched Functions AST 2026 Mohammad Farhad University of Louisiana at Lafayette, Sabbir Rahman University of Louisiana at Lafayette, Shuvalaxmi Dass University of Louisiana at Lafayette Pre-print Media Attached | ||
14:00 - 15:30 | |||
14:00 30mTalk | L-SCALE: Locality-Sensitive Coverage for Automata LEarning AST 2026 Mark Giraud Fraunhofer IOSB, Bastian Engel Fraunhofer IOSB, Lea Nasarek Fraunhofer IOSB, Yannis Storrer Fraunhofer IOSB, Philipp Takacs Fraunhofer IOSB, Leon Philipp Wittemund Fraunhofer IOSB | ||
14:30 30mTalk | APITestGenie: Generating Web API Tests from Requirements and API Specifications with LLMs AST 2026 André Pereira Deloitte and Faculty of Engineering, University of Porto, Bruno Lima LIACC, Faculty of Engineering, University of Porto, João Pascoal Faria Faculty of Engineering, University of Porto and INESC TEC Pre-print | ||
15:00 30mTalk | Software Testing Education in the LLM Era: Insights and Emerging Theory AST 2026 Samhitha Dwarakanath Pennsylvania State University, Nathalia Nascimento Pennsylvania State University, Everton Guimaraes Pennsylvania State University | ||
16:00 - 17:30 | |||
16:00 30mTalk | ACT: Automated CPS Testing for Open-Source Robotic Platforms AST 2026 Aditya A. Krishnan Arizona State University, Donghoon Kim Arkansas State University, Hokeun Kim Arizona State University DOI Pre-print | ||
16:30 30mTalk | From Logs to Lessons: An Exploration of LLM-based Log Summarization for Debugging Automotive Software AST 2026 Anton Ekström Chalmers University of Technology, Hampus Rhedin Stam Chalmers University of Technology, Francisco Gomes de Oliveira Neto Chalmers | University of Gothenburg, Gregory Gay Chalmers University of Technology and University of Gothenburg, Sabina Edenlund Volvo Cars AB | ||
17:00 30mTalk | Separating Valid from Invalid Inputs for a Digital Aircraft Design Tool AST 2026 Malte Christian Struck German Aerospace Center (DLR), Institute of Software Technology, Andreas Schuster German Aerospace Center (DLR), Institute of Lightweight Systems, Alexander Weinert German Aerospace Center (DLR) Institute for Software Technology, Michael Felderer German Aerospace Center (DLR) & University of Cologne | ||