ICSE 2025
Sat 26 April - Sun 4 May 2025 Ottawa, Ontario, Canada
Tue 29 Apr 2025 14:30 - 15:30 at 104 - Paper Presentations 3 and Tutorial 2 Chair(s): Matteo Biagiola

Testing the Evilness of Large Language Models

Large Language Models (LLMs) are becoming an integral part of our daily lives. But what if they provide dangerous advice—like instructions on poisoning a neighbor? Or if they make wrong assumptions that influence real-world decisions, such as recommending men for leadership roles while relegating women to supportive positions? At first glance, LLMs often appear polite and helpful… but can we uncover their hidden “evilness”?

In this tutorial, we will explore practical techniques and tools for testing what we refer to as the evilness of LLMs. Specifically, we will focus on two critical aspects: safety and bias. We will start by introducing the key concepts behind these issues, explaining why they matter and how they manifest in LLM behavior. Then, through hands-on exercises, we will demonstrate how to systematically test LLM safety using our tool ASTRAL, followed by an interactive session on detecting and analyzing bias with our tool suite Meta-Fair.

By the end of the tutorial, participants will be equipped with practical skills and tools to automatically test and evaluate the evilness of LLMs.

Tue 29 Apr

Displayed time zone: Eastern Time (US & Canada) change

14:00 - 15:30
Paper Presentations 3 and Tutorial 2SBFT at 104
Chair(s): Matteo Biagiola Università della Svizzera italiana
14:00
15m
Paper
AutoStub: Genetic Programming-Based Stub Creation for Symbolic Execution
SBFT
Felix Mächtle University of Luebeck, Nils Loose University of Luebeck, Jan-Niclas Serr University of Luebeck, Jonas Sander University of Luebeck, Thomas Eisenbarth University of Lübeck
14:15
15m
Research paper
Mimicry-Based Testing of Runtime SQLi Prevention Approaches
SBFT
Anjana Perera Oracle Labs, Australia, François Gauthier Oracle Labs, Kostyantyn Vorobyov Oracle Labs, Matthew Harris Oracle Labs, Paddy Krishnan Oracle Labs, Australia
14:30
60m
Tutorial
Tutorial by Miguel Romero-Arjona and Aitor Arrieta
SBFT
Miguel Romero-Arjona University of Seville, Aitor Arrieta Mondragon University