ChatGPT and Software Testing Education: Promises & Perils (TestEd 2023)

Who

Sajed Jalil, Suzzana Rafi, Thomas LaToza, Kevin Moran, Wing Lam

Track

TestEd 2023

Time Zone

The program is currently displayed in (GMT+01:00) Dublin.

Use conference time zone: (GMT+01:00) DublinSelect other time zone

The GMT offsets shown reflect the offsets at the moment of the conference.

Time Band

By setting a time band, the program will dim events that are outside this time window. This is useful for (virtual) conferences with a continuous program (with repeated sessions).
The time band will also limit the events that are included in the personal iCalendar subscription service.

Display full programSpecify a time band

Save

When

Sun 16 Apr 2023 16:20 - 16:40 at Hanover - Session 3 Chair(s): Lin Deng

Abstract

Over the past decade, predictive language modeling for code has proven to be a valuable tool for enabling new forms of automation for developers. More recently, we have seen the advent of general purpose “large language models”, based on neural transformer architectures, that have been trained on massive datasets of human written text, which includes code and natural language. However, despite the demonstrated representational power of such models, interacting with them has historically been constrained to specific task settings, limiting their general applicability. Many of these limitations were recently overcome with the introduction of ChatGPT, a language model created by OpenAI and trained to operate as a conversational agent, enabling it to answer questions and respond to a wide variety of commands from end users.

The introduction of models, such as ChatGPT, has already spurred fervent discussion from educators, ranging from fear that students could use these AI tools to circumvent learning, to excitement about the new types of learning opportunities that they might unlock. However, given the nascent nature of these tools, we currently lack fundamental knowledge related to how well they perform in different educational settings, and the potential promise (or danger) that they might pose to traditional forms of instruction. As such, in this paper, we examine how well ChatGPT performs when tasked with answering practice questions in a popular software testing curriculum. We found that given its current capabilities, ChatGPT is able to respond to 77.5% of the questions we examined and that, of these questions, it is able to provide correct or partially correct answers in 55.6% of cases, provide correct or partially correct explanations of answers in 53.0% of cases, and that prompting the tool in a shared question context leads to a marginally higher rate of correct answers and explanations. Based on these findings, we discuss the potential promises and perils related to the use of ChatGPT by students and instructors.

Link to Preprint

https://arxiv.org/abs/2302.03287

Sajed JalilAuthor

George Mason University

United States

Suzzana RafiAuthor

Thomas LaTozaAuthor

George Mason University

United States

Kevin MoranAuthor

George Mason University

United States

Wing LamAuthor

George Mason University

Time Zone

The program is currently displayed in (GMT+01:00) Dublin.

Use conference time zone: (GMT+01:00) DublinSelect other time zone

The GMT offsets shown reflect the offsets at the moment of the conference.

Time Band

Display full programSpecify a time band

Save

Session Program

Sun 16 Apr
Displayed time zone: Dublin change

16:00 - 17:30	Session 3TestEd at Hanover Chair(s): Lin Deng Towson University

16:00 10m Paper		Intracompany Training in Software Testing: Experience Report TestEd A: Iosif Itkin Exactpro Systems, A: Natia Sirbiladze Exactpro Systems, A: Elena Treshcheva Exactpro Systems, A: Rostislav Yavorskiy Exactpro Systems
16:10 10m Paper		Code Critters: A Block-Based Testing Game TestEd A: Philipp Straubinger University of Passau, A: Laura Caspari University of Passau, A: Gordon Fraser University of Passau Pre-print
16:20 20m Paper		ChatGPT and Software Testing Education: Promises & Perils TestEd A: Sajed Jalil George Mason University, A: Suzzana Rafi , A: Thomas LaToza George Mason University, A: Kevin Moran George Mason University, A: Wing Lam George Mason University Pre-print
16:40 50m Panel		Panel Discussion on ChatGPT TestEd