Can ChatGPT emulate humans in software engineering surveys?
Context: There is a growing belief in the literature that large language models (LLMs), such as those underlying ChatGPT, can mimic human behavior in surveys. Gap: While the literature has shown promising results in the social sciences and market research, there is scant evidence of LLMs’ effectiveness in technical fields such as software engineering (SE). Objective: Inspired by previous work, this paper explores ChatGPT’s ability to replicate findings from prior software engineering research. Given the frequent use of surveys in this field, if LLMs can accurately emulate human responses, the technique could address common methodological challenges such as recruitment difficulties, representational shortcomings, and respondent fatigue. Method: We prompted ChatGPT to reflect the behavior of a ‘mega-persona’ representing the demographic distribution of interest. We replicated surveys published between 2019 and 2023 at leading SE conferences, examining ChatGPT’s proficiency in mimicking responses from diverse demographics. Results: Our findings reveal that ChatGPT can successfully replicate the outcomes of some studies, but in others its results were not significantly better than a random baseline. Conclusions: This reflection paper discusses the challenges and potential research opportunities in using LLMs to represent humans in software engineering surveys.
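The abstract does not reproduce the study’s prompts. As a rough illustration only, here is a minimal sketch of how such ‘mega-persona’ prompting might look with the OpenAI Python client; the model name, persona description, demographic fields, and prompt wording are all assumptions for illustration, not the authors’ actual setup.

```python
# Hypothetical sketch of "mega-persona" survey emulation. The persona text,
# demographic values, and model choice are illustrative assumptions, not the
# prompts or configuration used in the paper.
from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment

# A single "mega-persona" summarizing the demographic distribution of the
# original survey's respondent pool (the numbers below are made up).
MEGA_PERSONA = (
    "You represent 100 survey respondents: 70% professional developers and "
    "30% students; median 8 years of software engineering experience; "
    "respondents span North America, Europe, and South America."
)

def ask_survey_question(question: str, options: list[str]) -> str:
    """Ask the model to answer one survey item as the mega-persona,
    reporting a distribution of answers across the given options."""
    response = client.chat.completions.create(
        model="gpt-4",  # assumed model; the paper's choice may differ
        messages=[
            {"role": "system", "content": MEGA_PERSONA},
            {
                "role": "user",
                "content": (
                    f"Survey question: {question}\n"
                    f"Options: {', '.join(options)}\n"
                    "Report what percentage of the respondents you represent "
                    "would choose each option."
                ),
            },
        ],
    )
    return response.choices[0].message.content

print(ask_survey_question(
    "How often do you write unit tests for new code?",
    ["Always", "Often", "Sometimes", "Rarely", "Never"],
))
```

Asking for a full answer distribution in a single call mirrors the ‘mega-persona’ idea described in the abstract; the obvious alternative design would be to sample one simulated respondent per call and aggregate the answers afterwards.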
Thu 24 Oct (displayed time zone: Brussels, Copenhagen, Madrid, Paris)
14:00 - 15:30 | Empirical research methods and applications | ESEM Technical Papers / ESEM Emerging Results, Vision and Reflection Papers Track | Telensenyament (B3 Building - 1st Floor) | Chair(s): Valentina Lenarduzzi (University of Oulu)

14:00 (20m, Full paper) | Game Software Engineering: A Controlled Experiment Comparing Automated Content Generation Techniques | ESEM Technical Papers | Mar Zamorano López (University College London), África Domingo (Universidad San Jorge), Carlos Cetina (Universitat Politècnica de València, Spain), Federica Sarro (University College London)

14:20 (20m, Full paper) | Evaluating Software Modelling Recommendations: Towards Systematic Guidelines for Modelling | ESEM Technical Papers

14:40 (20m, Full paper) | What do we know about Hugging Face? A systematic literature review and quantitative validation of qualitative claims | ESEM Technical Papers | Jason Jones (Purdue University), Wenxin Jiang (Purdue University), Nicholas Synovic (Loyola University Chicago), George K. Thiruvathukal (Loyola University Chicago and Argonne National Laboratory), James C. Davis (Purdue University) | DOI and pre-print available

15:00 (15m, Vision and Emerging Results) | On the Creation of Representative Samples of Software Repositories | ESEM Emerging Results, Vision and Reflection Papers Track | June Gorostidi (IN3 - UOC), Adem Ait (University of Luxembourg), Jordi Cabot (Luxembourg Institute of Science and Technology), Javier Luis Cánovas Izquierdo (IN3 - UOC) | Pre-print available

15:15 (15m, Vision and Emerging Results) | Can ChatGPT emulate humans in software engineering surveys? | ESEM Emerging Results, Vision and Reflection Papers Track | Igor Steinmacher (Northern Arizona University), Jacob Mcauley Penney (NAU), Katia Romero Felizardo (UTFPR-CP), Alessandro Garcia (Pontifical Catholic University of Rio de Janeiro, PUC-Rio), Marco Gerosa (Northern Arizona University)