Automated Program Repair, What Is It Good For? Not Absolutely Nothing! (ICSE 2024 - Artifact Evaluation)

Fri 12 - Sun 21 April 2024 Lisbon, Portugal

Who

Hadeel Eladawy, Claire Le Goues, Yuriy Brun

Track

ICSE 2024 Artifact Evaluation

Abstract

Industrial deployments of automated program repair (APR), e.g., at Facebook and Bloomberg, signal a new milestone for this exciting and potentially impactful technology. In these deployments, developers use APR-generated patch suggestions as part of a human-driven debugging process. Unfortunately, little is known about how using patch suggestions affects developers during debugging. This paper conducts a controlled user study with 40 developers with a median of 6 years of experience. The developers engage in debugging tasks on nine naturally occurring defects in real-world, open-source Java projects, using Recoder, SimFix, and TBar, three state-of-the-art APR tools. For each debugging task, the developers have access either to the project’s tests alone or, additionally, to a code suggestion that makes all the tests pass. These suggestions are either developer-written or APR-generated; APR-generated suggestions can be correct or deceptive. Deceptive suggestions, which are a common APR occurrence, make all the available tests pass but fail to generalize to the intended specification. Through a total of 160 debugging sessions, we find that access to a code suggestion significantly increases the odds of submitting a patch. Correct APR suggestions increase the odds of debugging success by 14,000%, but deceptive suggestions decrease the odds of success by 65%. Correct suggestions also speed up debugging. Surprisingly, we observe no significant difference in how novice and experienced developers are affected by APR, suggesting that APR may find uses across the experience spectrum. Overall, developers come away with a strong positive impression of APR, suggesting promise for APR-mediated, human-driven debugging, despite existing challenges in APR-generated repair quality.
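The percentages in the abstract describe changes in odds, not in probability, which is easy to misread. As a rough illustration only, the Python sketch below converts a percent change in odds into a multiplicative odds ratio and applies it to a baseline success probability. The function names and the 20% baseline are hypothetical and do not come from the paper; the abstract does not describe the statistical model behind the reported figures.

```python
def odds_multiplier(percent_change: float) -> float:
    """Convert a percent change in odds into a multiplicative odds ratio."""
    return 1.0 + percent_change / 100.0


def apply_to_probability(baseline_probability: float, multiplier: float) -> float:
    """Scale the odds implied by a baseline probability, then convert back."""
    odds = baseline_probability / (1.0 - baseline_probability)
    new_odds = odds * multiplier
    return new_odds / (1.0 + new_odds)


# Figures quoted in the abstract: +14,000% and -65% change in odds.
correct_ratio = odds_multiplier(14_000)  # odds multiplied by 141
deceptive_ratio = odds_multiplier(-65)   # odds multiplied by 0.35

# Hypothetical baseline success probability, NOT taken from the paper.
baseline = 0.20

with_correct = apply_to_probability(baseline, correct_ratio)
with_deceptive = apply_to_probability(baseline, deceptive_ratio)

print(f"odds ratio (correct suggestion):   {correct_ratio:.2f}")
print(f"odds ratio (deceptive suggestion): {deceptive_ratio:.2f}")
print(f"success probability with a correct suggestion:   {with_correct:.3f}")
print(f"success probability with a deceptive suggestion: {with_deceptive:.3f}")
```

Because odds are p / (1 - p), a 141-fold increase in odds does not mean a 141-fold increase in the probability of success; the effect on the probability depends on the baseline, which is why the sketch takes it as an explicit input.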

The replication artifact is available at http://doi.org/10.17605/OSF.IO/9JZHR

Link to Publication

https://doi.org/10.1145/3597503.3639095

Link to Preprint

https://people.cs.umass.edu/~brun/pubs/pubs/Eladawy24icse.pdf

Hadeel Eladawy

University of Massachusetts

Claire Le Goues

Carnegie Mellon University

Yuriy Brun

University of Massachusetts

United States
