Comparing Programming Language Models for Design Pattern Recognition (ICSA 2024 - Workshops)

Tue 4 - Sat 8 June 2024 Hyderabad, Telangana, India

Who

Sushant Kumar Pandey, Miroslaw Staron, Jennifer Horkoff, Miroslaw Ochodek, Darko Durisic

Track

ICSA 2024 Workshops

Abstract

Design patterns (DPs) facilitate effective software architecture and design and must be maintained and enforced in existing complex software products, for example, automotive software. Implementing DPs in source code facilitates the development of high-quality software products with less effort. However, recognizing DPs in program code is challenging, and this makes it difficult to keep architectural evolution under control in large software products over time. As DPs are abstract solutions, the programs used to recognize them in source code have significant limitations. In this paper, we employ four programming language models based on Bidirectional Encoder Representations from Transformers (BERT) to study to which extent these models can recognize an exemplar DP, in this case, Singleton. We compare four language representation models – OpenAI CodeX, Facebook AI TransCoder, ACoRA/BERT, and CCFlex/bag-of-words, and compare the models’ rankings to a simple base metric. We found a discrepancy between models in identifying Singletons and found that the models are inconsistently sensitive to name and semantic changes. Specifically, CodeX recognizes the existence of Singletons better than other models, while only ACoRA shows some signs of recognizing DP semantics.

Sushant Kumar PandeyAuthor

Chalmers and University of Gothenburg

Comparing Programming Language Models for Design Pattern Recognition

WASA 2024

Sushant Kumar PandeyAuthor

Chalmers and University of Gothenburg

Sweden

Miroslaw StaronAuthor

University of Gothenburg

Sweden

Jennifer HorkoffAuthor

Chalmers and the University of Gothenburg

Sweden

Miroslaw OchodekAuthor

Poznan University of Technology

Darko DurisicAuthor

Tracks