This work explores the feasibility and challenges of using a Large Language Model (LLM) to automatically assess the quality of software architecture diagrams. Our approach is based on a structured prompt that guides the LLM to evaluate architecture diagrams and their accompanying descriptions against five core quality criteria: clarity, consistency, completeness, accuracy, and level of detail. Preliminary experimental results obtained with OpenAI’s ChatGPT-4o on four open-source projects suggest that LLMs can provide valuable feedback and detect diagrammatic inconsistencies, often in alignment with human expert evaluations. However, the LLM also struggled with context-specific design choices, sometimes misjudging deliberate omissions or the appropriate level of detail, indicating that human oversight remains indispensable. To guide researchers and practitioners, we further propose practical guidelines for data preparation, prompt construction, and result interpretation, aiming to maximize the reliability and utility of LLM-based architectural evaluations.
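To make the overall idea concrete, the sketch below shows one possible way such a criteria-based assessment could be issued through the OpenAI Python client. It is not the prompt used in this work: the prompt wording, the `assess_diagram` helper, and the `gpt-4o` model identifier are illustrative assumptions.

```python
# Minimal sketch of an LLM-based diagram assessment, assuming the OpenAI
# Python SDK (v1) and a text description of the diagram as input.
from openai import OpenAI

# The five quality criteria named in the approach.
CRITERIA = ["clarity", "consistency", "completeness", "accuracy", "level of detail"]

def assess_diagram(diagram_description: str, client: OpenAI) -> str:
    """Ask the model to rate a diagram and its description on each criterion."""
    prompt = (
        "You are reviewing a software architecture diagram and its accompanying description.\n"
        "For each criterion below, give a rating from 1 to 5 and a brief justification:\n"
        + "\n".join(f"- {c}" for c in CRITERIA)
        + f"\n\nDiagram and description:\n{diagram_description}"
    )
    response = client.chat.completions.create(
        model="gpt-4o",  # assumed model identifier
        messages=[{"role": "user", "content": prompt}],
    )
    return response.choices[0].message.content

# Example usage (hypothetical input file):
# client = OpenAI()  # reads OPENAI_API_KEY from the environment
# print(assess_diagram(open("architecture_description.md").read(), client))
```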