Seven Failure Points When Engineering a Retrieval Augmented Generation System (CAIN 2024 - Research and Experience Papers)

Who

Scott Barnett, Stefanus Kurniawan, Srikanth Thudumu, Zach Brannelly, Mohamed Abdelrazek

Track

CAIN 2024 Research and Experience Papers

Time Zone

The program is currently displayed in (GMT+01:00) Lisbon.

The GMT offsets shown reflect the offsets at the moment of the conference.

When

Mon 15 Apr 2024 16:10 - 16:20 at Pequeno Auditório - System Qualities Chair(s): Andrei Paleyes

Abstract

Software engineers are increasingly adding semantic search capabilities to applications using a strategy known as Retrieval Augmented Generation (RAG). A RAG system finds documents that semantically match a query and then passes those documents to a large language model (LLM) such as ChatGPT to extract the right answer. RAG systems aim to: a) reduce the problem of hallucinated responses from LLMs, b) link sources/references to generated responses, and c) remove the need for annotating documents with metadata. However, RAG systems suffer from limitations inherent to information retrieval systems and from reliance on LLMs. In this paper, we present an experience report on the failure points of RAG systems, drawn from three case studies in separate domains: research, education, and biomedical. We share the lessons learned and present seven failure points to consider when designing a RAG system. The two key takeaways arising from our work are: 1) validation of a RAG system is only feasible during operation, and 2) the robustness of a RAG system evolves rather than being designed in at the start. We conclude with a list of potential research directions on RAG systems for the software engineering community.
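
The retrieve-then-generate flow described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the word-count "embedding", the function names (embed, cosine, retrieve, build_prompt), the prompt wording, and the sample documents are all assumptions made for illustration. A real system would likely use a learned embedding model and send the assembled prompt to an LLM, a step that is deliberately left out here.

```python
import math
import re
from collections import Counter


def embed(text):
    """Toy 'embedding': lowercase word counts (a stand-in for a real embedding model)."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(
        sum(v * v for v in b.values())
    )
    return dot / norm if norm else 0.0


def retrieve(query, documents, top_k=2):
    """Return the top_k documents ranked by similarity to the query."""
    q = embed(query)
    ranked = sorted(documents, key=lambda d: cosine(q, embed(d)), reverse=True)
    return ranked[:top_k]


def build_prompt(query, retrieved):
    """Assemble the sources-plus-question prompt that would be sent to an LLM."""
    context = "\n".join(f"[{i + 1}] {doc}" for i, doc in enumerate(retrieved))
    return (
        "Answer using only the numbered sources below and cite them.\n"
        f"Sources:\n{context}\n"
        f"Question: {query}"
    )


# Hypothetical document collection and query, for illustration only.
DOCS = [
    "The library opens at nine on weekdays.",
    "Parking permits are issued by the campus security office.",
    "The library closes at five on weekends.",
]
QUERY = "When does the library open on weekdays?"

top = retrieve(QUERY, DOCS)
prompt = build_prompt(QUERY, top)
print(prompt)
```

Numbering the retrieved sources in the prompt is one simple way to let a generated answer point back to its references, which corresponds to aim b) in the abstract.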

Scott Barnett

Applied Artificial Intelligence Institute, Deakin University

Stefanus Kurniawan

Deakin University

Srikanth Thudumu

Deakin University

Zach Brannelly

Deakin University

Mohamed Abdelrazek

Deakin University, Australia

Australia

Session Program

Mon 15 Apr
Displayed time zone: Lisbon

16:00 - 18:00	System Qualities (Research and Experience Papers / Industry Talks) at Pequeno Auditório Chair(s): Andrei Paleyes Department of Computer Science and Technology, University of Cambridge

16:00 10m Talk		Modeling Resilience of Collaborative AI Systems Research and Experience Papers Diaeddin Rimawi Free University of Bozen-Bolzano, Antonio Liotta Free University of Bozen-Bolzano, Marco Todescato Fraunhofer Italia, Barbara Russo
16:10 10m Talk		Seven Failure Points When Engineering a Retrieval Augmented Generation System Research and Experience Papers Scott Barnett Applied Artificial Intelligence Institute, Deakin University, Stefanus Kurniawan Deakin University, Srikanth Thudumu Deakin University, Zach Brannelly Deakin University, Mohamed Abdelrazek Deakin University, Australia
16:20 15m Talk		POLARIS: A framework to guide the development of Trustworthy AI systems Research and Experience Papers Maria Teresa Baldassarre Department of Computer Science, University of Bari, Domenico Gigante SER&Practices and University of Bari, Marcos Kalinowski Pontifical Catholic University of Rio de Janeiro (PUC-Rio), Azzurra Ragone University of Bari
16:35 15m Talk		Worst-Case Convergence Time of ML Algorithms via Extreme Value Theory Research and Experience Papers A: Saeid Tizpaz-Niari University of Texas at El Paso, A: Sriram Sankaranarayanan University of Colorado, Boulder
16:50 15m Talk		Is Your Anomaly Detector Ready for Change? Adapting AIOps Solutions to the Real World Research and Experience Papers Lorena Poenaru-Olaru TU Delft, Natalia Karpova TU Delft, Luís Cruz Delft University of Technology, Jan S. Rellermeyer Leibniz University Hannover, Arie van Deursen Delft University of Technology
17:05 15m Talk		Novel Contract-based Runtime Explainability Framework for End-to-End Ensemble Machine Learning Serving Research and Experience Papers Minh-Tri Nguyen Aalto University, Hong-Linh Truong Aalto University, Tram Truong-Huu Singapore Institute of Technology
17:20 10m Industry talk		Trustworthy AI: Industry-Guided Tooling of the Methods Industry Talks Zakaria Chihani CEA, LIST, France
17:30 15m Live Q&A		System Qualities: Q&A Session Research and Experience Papers
17:45 15m Day closing		Closing Research and Experience Papers Jan Bosch Chalmers University of Technology