Software systems must comply with legal regulations, which is a resource-intensive task, particularly for small organizations and startups lacking dedicated legal expertise. Extracting metadata from regulations to elicit legal requirements for software is a critical step to ensure compliance. However, it is a cumbersome task due to the length and complex nature of legal text. Although prior work has pursued automated methods for extracting structural and semantic metadata from legal text, key limitations remain: they do not consider the interplay and interrelationships among attributes associated with these metadata types, and they rely on manual labeling or heuristic-driven machine learning, which does not generalize well to new documents. In this paper, we introduce an approach based on textual entailment and in-context learning for automatically generating a canonical representation of legal text—encodable and executable as Python code. Our representation is minimal yet sufficiently expressive to capture individual structural and semantic metadata attributes from legal text while preserving their interrelationships. This design choice reduces the need for large, manually labeled datasets and enhances applicability to unseen legislation. We evaluate our approach on 13 U.S. state data breach notification laws, demonstrating that our generated representations pass approximately 89.4% of test cases and achieve a precision and recall of 82.2 and 88.7 respectively.
Wed 3 SepDisplayed time zone: Brussels, Copenhagen, Madrid, Paris change
14:00 - 15:30 | LLMs for Requirements Elicitation and ExtractionResearch Papers at Salon de Actos Chair(s): Marc Oriol Universitat Politècnica de Catalunya | ||
14:00 30mPaper | LLMREI: Automating Requirements Elicitation Interviews with LLMs Research Papers Alexander Korn University of Duisburg-Essen, Smuel Gorsch University of Cologne, Andreas Vogelsang paluno – The Ruhr Institute for Software Technology, University of Duisburg-Essen Pre-print | ||
14:30 30mPaper | Requirements Elicitation Follow-up Question Generation Research Papers Anmol Singhal Carnegie Mellon University, Pittsburgh, Pennsylvania, United States, Yuchen Shen Carnegie Mellon University, Travis Breaux Carnegie Mellon University Pre-print | ||
15:00 30mPaper | Legal Requirements Translation from Law Research Papers Anmol Singhal Carnegie Mellon University, Pittsburgh, Pennsylvania, United States, Travis Breaux Carnegie Mellon University Pre-print | ||