Leveraging Natural Language Processing and Data Mining to Augment and Validate APIs
APIs are increasingly prominent in modern web applications and software, allowing millions of users around the world to access data. Reducing the risk of API defects, and consequently of failures, is key, notably for security, availability, and maintainability. Documenting an API is crucial, as it helps users understand the API. Moreover, API testing techniques often require formal documentation as input. However, documenting is a time-consuming and error-prone task that developers often overlook. Natural Language Processing (NLP) could assist API development, as recent Large Language Models (LLMs) have demonstrated exceptional abilities to automate tasks thanks to their colossal training data. Data mining could also be used to synthesize API information scattered across the web. Hence, I present my PhD project, which aims to explore the use of NLP-related technologies and data mining to augment and validate APIs. The research questions of this PhD project are: (1) What types of APIs can benefit from NLP and data mining assistance? (2) What API problems can be solved with such methods? (3) How effective are the methods and tools (e.g., LLMs) in assisting APIs? (4) How efficient are the methods in assisting APIs (i.e., in time and cost)?
Tue 17 Sep (displayed time zone: Amsterdam, Berlin, Bern, Rome, Stockholm, Vienna)
10:30 - 12:00
Time | Duration | Type | Title | Track | People
10:30 | 15m | Talk | Soft Verification for Actor Contract Systems | Doctoral Symposium | S: Bram Vandenbogaerde (Vrije Universiteit Brussel); P: Andreas Zeller (CISPA Helmholtz Center for Information Security); P: Stefan Brunthaler (μCSRL, CODE Research Institute, University of the Bundeswehr Munich)
10:45 | 10m | Talk | Integrating Mutation Techniques to Keep Specification and Source Code in Sync | Doctoral Symposium | S: Kerstin Jacob (University of Bamberg); P: Andreas Zeller (CISPA Helmholtz Center for Information Security); P: Stefan Brunthaler (μCSRL, CODE Research Institute, University of the Bundeswehr Munich)
10:55 | 10m | Talk | Leveraging Natural Language Processing and Data Mining to Augment and Validate APIs | Doctoral Symposium | S: Alix Decrop (University of Namur); P: Andreas Zeller (CISPA Helmholtz Center for Information Security); P: Stefan Brunthaler (μCSRL, CODE Research Institute, University of the Bundeswehr Munich)
11:05 | 15m | Talk | From Fault Injection to Formal Verification: A Holistic Approach to Fault Diagnosis in Cyber-Physical Systems | Doctoral Symposium | S: Drishti Yadav (Technische Universität Wien); P: Andreas Zeller (CISPA Helmholtz Center for Information Security); P: Stefan Brunthaler (μCSRL, CODE Research Institute, University of the Bundeswehr Munich)
11:20 | 15m | Talk | Robustness against the C/C++11 memory model | Doctoral Symposium | S: Roy Margalit (Tel Aviv University, Israel); P: Andreas Zeller (CISPA Helmholtz Center for Information Security); P: Stefan Brunthaler (μCSRL, CODE Research Institute, University of the Bundeswehr Munich)
11:35 | 10m | Talk | Automated Testing of Networked Systems Reliability | Doctoral Symposium | S: Michal Rozsíval (Faculty of Information Technology, Brno University of Technology); P: Andreas Zeller (CISPA Helmholtz Center for Information Security); P: Stefan Brunthaler (μCSRL, CODE Research Institute, University of the Bundeswehr Munich)
11:45 | 10m | Talk | Search-Based Translations for Tensor Operations | Doctoral Symposium |