Repairing DNN Architecture: Are We There Yet? (ICST 2023 - Research Papers)

Who

Jinhan Kim, Nargiz Humbatova, Gunel Jahangirova, Paolo Tonella, Shin Yoo

Track

ICST 2023 Research Papers

Time Zone

All times are shown in the conference time zone, (GMT+01:00) Dublin.

When

Mon 17 Apr 2023 17:20 - 17:40 at Grand canal - Session 5: Testing AI/ML systems. Chair(s): Jie M. Zhang

Abstract

As Deep Neural Networks (DNNs) are rapidly being adopted within large software systems, software developers are increasingly required to design, train, and deploy such models into the systems they develop. Consequently, testing and improving the robustness of these models have received a lot of attention lately. However, relatively little effort has been made to address the difficulties developers experience when designing and training such models: if the evaluation of a model shows poor performance after the initial training, what should the developer change? We survey and evaluate existing state-of-the-art techniques that can be used to repair model performance, using a benchmark of both real-world mistakes developers made while designing DNN models and artificial faulty models generated by mutating the model code. The empirical evaluation shows that a random baseline is comparable to, and sometimes outperforms, existing state-of-the-art techniques. However, for larger and more complicated models, all repair techniques fail to find fixes. Our findings call for further research to develop more sophisticated techniques for Deep Learning repair.

Link to Preprint

https://arxiv.org/pdf/2301.11568.pdf

Jinhan Kim

KAIST

South Korea

Nargiz Humbatova

USI Lugano

Switzerland

Gunel Jahangirova

King's College London

United Kingdom

Paolo Tonella