Many organizations seek to ensure that machine learning (ML) and artificial intelligence (AI) systems work as intended in production but currently do not have a cohesive methodology in place to do so. To fill this gap, we propose MLTE (Machine Learning Test and Evaluation, colloquially referred to as “melt”), a framework and implementation to evaluate ML models and systems. The framework compiles state-of-the-art evaluation techniques into an organizational process for interdisciplinary teams, including model developers, software engineers, system owners, and other stakeholders. The MLTE tool supports this process by providing a domain-specific language that teams can use to express model requirements, an infrastructure to define, generate, and collect ML evaluation metrics, and the means to communicate results.
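To make the workflow the abstract describes concrete, the following minimal Python sketch shows how a team might express model requirements and validate collected evaluation metrics against them. It is a hypothetical illustration of the general idea, not MLTE's actual domain-specific language or API; all names (Requirement, evaluate, the example metrics and thresholds) are assumptions introduced here for illustration.

    from dataclasses import dataclass
    from typing import Callable, Dict, List

    @dataclass
    class Requirement:
        """A single model requirement: a named metric plus a pass/fail check."""
        name: str
        description: str
        check: Callable[[float], bool]

    def evaluate(requirements: List[Requirement],
                 measurements: Dict[str, float]) -> Dict[str, dict]:
        """Validate collected measurements against the stated requirements and
        return a result summary that can be communicated to stakeholders."""
        results = {}
        for req in requirements:
            value = measurements.get(req.name)
            passed = value is not None and req.check(value)
            results[req.name] = {
                "description": req.description,
                "measured": value,
                "status": "PASS" if passed else "FAIL",
            }
        return results

    if __name__ == "__main__":
        # Hypothetical requirements a team might state for a production model.
        requirements = [
            Requirement("accuracy",
                        "Test-set accuracy must be at least 0.90",
                        lambda v: v >= 0.90),
            Requirement("p99_latency_ms",
                        "99th-percentile inference latency under 200 ms",
                        lambda v: v <= 200.0),
        ]
        # In practice these values would be produced by the evaluation infrastructure.
        measurements = {"accuracy": 0.93, "p99_latency_ms": 180.0}
        for name, result in evaluate(requirements, measurements).items():
            print(f"{name}: {result['status']} (measured={result['measured']})")

In this sketch, each requirement pairs a metric name with a validation condition, and the evaluation step produces a structured pass/fail summary, mirroring the express-requirements, collect-metrics, communicate-results cycle described above.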