When to Say What: Learning to Find Condition-Message Inconsistencies (ICSE 2023 - Technical Track) - ICSE 2023

Write a Blog >>

Sun 14 - Sat 20 May 2023 Melbourne, Australia

Who

Islem BOUZENIA, Michael Pradel

Track

ICSE 2023 Technical Track

Time Zone

The program is currently displayed in (GMT+10:00) Hobart.

Use conference time zone: (GMT+10:00) HobartSelect other time zone

The GMT offsets shown reflect the offsets at the moment of the conference.

Time Band

By setting a time band, the program will dim events that are outside this time window. This is useful for (virtual) conferences with a continuous program (with repeated sessions).
The time band will also limit the events that are included in the personal iCalendar subscription service.

Display full programSpecify a time band

When

Wed 17 May 2023 16:45 - 17:00 at Meeting Room 101 - Software logging Chair(s): Hongyu Zhang
Thu 18 May 2023 15:26 - 15:28 at Meeting Room 105 - Posters 2

Abstract

Programs often emit natural language messages, e.g., in logging statements or exceptions raised on unexpected paths. To be meaningful to users and developers, the message, i.e., \emph{what} to say, must be consistent with the condition under which it gets trigger, i.e., \emph{when} to say it. However, checking for inconsistencies between conditions and messages is challenging because the conditions are expressed in the logic of the programming language, while messages are informally expressed in natural language. This paper presents CMI-Finder, an approach for detecting \emph{condition-message inconsistencies}. CMI-Finder is based on a neural model that takes a condition and a message as its input and then predicts whether the two are consistent. To address the problem of obtaining realistic, diverse, and large-scale training data, we present six techniques to generate large numbers of inconsistent examples to learn from automatically. Moreover, we describe and compare three neural models, which are based on binary classification, triplet loss, and fine-tuning, respectively. Our evaluation applies the approach to 300K condition-message statements extracted from 42 million lines of Python code. The best model achieves a precision of 78% at a recall of 72% on a dataset of past bug fixes. Applying the approach to the newest versions of popular open-source projects reveals 50 previously unknown bugs, eight of which have been confirmed by the developers so far.

Link to Preprint

https://software-lab.org/publications/icse2023_CMI-Finder.pdf

Islem BOUZENIA

University of Stuttgart

Michael Pradel

University of Stuttgart

Germany

Time Zone

The program is currently displayed in (GMT+10:00) Hobart.

Use conference time zone: (GMT+10:00) HobartSelect other time zone

The GMT offsets shown reflect the offsets at the moment of the conference.

Time Band

By setting a time band, the program will dim events that are outside this time window. This is useful for (virtual) conferences with a continuous program (with repeated sessions).
The time band will also limit the events that are included in the personal iCalendar subscription service.

Display full programSpecify a time band

Session Program

Wed 17 May
Displayed time zone: Hobart change

	15:45 - 17:15	Software loggingTechnical Track at Meeting Room 101 Chair(s): Hongyu Zhang The University of Newcastle

	15:45 15m Talk		PILAR: Studying and Mitigating the Influence of Configurations on Log Parsing Technical Track Hetong Dai Concordia University, Yiming Tang Concordia University, Heng Li Polytechnique Montréal, Weiyi Shang University of Waterloo
	16:00 15m Talk		Did We Miss Something Important? Studying and Exploring Variable-Aware Log Abstraction Technical Track Zhenhao Li Concordia University, Chuan Luo Beihang University, Tse-Hsun (Peter) Chen Concordia University, Weiyi Shang University of Waterloo, Shilin He Microsoft Research, Qingwei Lin Microsoft Research, Dongmei Zhang Microsoft Research
	16:15 15m Talk		On the Temporal Relations between Logging and Code Technical Track Zishuo Ding Concordia University, Yiming Tang Concordia University, Yang Li Beijing University of Posts and Telecommunications, Heng Li Polytechnique Montréal, Weiyi Shang University of Waterloo Pre-print
	16:30 15m Talk		How Do Developers' Profiles and Experiences Influence their Logging Practices? An Empirical Study of Industrial Practitioners Technical Track Guoping Rong Nanjing University, shenghui gu Nanjing University, Haifeng Shen Australian Catholic University, He Zhang Nanjing University, Hongyu Kuang Nanjing University
	16:45 15m Talk		When to Say What: Learning to Find Condition-Message Inconsistencies Technical Track Islem BOUZENIA University of Stuttgart, Michael Pradel University of Stuttgart Pre-print
	17:00 15m Talk		A Semantic-aware Parsing Approach for Log Analytics Technical Track Yintong Huo The Chinese University of Hong Kong, Yuxin Su Sun Yat-sen University, Cheryl Lee The Chinese University of Hong Kong, Michael Lyu The Chinese University of Hong Kong Pre-print

Thu 18 May
Displayed time zone: Hobart change

	15:15 - 15:45	Posters 2SEIS - Software Engineering in Society / Technical Track / Posters / Showcase at Meeting Room 105

	15:15 2m Short-paper		Values@Runtime: An Adaptive Framework for Operationalising Values SEIS - Software Engineering in Society Amel Bennaceur The Open University, UK, Diane Hassett Lero, University of Limerick, Ireland, Bashar Nuseibeh The Open University, UK; Lero, University of Limerick, Ireland, Andrea Zisman The Open University, UK
	15:17 2m Talk		PExReport: Automatic Creation of Pruned Executable Cross-Project Failure Reports Technical Track Sunzhou Huang University of Texas at San Antonio, Xiaoyin Wang University of Texas at San Antonio Pre-print Media Attached
	15:19 2m Talk		Chronos: Time-Aware Zero-Shot Identification of Libraries from Vulnerability Reports Technical Track Yunbo Lyu Singapore Management University, Le-Cong Thanh The University of Melbourne, Hong Jin Kang UCLA, Ratnadira Widyasari Singapore Management University, Singapore, Zhipeng Zhao Singapore Management University, Xuan-Bach D. Le University of Melbourne, Ming Li Nanjing University, David Lo Singapore Management University Pre-print
	15:21 2m Short-paper		Contradicting Motivations in Civic Tech Software Development: Analysis of a Grassroots Project SEIS - Software Engineering in Society Antti Knutas LUT University, Dominik Siemon LUT University, Natasha Tylosky LUT University, Giovanni Maccani Ideas for Change Pre-print
	15:24 2m Talk		Usability-Oriented Design of Liquid Types for Java Technical Track Catarina Gamboa CMU and LASIGE, Paulo Canelas Carnegie Mellon University, Christopher Steven Timperley Carnegie Mellon University, Alcides Fonseca University of Lisbon DOI
	15:26 2m Talk		When to Say What: Learning to Find Condition-Message Inconsistencies Technical Track Islem BOUZENIA University of Stuttgart, Michael Pradel University of Stuttgart Pre-print
	15:31 2m Poster		GLAD: Neural Predicate Synthesis to Repair Omission Faults Posters Sungmin Kang KAIST, Shin Yoo KAIST
	15:33 2m Short-paper		Gender Representation Among Contributors to Open-Source Infrastructure - An Analysis of 20 Package Manager Ecosystems SEIS - Software Engineering in Society Huilian Sophie Qiu Carnegie Mellon University, Zihe H Zhao Rice University, Tielin Katy Yu Carnegie Mellon University, Justin Wang Carnegie Mellon University, Alexander Ma Carnegie Mellon University, Hongbo Fang Carnegie Mellon University, Laura Dabbish Carnegie Mellon University, Bogdan Vasilescu Carnegie Mellon University
	15:35 2m Short-paper		Software Engineering for Smart Things in Public Spaces: Initial Insights and Challenges SEIS - Software Engineering in Society Amna Batool Deakin University, Seng W.Loke Deakin University, Niroshinie Fernando Deakin University, Jonathan Kua Deakin University
	15:38 2m Short-paper		Workplace Discrimination in Software Engineering: Where We Stand Today SEIS - Software Engineering in Society Xin Zhao Seattle University, Riley Young Seattle University
	15:40 2m Talk		A GNN-based Recommender System to Assist the Specification of Metamodels and Models Showcase Juri Di Rocco University of L'Aquila, Claudio Di Sipio University of L'Aquila, Davide Di Ruscio University of L'Aquila, Phuong T. Nguyen University of L’Aquila
	15:42 2m Talk		ECSTATIC: An Extensible Framework for Testing and Debugging Configurable Static Analysis Technical Track Austin Mordahl University of Texas at Dallas, Zenong Zhang The University of Texas at Dallas, Dakota Soles The University of Texas at Dallas, Shiyi Wei University of Texas at Dallas Pre-print

:

:

:

: