BenchCloud: A Platform for Scalable Performance Benchmarking (ASE 2024 - Tool Demonstrations)

Who

Dirk Beyer, Po-Chun Chien, Marek Jankola

Track

ASE 2024 Tool Demonstrations

Time Zone

The program is currently displayed in (GMT-07:00) Pacific Time (US & Canada).

Use conference time zone: (GMT-07:00) Pacific Time (US & Canada)Select other time zone

The GMT offsets shown reflect the offsets at the moment of the conference.

Time Band

By setting a time band, the program will dim events that are outside this time window. This is useful for (virtual) conferences with a continuous program (with repeated sessions).
The time band will also limit the events that are included in the personal iCalendar subscription service.

Display full programSpecify a time band

Save

When

Tue 29 Oct 2024 16:15 - 16:25 at Carr - Performance and load

Abstract

Performance evaluation is a crucial method for assessing automated-reasoning tools. Evaluating automated tools requires rigorous benchmarking to accurately measure resource consumption, including time and memory, which are essential for understanding the tools’ capabilities. BenchExec, a widely used benchmarking framework, reliably measures resource usage for tools executed locally on a single node. This paper describes BenchCloud, a solution for elastic and scalable job distribution across hundreds of nodes, enabling large-scale experiments on distributed and heterogeneous computing environments. BenchCloud seamlessly integrates with BenchExec, allowing BenchExec to delegate the actual execution to BenchCloud. The system has been employed in several prominent international competitions in automated reasoning, including SMT-COMP, SV-COMP, and Test-Comp, underscoring its importance in rigorous tool evaluation across various research domains. It helps to ensure both internal and external validity of the experimental results. This paper presents an overview of BenchCloud’s architecture and highlights its primary use cases in facilitating scalable benchmarking.

Link to Preprint

https://www.sosy-lab.org/research/pub/2024-ASE24.BenchCloud_A_Platform_for_Scalable_Performance_Benchmarking.pdf

DOI

https://doi.org/10.1145/3691620.3695358

Dirk Beyer

LMU Munich

Germany

Po-Chun Chien

LMU Munich

Marek Jankola

LMU Munich

Slovakia

Running System

BencCloud Release 1.1

Demonstration Video

Slides

Time Zone

The program is currently displayed in (GMT-07:00) Pacific Time (US & Canada).

Use conference time zone: (GMT-07:00) Pacific Time (US & Canada)Select other time zone

The GMT offsets shown reflect the offsets at the moment of the conference.

Time Band

Display full programSpecify a time band

Save

Session Program

Tue 29 Oct
Displayed time zone: Pacific Time (US & Canada) change

15:30 - 17:00	Performance and loadResearch Papers / Industry Showcase / NIER Track / Tool Demonstrations at Carr

15:30 15m Talk		AI-driven Java Performance Testing: Balancing Result Quality with Testing Time Research Papers Luca Traini University of L'Aquila, Federico Di Menna University of L'Aquila, Vittorio Cortellessa University of L'Aquila DOI Pre-print
15:45 15m Talk		MLOLET - Machine Learning Optimized Load and Endurance Testing: An industrial experience report Industry Showcase Arthur Vitui Concordia University, Tse-Hsun (Peter) Chen Concordia University
16:00 15m Talk		Dynamic Scoring Code Token Tree: A Novel Decoding Strategy for Generating High-Performance Code Research Papers Muzi Qu University of Chinese Academy of Sciences, Jie Liu Institute of Software, Chinese Academy of Sciences, Liangyi Kang Institute of Software, Chinese Academy of Sciences, Shuai Wang Institute of Software, Chinese Academy of Sciences, Dan Ye Institute of Software, Chinese Academy of Sciences, Tao Huang Institute of Software at Chinese Academy of Sciences
16:15 10m Talk		BenchCloud: A Platform for Scalable Performance Benchmarking Tool Demonstrations Dirk Beyer LMU Munich, Po-Chun Chien LMU Munich, Marek Jankola LMU Munich DOI Pre-print Media Attached
16:25 10m Talk		A Formal Treatment of Performance BugsRecorded Talk NIER Track Omar I. Al-Bataineh Gran Sasso Science Institute (GSSI)