ML4PL 2018
Mon 16 - Sat 21 July 2018 Amsterdam, Netherlands
co-located with ECOOP and ISSTA 2018
Wed 18 Jul 2018 11:40 - 12:00 at Hanoi - Real-World Benchmarking

In 2017, the Software Development Team at King’s College London performed a benchmarking experiment to compare the warmup time and peak performance of modern programming language Virtual Machines (VMs). The experiment was intended to be the most rigorous to date. Our results were both surprising and disappointing. Not only did few modern VMs achieve a steady state of peak performance when running well known benchmarks, but some even slowed down over time.

This talk focuses not on the results of our experiment, but on our experiences of developing the “Krun” benchmarking system and the statistical analyses we used to process our data. The talk will discuss the difficulties we encountered in eliminating confounding variables and will show you how to present performance results in the absence of steady states.

Whilst Krun enabled us to collect robust and accurate results for our experiment, it tends towards being overkill. Ideally we’d like to build a cut-down version of Krun, but this raises the question of “which of Krun’s features make the most difference to benchmarking quality?”.

Slides (benchwork.pdf)2.56MiB

Wed 18 Jul
Times are displayed in time zone: (GMT+02:00) Amsterdam, Berlin, Bern, Rome, Stockholm, Vienna change

11:00 - 12:30: BenchWork - Real-World Benchmarking at Hanoi
benchwork-2018-talks11:00 - 11:10
Karim AliUniversity of Alberta, Cristina CifuentesOracle Labs
benchwork-2018-talks11:10 - 11:40
File Attached
benchwork-2018-talks11:40 - 12:00
Edd BarrettKing's College London, Sarah MountKing's College London, Laurence TrattKing's College London
File Attached
benchwork-2018-talks12:00 - 12:30
Kevin AllixUniversity of Luxembourg