Hulk: Exploring Data-Sensitive Performance Anomalies in DBMSs via Data-Driven Analysis
Performance is crucial for database management systems (DBMSs), and they are always designed to handle ever-changing workloads efficiently. However, the complexity of the cost-based optimizer (CBO) and its interactions can introduce implementation errors, leading to data-sensitive performance anomalies. These anomalies may cause significant performance degradation compared to the expected design under certain datasets. To diagnose performance issues, DBMS developers often rely on intuitions or compare execution times to a baseline DBMS. These approaches overlook the impact of datasets on performance. As a result, only a subset of performance issues is identified and resolved.
In this paper, we propose HULK to automatically explore these data-sensitive performance anomalies via data-driven analysis. The key idea is to identify performance anomalies as the dataset evolves. Specifically, HULK estimates a reasonable response time range for each data volume to pinpoint performance cliffs. Then performance cliffs are checked for deviations from expected performance by finding a reasonable plan that aligns with performance expectations. We evaluate HULK on six widely-used DBMSs, namely MySQL, MariaDB, Percona, TiDB, PostgreSQL, and Antdb. \tool{} reports 135 anomalies, with 129 have been confirmed as new bugs, including 14 CVEs. Among them, 94 are data-sensitive performance bugs.
Wed 25 JunDisplayed time zone: Amsterdam, Berlin, Bern, Rome, Stockholm, Vienna change
11:00 - 12:15 | Systems and PerformanceResearch Papers at Aurora C Chair(s): Lars Grunske Humboldt-Universität zu Berlin | ||
11:00 25mTalk | Preventing Disruption of System Backup Against Ransomware Attacks Research Papers Yiwei Hou Tsinghua University, Lihua Guo Tsinghua University, Chijin Zhou Tsinghua University, Quan Zhang Tsinghua University, Wenhuan Liu Tsinghua University/Union Tech, Chengnian Sun University of Waterloo, Yu Jiang Tsinghua University DOI | ||
11:25 25mTalk | Rethinking Performance Analysis for Configurable Software Systems: A Case Study from a Fitness Landscape Perspective Research Papers Mingyu Huang University of Electronic Science and Technology of China, Peili Mao University of Electronic Science and Technology of China, Ke Li University of Exeter DOI | ||
11:50 25mTalk | Hulk: Exploring Data-Sensitive Performance Anomalies in DBMSs via Data-Driven Analysis Research Papers Zhiyong Wu Tsinghua University, China, Jie Liang , Jingzhou Fu School of Software, Tsinghua University, Mingzhe Wang Tsinghua University, Yu Jiang Tsinghua University DOI |
Aurora C is the third room in the Aurora wing.
When facing the main Cosmos Hall, access to the Aurora wing is on the right, close to the side entrance of the hotel.