"What makes my queries slow?": Subgroup Discovery for SQL Workload Analysis
Among daily tasks of database administrators (DBAs), the analysis of query workloads for identifying schema issues and improving performances is crucial. Although DBAs can easily identify queries that repeatedly cause performance issues, it remains challenging to automatically identify subsets of queries that share some properties only (a pattern) and foster at the same time some target measures, such as execution time. Patterns are defined on combinations of query clause, environment variables, database alerts and metrics and help answer questions like what makes SQL queries slow? What makes I/O communications high? Automatically discovering these patterns in a huge search space and providing them as hypotheses for helping DBAs to localize issues and root-causes is an actual problem for explainable AI. To tackle it, we introduce an original approach rooted on Subgroup Discovery. We show how to instantiate and develop this generic data-mining framework to identify potential causes of SQL workloads issues. We believe indeed that such data-mining technique is not trivial to apply for DBAs. As such, we also provide a visualization tool for interactive knowledge discovery. We analyse a one week workload from hundreds of databases from our company, make both the dataset and source code available, and experimentally show that insightful hypotheses can be discovered.
Wed 17 NovDisplayed time zone: Hobart change
22:00 - 23:00 | PerformanceResearch Papers / Journal-first Papers / Tool Demonstrations at Koala Chair(s): Ming Wen Huazhong University of Science and Technology | ||
22:00 20mTalk | "What makes my queries slow?": Subgroup Discovery for SQL Workload Analysis Research Papers Youcef Remil Infologic, INSA Lyon, Anes Bendimerad Infologic, Romain Mathonat Infologic, Philippe Chaleat Infologic, Mehdi Kaytoue INFOLOGIC | ||
22:20 20mTalk | AID: Efficient Prediction of Aggregated Intensity of Dependency in Large-scale Cloud Systems Research Papers Tianyi Yang The Chinese University of Hong Kong, Jiacheng Shen The Chinese University of Hong Kong, Yuxin Su The Chinese University of Hong Kong, Xiao Ling Huawei Technologies, Yongqiang Yang Huawei Technologies, Michael Lyu The Chinese University of Hong Kong | ||
22:40 10mTalk | Assessment of Off-the-Shelf SE-specific Sentiment Analysis Tools: An Extended Replication Study Journal-first Papers Nicole Novielli University of Bari, Fabio Calefato University of Bari, Filippo Lanubile University of Bari, Alexander Serebrenik Eindhoven University of Technology | ||
22:50 5mTalk | EvoMe: A Software Evolution Management Engine Based on Differential Factbase Tool Demonstrations Xiuheng Wu Nanyang Technological University, Mengyang Li Nanyang Technological University, Yi Li Nanyang Technological University Pre-print | ||
22:55 5mTalk | RefactorInsight: Enhancing IDE Representation of Changes in Git with Refactorings Information Tool Demonstrations Zarina Kurbatova JetBrains Research, Vladimir Kovalenko JetBrains Research, Ioana Savu Delft University of Technology, Bob Brockbernd Delft University of Technology, Dan Andreescu Delft University of Technology, Matei Anton Delft University of Technology, Roman Venediktov Higher School of Economics, Elena Tikhomirova JetBrains Research, Timofey Bryksin JetBrains Research; HSE University Pre-print |