The simplicity of Python and its rich set of libraries has made it the most popular language for data science. Moreover, the interpreted nature of Python offers an easy debugging experience for the developers. However, it comes with the price of poor performance compared to the compiled code. In this paper, we adopt and extend state-of-the-art research in query compilers to propose an efficient query engine embedded in Python. Our open-sourced framework enables the developers to do the debugging in Python, while being able to easily build a compiled version of the code for deployment. Our benchmark results on the entire set of TPC-H queries show that our approach covers different types of relational workloads and is competitive with state-of-the-art in-memory engines in both single- and multi-threaded settings.
Sun 26 FebDisplayed time zone: Eastern Time (US & Canada) change
10:20 - 11:20 | Domain Specific LanguagesResearch Papers at St. Laurent 3 Chair(s): Martin Kong The Ohio State University | ||
10:20 20mTalk | Building a Compiled Query Engine in Python Research Papers DOI | ||
10:40 20mTalk | Codon: A Compiler for High-Performance Pythonic Applications and DSLs Research Papers Ariya Shajii Exaloop, Gabriel Ramirez Massachusetts Institute of Technology, Haris Smajlović University of Victoria, Jessica Ray Massachusetts Institute of Technology, Bonnie Berger Massachusetts Institute of Technology, Saman Amarasinghe Massachusetts Institute of Technology, Ibrahim Numanagić University of Victoria DOI | ||
11:00 20mTalk | MOD2IR: High-Performance Code Generation for a Biophysically Detailed Neuronal Simulation DSL Research Papers George Mitenkov Imperial College London, Ioannis Magkanaris EPFL, Omar Awile EPFL, Pramod Kumbhar EPFL, Felix Schürmann EPFL, Alastair F. Donaldson Imperial College London DOI |