Selective Use of Yannakakis’ Algorithm for Consistent Performance Gains
D. Böhm, G. Gottlob, M. Lanzinger, D.M. Longo, C. Okulmus, R. Pichler, A. Selzer, in: Proceedings of the 28th International Workshop on Design, Optimization, Languages and Analytical Processing of Big Data (DOLAP 2026), Tampere, Finland, 2026.
Download (ext.)
Conference Paper
| English
Author
Böhm, Daniela;
Gottlob, Georg;
Lanzinger, Matthias;
Longo, Davide Mario;
Okulmus, CemLibreCat
;
Pichler, Reinhard;
Selzer, Alexander
Department
Abstract
Large intermediate results can cause join queries to run unexpectedly long. This problem is particularly common for analytical queries, which aggregate data over many tables to produce a comparatively small final output, and queries on graph data, where intermediate results blow up quickly. Recent work inspired by Yannakakis’ algorithm approaches this by modifying the query engine to avoid materializing unnecessary tuples. However, this requires significant changes to the core of the system, which is not feasible in many situations such as cloud environments or proprietary systems.
In this work, we propose a flexible approach for optimizing long-running join queries from the outside of the DBMS. Rewriting-based realizations of Yannakakis’ algorithm suffer from inherent overhead due to the creation of intermediate tables. Thus, we present an approach for detecting and targeting queries which would benefit from a Yannakakis-style optimization. We introduce a new benchmark combining 5 standard benchmarks and augmenting them with additional instances, which provides a sufficient size and diversity for a machine learning based solution. On PostgreSQL, DuckDB and SparkSQL, slowdowns on queries where the rewriting is counterproductive are mostly avoided, as opposed to a naïve application of the rewriting, and we observe significant improvements in end-to-end runtimes over standard query execution and unconditional rewriting.
Keywords
Publishing Year
Proceedings Title
Proceedings of the 28th International Workshop on Design, Optimization, Languages and Analytical Processing of Big Data (DOLAP 2026)
LibreCat-ID
Cite this
Böhm D, Gottlob G, Lanzinger M, et al. Selective Use of Yannakakis’ Algorithm for Consistent Performance Gains. In: Proceedings of the 28th International Workshop on Design, Optimization, Languages and Analytical Processing of Big Data (DOLAP 2026). ; 2026.
Böhm, D., Gottlob, G., Lanzinger, M., Longo, D. M., Okulmus, C., Pichler, R., & Selzer, A. (2026). Selective Use of Yannakakis’ Algorithm for Consistent Performance Gains. Proceedings of the 28th International Workshop on Design, Optimization, Languages and Analytical Processing of Big Data (DOLAP 2026).
@inproceedings{Böhm_Gottlob_Lanzinger_Longo_Okulmus_Pichler_Selzer_2026, place={Tampere, Finland}, title={Selective Use of Yannakakis’ Algorithm for Consistent Performance Gains}, booktitle={Proceedings of the 28th International Workshop on Design, Optimization, Languages and Analytical Processing of Big Data (DOLAP 2026)}, author={Böhm, Daniela and Gottlob, Georg and Lanzinger, Matthias and Longo, Davide Mario and Okulmus, Cem and Pichler, Reinhard and Selzer, Alexander}, year={2026} }
Böhm, Daniela, Georg Gottlob, Matthias Lanzinger, Davide Mario Longo, Cem Okulmus, Reinhard Pichler, and Alexander Selzer. “Selective Use of Yannakakis’ Algorithm for Consistent Performance Gains.” In Proceedings of the 28th International Workshop on Design, Optimization, Languages and Analytical Processing of Big Data (DOLAP 2026). Tampere, Finland, 2026.
D. Böhm et al., “Selective Use of Yannakakis’ Algorithm for Consistent Performance Gains,” 2026.
Böhm, Daniela, et al. “Selective Use of Yannakakis’ Algorithm for Consistent Performance Gains.” Proceedings of the 28th International Workshop on Design, Optimization, Languages and Analytical Processing of Big Data (DOLAP 2026), 2026.
All files available under the following license(s):
Copyright Statement:
This Item is protected by copyright and/or related rights. [...]
Link(s) to Main File(s)
Access Level
Closed Access
