Efficient Join Order Selection Learning with Graph-based Representation

Authors:

Xu Chen,

Kai Zheng

KDD '22: Proceedings of the 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining

Pages 97 - 107

https://doi.org/10.1145/3534678.3539303

Published: 14 August 2022

Abstract

Join order selection plays an important role in DBMS query optimizers. The goal is to find the join order with the minimum cost, a problem that is generally NP-hard because the search space grows exponentially. Recent studies use deep reinforcement learning (DRL) to generate better join plans than those produced by conventional query optimizers. However, DRL-based methods require time-consuming training, which makes them unsuitable for online applications that need frequent periodic re-training. In this paper, we propose a novel framework, efficient Join Order selection learninG with Graph-basEd Representation (JOGGER). We first construct a schema graph based on the primary-foreign key relationships, from which table representations are learned to capture the correlations between tables. The second component is the state representation, in which a graph convolutional network (GCN) encodes the query graph and a tailored-tree-based attention module encodes the join plan. To speed up the convergence of the DRL training process, we apply curriculum learning, in which queries are incrementally added to the training set according to their level of difficulty. We conduct extensive experiments on the JOB and TPC-H datasets, which demonstrate the effectiveness and efficiency of the proposed solutions.
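
The curriculum idea in the abstract, adding queries to the training set in order of difficulty, can be illustrated with a minimal sketch. This is not the authors' implementation. The abstract does not say how difficulty is measured, so the sketch assumes the number of joined tables as a proxy. The workload `QUERIES`, its table names, and the helpers `difficulty` and `curriculum_stages` are hypothetical.

```python
from typing import Dict, Iterator, List

# Hypothetical workload: query name -> tables it joins (illustrative names only).
QUERIES: Dict[str, List[str]] = {
    "q1": ["title", "movie_info"],
    "q2": ["title", "cast_info", "name"],
    "q3": ["title", "movie_companies", "company_name", "company_type"],
    "q4": ["title", "movie_keyword"],
}


def difficulty(tables: List[str]) -> int:
    """Assumed difficulty proxy: the number of tables the query joins."""
    return len(tables)


def curriculum_stages(queries: Dict[str, List[str]]) -> Iterator[List[str]]:
    """Yield a growing training set, one difficulty level at a time."""
    levels = sorted({difficulty(t) for t in queries.values()})
    training_set: List[str] = []
    for level in levels:
        # Keep the easier queries and add every query of the current level.
        training_set += sorted(
            q for q, t in queries.items() if difficulty(t) == level
        )
        yield list(training_set)


stages = list(curriculum_stages(QUERIES))
for i, stage in enumerate(stages, start=1):
    # A DRL agent would be trained on `stage` here before moving on.
    print(f"stage {i}: {stage}")
```

Each stage keeps the earlier, easier queries and adds the next difficulty level, so an agent trained stage by stage sees short join plans before long ones. A real schedule would also need a rule for when to advance to the next stage, which the abstract leaves unspecified.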
