A New Statistics Collecting Method with Adaptive Strategy

Gao, Jin-Tao; Liu, Wen-Jie; Li, Zhan-Huai; Du, Hong-Tao; Pei, Ou-Ya

doi:10.1007/978-3-030-18590-9_59

Jin-Tao Gao¹⁹,
Wen-Jie Liu¹⁹,
Zhan-Huai Li¹⁹,
Hong-Tao Du¹⁹ &
…
Ou-Ya Pei¹⁹

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 11448))

Included in the following conference series:

International Conference on Database Systems for Advanced Applications

3507 Accesses

Abstract

Collecting statistics is a time- and resource-consuming operation in distributed database systems. It is even more challenging to efficiently collect statistics without affecting system performance, meanwhile keeping correctness in a distributed environment. Traditional strategies usually consider one dimension during collecting statistics, which is lack of generalization. In this paper, we propose a new statistics collecting method with adaptive strategy (APCS), which well leverages collecting efficiency, correctness of statistics and effect to system performance. APCS picks appropriate time to trigger collecting action and filter unnecessary tasks, meanwhile reasonably allocates collecting tasks to appropriate executing locations with right executing model.

Supported by Key Research and Development Program of China (2018YFB1003403), National Natural Science Foundation of China (61732014,61672432,61672434) and Natural Science Basic Research Plan in Shaanxi Province of China (No. 2017JM6104).

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 69.99; Price excludes VAT (USA)

Softcover Book: USD 89.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Harmouch, H., Naumann, F.: Cardinality estimation: an experimental survey. Proc. VLDB Endow. 11(4), 499–512 (2018)
Google Scholar
Woodruff, D.P., Zhang, Q.: Distributed statistical estimation of matrix products with applications (2018)
Google Scholar
Chen, J., Jindel, S., Walzer, R., et al.: The MemSQL query optimizer. Proc. VLDB Endow. 9(13), 1401–1412 (2016)
Google Scholar
Soliman, M.A., Antova, L., Raghavan, V., et al.: Orca: a modular query optimizer architecture for big data. In: Proceedings of the 2014 ACM SIGMOD International Conference on Management of Data, pp. 337–348. ACM (2014)
Google Scholar
Shankar, S., Nehme, R., Aguilar-Saborit, J., et al.: Query optimization in Microsoft SQL server PDW. In: Proceedings of the 2012 ACM SIGMOD International Conference on Management of Data, pp. 767–776. ACM (2012)
Google Scholar
Grohe, M., Schweikardt, N.: First-order query evaluation with cardinality conditions (2017)
Google Scholar
Müller, M., Moerkotte, G., Kolb, O.: Improved selectivity estimation by combining knowledge from sampling and synopses. Proc. VLDB Endow. 11(9), 1016–1028 (2018)
Google Scholar
Chakkappen, S., Cruanes, T., Dageville, B., et al.: Efficient and scalable statistics gathering for large databases in Oracle 11g. In: ACM SIGMOD International Conference on Management of Data, DBLP (2008)
Google Scholar
Chakkappen, S., Budalakoti, S., Krishnamachari, R., et al.: Adaptive statistics in Oracle 12c. Proc. VLDB Endow. 10(12), 1813–1824 (2017)
Google Scholar
Macke, S., Zhang, Y., Huang, S., et al.: Adaptive sampling for rapidly matching histograms. Proc. VLDB Endow. 11(10), 1262–1275 (2017)
Google Scholar

Download references

Author information

Authors and Affiliations

Northwestern Polytechnical University, Xi’an, 710072, China
Jin-Tao Gao, Wen-Jie Liu, Zhan-Huai Li, Hong-Tao Du & Ou-Ya Pei

Authors

Jin-Tao Gao
View author publications
You can also search for this author in PubMed Google Scholar
Wen-Jie Liu
View author publications
You can also search for this author in PubMed Google Scholar
Zhan-Huai Li
View author publications
You can also search for this author in PubMed Google Scholar
Hong-Tao Du
View author publications
You can also search for this author in PubMed Google Scholar
Ou-Ya Pei
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Jin-Tao Gao .

Editor information

Editors and Affiliations

Tsinghua University, Beijing, China
Guoliang Li
Duke University, Durham, NC, USA
Jun Yang
University of Porto, Porto, Portugal
Joao Gama
Chiang Mai University, Chiang Mai, Thailand
Juggapong Natwichai
Beihang University, Beijing, China
Yongxin Tong

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Gao, JT., Liu, WJ., Li, ZH., Du, HT., Pei, OY. (2019). A New Statistics Collecting Method with Adaptive Strategy. In: Li, G., Yang, J., Gama, J., Natwichai, J., Tong, Y. (eds) Database Systems for Advanced Applications. DASFAA 2019. Lecture Notes in Computer Science(), vol 11448. Springer, Cham. https://doi.org/10.1007/978-3-030-18590-9_59

Download citation

DOI: https://doi.org/10.1007/978-3-030-18590-9_59
Published: 24 April 2019
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-18589-3
Online ISBN: 978-3-030-18590-9
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics