Abstract
Collecting statistics is a time- and resource-consuming operation in distributed database systems. It is even more challenging to efficiently collect statistics without affecting system performance, meanwhile keeping correctness in a distributed environment. Traditional strategies usually consider one dimension during collecting statistics, which is lack of generalization. In this paper, we propose a new statistics collecting method with adaptive strategy (APCS), which well leverages collecting efficiency, correctness of statistics and effect to system performance. APCS picks appropriate time to trigger collecting action and filter unnecessary tasks, meanwhile reasonably allocates collecting tasks to appropriate executing locations with right executing model.
Supported by Key Research and Development Program of China (2018YFB1003403), National Natural Science Foundation of China (61732014,61672432,61672434) and Natural Science Basic Research Plan in Shaanxi Province of China (No. 2017JM6104).
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Harmouch, H., Naumann, F.: Cardinality estimation: an experimental survey. Proc. VLDB Endow. 11(4), 499–512 (2018)
Woodruff, D.P., Zhang, Q.: Distributed statistical estimation of matrix products with applications (2018)
Chen, J., Jindel, S., Walzer, R., et al.: The MemSQL query optimizer. Proc. VLDB Endow. 9(13), 1401–1412 (2016)
Soliman, M.A., Antova, L., Raghavan, V., et al.: Orca: a modular query optimizer architecture for big data. In: Proceedings of the 2014 ACM SIGMOD International Conference on Management of Data, pp. 337–348. ACM (2014)
Shankar, S., Nehme, R., Aguilar-Saborit, J., et al.: Query optimization in Microsoft SQL server PDW. In: Proceedings of the 2012 ACM SIGMOD International Conference on Management of Data, pp. 767–776. ACM (2012)
Grohe, M., Schweikardt, N.: First-order query evaluation with cardinality conditions (2017)
Müller, M., Moerkotte, G., Kolb, O.: Improved selectivity estimation by combining knowledge from sampling and synopses. Proc. VLDB Endow. 11(9), 1016–1028 (2018)
Chakkappen, S., Cruanes, T., Dageville, B., et al.: Efficient and scalable statistics gathering for large databases in Oracle 11g. In: ACM SIGMOD International Conference on Management of Data, DBLP (2008)
Chakkappen, S., Budalakoti, S., Krishnamachari, R., et al.: Adaptive statistics in Oracle 12c. Proc. VLDB Endow. 10(12), 1813–1824 (2017)
Macke, S., Zhang, Y., Huang, S., et al.: Adaptive sampling for rapidly matching histograms. Proc. VLDB Endow. 11(10), 1262–1275 (2017)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer Nature Switzerland AG
About this paper
Cite this paper
Gao, JT., Liu, WJ., Li, ZH., Du, HT., Pei, OY. (2019). A New Statistics Collecting Method with Adaptive Strategy. In: Li, G., Yang, J., Gama, J., Natwichai, J., Tong, Y. (eds) Database Systems for Advanced Applications. DASFAA 2019. Lecture Notes in Computer Science(), vol 11448. Springer, Cham. https://doi.org/10.1007/978-3-030-18590-9_59
Download citation
DOI: https://doi.org/10.1007/978-3-030-18590-9_59
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-18589-3
Online ISBN: 978-3-030-18590-9
eBook Packages: Computer ScienceComputer Science (R0)